EMBL-EBI User Survey 2024

Do data resources managed by EMBL-EBI and our collaborators make a difference to your work?

Please take 10 minutes to fill in our annual user survey, and help us make the case for why sustaining open data resources is critical for life sciences research.

Survey link: https://www.surveymonkey.com/r/HJKYKTT?channel=[webpage]

Anopheles quadriannulatus (Mosquito, SANGWE) (AquaS1)

Anopheles quadriannulatus (Mosquito, SANGWE) Assembly and Gene Annotation

The Anopheles quadriannulatus data and its display on Ensembl Genomes are made possible through a joint effort by the Ensembl Genomes group and VectorBase, a component of VEuPathDB.

The assembly name may not match that from INSDC due to additional community contributions applied by VEuPathDB to the initial INSDC assembly (recorded by the assembly accession).

About Anopheles quadriannulatus

Anopheles quadriannulatus A belongs to the Anopheles gambiae species complex, which consists of at least seven species, it is found in southern Africa and is not considered to be a malaria vector.

Strain SANGWE (aka SANGQUA)

Originally isolated in South Africa the colony subsequently undergone rounds of isofemale selection prior to genome sequencing. The SANGWE strain is a synonym for the SANGQUA strain (this strain is available from BEI resources).

Source: VectorBase

Picture credit: James Gathany, CDC Public domain via Wikimedia Commons (Image source)

AquaS1 assembly

This assembly was generated using 101 bp paired-end Illumina HiSeq2000 reads generated from three libraries: a 180 bp insert 'fragment' library, a 1.5 kb 'jump' library, and a 38 kb 'fosill' library. Sequencing template for the fragment and jump libraries was derived from genomic DNA extracted from a single individual, which was preserved by freezing at -80C. Native genomic DNA was used for the fragment library and whole genome amplified DNA was used for the jump library. Template for the fosill library was generated from a pooled extraction of many individuals. Reads were assembled at the Broad Institute using the ALLPATHS LG algorithm, with the Haploidify option enabled to address high allelic heterozygosity in the template.

AquaS1.12 gene set

Community annotation patch build for July 2019

Statistics

Summary

AssemblyAquaS1, INSDC Assembly GCA_000349065.1, Mar 2013
Database version112.1
Golden Path Length283,828,998
Genebuild byVEuPathDB
Genebuild methodImport
Data sourceVectorBase

Gene counts

Coding genes13,436
Non coding genes404
Small non coding genes402
Long non coding genes2
Gene transcripts14,304

Other

Short Variants11,069,637