Drosophila ananassae (Fruit fly, TSC#14024-0371.13) Assembly and Gene Annotation
About Drosophila ananassae
Drosophila ananassae is a useful model organism for genetic studies because it is easily cultured in the laboratory, and was one of 12 fruitfly genomes sequenced for a large comparative study [1]. Ensembl Genomes imports data from FlyBase, who also have more information about the biology of Drosophila ananassae, and a phylogeny of the 12 sequenced fruitfly species.
Picture credit (Creative Commons BY-NC-SA 2.0 FR): Nicolas Gompel 2008. Image shows a female fly.
Assembly
This is the August 2005 genome assembly of Drosophila ananassae. The 8.9x whole-genome shotgun sequencing was performed and assembled by Agencourt Bioscience.
Annotation
Protein-coding and RNA genes, which were annotated with the NCBI eukaryotic genome annotation pipeline, were imported from FlyBase, release dana_r1.06 (FB2018_04).
Repeats were annotated with the Ensembl Genomes repeat feature pipeline. There are: 409,506 Low complexity (Dust) features, covering 30 Mb (13.0% of the genome); 122,490 RepeatMasker features (with the RepBase library), covering 64 Mb (27.5% of the genome); 173,334 Tandem repeats (TRF) features, covering 39 Mb (17.0% of the genome).
Protein domains were annotated with the Ensembl Genomes protein feature pipeline.
References
- Evolution of genes and genomes on the Drosophila
phylogeny.
Drosophila 12 Genomes Consortium, Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman TC, Kellis M, Gelbart W et al. 2007. Nature. 450:203-218.
Statistics
Summary
Assembly | dana_caf1, INSDC Assembly GCA_000005115.1, |
Database version | 113.2 |
Golden Path Length | 230,993,012 |
Genebuild by | FlyBase |
Genebuild method | Import |
Data source | FlyBase |
Gene counts
Coding genes | 14,365 |
Non coding genes | 1,248 |
Small non coding genes | 1,248 |
Pseudogenes | 274 |
Gene transcripts | 23,261 |