Drosophila melanogaster is a cosmopolitan species of fruitfly that has been used as a model organism for over a hundred years, particularly in genetics and developmental biology. It was the second metazoan (the first being Caenorhabditis elegans) to have its genome sequenced [1], and was one of 12 fruitfly genomes included in a large comparative study [2]. Ensembl Genomes imports data from FlyBase, which also provides much more information about the biology of Drosophila melanogaster, as well as a phylogeny of the 12 sequenced fruitfly species.

Picture credit (Creative Commons BY-NC-SA 2.0 FR): Nicolas Gompel 2008. Image shows a female fly.


Ensembl Metazoa uses the Berkeley Drosophila Genome Project (BDGP) assembly release 6 (July 2014). In contrast to release 5, regions without a chromosome assignment are stored as distinct scaffolds, rather than on a single pseudo-chromosome, and heterochromatin regions are incorporated in the chromosomal arms.


Protein-coding and RNA genes were imported from FlyBase, release dmel_r6.32 (FB2020_01).

Repeats were annotated with the Ensembl Genomes repeat feature pipeline. There are: 222,250 Low complexity (Dust) features, covering 8 Mb (5.5% of the genome); 42,298 RepeatMasker features (with a custom library), covering 20 Mb (13.7% of the genome); 41,694 RepeatMasker features (with the RepBase library), covering 26 Mb (17.8% of the genome); 72,600 Tandem repeats (TRF) features, covering 6 Mb (4.1% of the genome).
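As a rough cross-check of the percentages above, the coverage figures can be divided by the genome length. The sketch below assumes that the percentages are relative to the Golden Path Length of 143,726,002 bp given in the assembly statistics, and that 1 Mb means 1,000,000 bp. Both are assumptions, and the function and variable names are illustrative only. Because the Mb values in the text are rounded, the results only approximate the quoted percentages.

```python
# Assumption: percentages are computed against the Golden Path Length
# listed in the assembly statistics for this genome.
GENOME_LENGTH_BP = 143_726_002

# Rounded coverage figures (in Mb) as quoted in the repeat annotation text.
REPEAT_COVERAGE_MB = {
    "Low complexity (Dust)": 8,
    "RepeatMasker (custom library)": 20,
    "RepeatMasker (RepBase library)": 26,
    "Tandem repeats (TRF)": 6,
}


def coverage_percent(covered_mb, genome_length_bp=GENOME_LENGTH_BP):
    """Return the percentage of the genome covered, given coverage in Mb.

    Assumes 1 Mb = 1,000,000 bp.
    """
    return 100.0 * covered_mb * 1_000_000 / genome_length_bp


if __name__ == "__main__":
    for name, mb in REPEAT_COVERAGE_MB.items():
        # Values are approximate because the Mb inputs are rounded.
        print(f"{name}: ~{coverage_percent(mb):.1f}% of the genome")
```

The small differences between these estimates and the quoted percentages are consistent with the Mb figures having been rounded to whole numbers.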

Protein domains were annotated with the Ensembl Genomes protein feature pipeline.


Assembly: BDGP6.32, INSDC Assembly GCA_000001215.4
Database version: 103.9
Golden Path Length: 143,726,002
Genebuild by: FlyBase
Genebuild method: Import
Data source: FlyBase

Gene counts

Coding genes: 13,968
Non coding genes: 4,044
Small non coding genes: 4,044
Gene transcripts: 41,209

