Drosophila persimilis (Fruit fly, MSH-3) (dper_caf1)

Drosophila persimilis (Fruit fly, MSH-3) Assembly and Gene Annotation

About Drosophila persimilis

Drosophila persimilis is a sister species to D. pseudoobscura, and its genome was one of 12 fruit fly genomes sequenced for a large comparative study [1]. Ensembl Genomes imports its data from FlyBase, which also provides more information about the biology of Drosophila persimilis and a phylogeny of the 12 sequenced fruit fly species.

Picture credit (Creative Commons BY-NC-SA 2.0 FR): Nicolas Gompel 2008. Image shows a female fly.

Assembly

This is the October 2005 genome assembly of Drosophila persimilis. Whole-genome shotgun sequencing at 4.1x coverage was performed by the Broad Institute, which also provided the assembly.

Annotation

Protein-coding and RNA genes were imported from FlyBase, release dper_r1.3 (FB2017_04).

Repeats were annotated with the Ensembl Genomes repeat feature pipeline, which produced 311,606 low-complexity (Dust) features covering 23 Mb (12.1% of the genome); 67,484 RepeatMasker features (using the RepBase library) covering 37 Mb (19.8% of the genome); and 191,184 tandem repeat (TRF) features covering 12 Mb (6.3% of the genome).
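The percentages above can be approximately reproduced from the covered lengths and the Golden Path Length given in the Summary table. The following is a minimal sketch, not part of the Ensembl Genomes pipeline; the function and variable names are illustrative assumptions. Because the covered lengths are quoted only to the nearest Mb, the results differ from the reported percentages by a few tenths of a percentage point.

```python
# Golden Path Length in base pairs, taken from the Summary table on this page.
GENOME_LENGTH_BP = 188_374_079

# Covered lengths as quoted on this page, rounded to the nearest Mb.
# The dictionary name and keys are illustrative, not Ensembl identifiers.
REPEAT_COVERAGE_MB = {
    "Low complexity (Dust)": 23,
    "RepeatMasker (RepBase)": 37,
    "Tandem repeats (TRF)": 12,
}


def percent_of_genome(covered_mb, genome_length_bp=GENOME_LENGTH_BP):
    """Return a covered length in Mb as a percentage of the genome length."""
    return 100.0 * covered_mb * 1_000_000 / genome_length_bp


if __name__ == "__main__":
    for name, covered_mb in REPEAT_COVERAGE_MB.items():
        pct = percent_of_genome(covered_mb)
        print(f"{name}: {covered_mb} Mb is roughly {pct:.1f}% of the genome")
```

Features from the three methods may overlap one another, so the individual percentages should not be summed to estimate the total repeat content.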

Protein domains were annotated with the Ensembl Genomes protein feature pipeline.

References

  1. Evolution of genes and genomes on the Drosophila phylogeny.
    Drosophila 12 Genomes Consortium, Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman TC, Kellis M, Gelbart W et al. 2007. Nature. 450:203-218.

Statistics

Summary

Assembly: dper_caf1, INSDC Assembly GCA_000005195.1
Database version: 111.2
Golden Path Length: 188,374,079
Genebuild by: FlyBase
Genebuild method: Import
Data source: FlyBase

Gene counts

Coding genes: 16,874
Non coding genes: 779
Small non coding genes: 779
Pseudogenes: 1
Gene transcripts: 17,658