Drosophila willistoni (Fruit fly, TSC#14030-0811.24) (dwil_caf1)

Drosophila willistoni (Fruit fly, TSC#14030-0811.24) Assembly and Gene Annotation

About Drosophila willistoni

Drosophila willistoni is distributed across Central and South America, and its genome was one of 12 fruit fly genomes sequenced for a large comparative study [1]. Ensembl Genomes imports data from FlyBase, which also provides more information about the biology of Drosophila willistoni and a phylogeny of the 12 sequenced fruit fly species.

Picture credit (Creative Commons BY-NC-SA 2.0 FR): Nicolas Gompel 2008. Image shows a female fly.

Assembly

This is assembly 1.0 of the Drosophila willistoni genome. Whole-genome shotgun sequencing at 8.4x coverage was performed, and the assembly provided, by the J. Craig Venter Institute (JCVI).

Annotation

Protein-coding and RNA genes, which were annotated with the NCBI eukaryotic genome annotation pipeline, were imported from FlyBase, release dwil_r1.05 (FB2017_04).

Repeats were annotated with the Ensembl Genomes repeat feature pipeline. There are: 577,783 Low complexity (Dust) features, covering 31 Mb (13.1% of the genome); 111,787 RepeatMasker features (with the RepBase library), covering 51 Mb (21.7% of the genome); 226,941 Tandem repeats (TRF) features, covering 22 Mb (9.4% of the genome).
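The coverage percentages above can be roughly cross-checked against the Golden Path Length reported in the Statistics summary (235,516,348 bp). The following is a minimal sketch, assuming 1 Mb = 1,000,000 bp and that the golden path length is the denominator used by the pipeline; the names GOLDEN_PATH_LENGTH, REPEATS_MB and coverage_percent are illustrative only. Because the covered lengths are quoted in whole megabases, the recomputed values should match the quoted percentages only approximately.

```python
# Golden Path Length from the Statistics summary, in base pairs.
GOLDEN_PATH_LENGTH = 235_516_348

# Covered lengths as quoted on this page, rounded to whole megabases.
REPEATS_MB = {
    "Dust": 31,
    "RepeatMasker": 51,
    "TRF": 22,
}


def coverage_percent(covered_mb, genome_bp=GOLDEN_PATH_LENGTH):
    """Return the percentage of the genome covered by `covered_mb` megabases.

    Assumes 1 Mb = 1,000,000 bp (an assumption, not stated on the page).
    """
    return covered_mb * 1_000_000 / genome_bp * 100


coverage = {name: coverage_percent(mb) for name, mb in REPEATS_MB.items()}

for name, pct in coverage.items():
    print(f"{name}: {pct:.1f}% of the genome")
```

Small differences from the quoted figures are expected, since the page presumably computes percentages from exact base counts rather than from the rounded megabase values.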

Protein domains were annotated with the Ensembl Genomes protein feature pipeline.

References

  1. Evolution of genes and genomes on the Drosophila phylogeny.
    Drosophila 12 Genomes Consortium, Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman TC, Kellis M, Gelbart W et al. 2007. Nature. 450:203-218.

Statistics

Summary

Assembly: dwil_caf1, INSDC Assembly GCA_000005925.1
Database version: 113.2
Golden Path Length: 235,516,348
Genebuild by: FlyBase
Genebuild method: Import
Data source: FlyBase

Gene counts

Coding genes: 13,783
Non coding genes: 880
Small non coding genes: 880
Pseudogenes: 277
Gene transcripts: 16,363