Drosophila mojavensis (dmoj_caf1)

Drosophila mojavensis Assembly and Gene Annotation

About Drosophila mojavensis

Drosophila mojavensis is a cactophilic fruitfly from the southwestern United States and Mexico, and was one of 12 fruitfly genomes sequenced for a large comparative study [1]. Ensembl Genomes imports data from FlyBase, who also have more information about the biology of Drosophila mojavensis, and a phylogeny of the 12 sequenced fruitfly species.

Picture credit (Creative Commons BY-NC-SA 2.0 FR): Nicolas Gompel 2008. Image shows a female fly.

Assembly

This is the August 2005 genome assembly of Drosophila mojavensis. The 8.2x whole-genome shotgun sequencing was performed and assembly provided by Agencourt Bioscience.

Annotation

Protein-coding and RNA genes, which were annotated with the NCBI eukaryotic genome annotation pipeline, were imported from FlyBase, release dmoj_r1.04 (FB2017_04).

Repeats were annotated with the Ensembl Genomes repeat feature pipeline. There are: 462,080 Low complexity (Dust) features, covering 35 Mb (18.0% of the genome); 90,970 RepeatMasker features (with the RepBase library), covering 26 Mb (13.6% of the genome); 303,230 Tandem repeats (TRF) features, covering 25 Mb (12.8% of the genome).

Protein domains were annotated with the Ensembl Genomes protein feature pipeline.

EST Alignments

Drosophila mojavensis ESTs from dbEST were mapped onto the genome using Exonerate.

References

  1. Evolution of genes and genomes on the Drosophila phylogeny.
    Drosophila 12 Genomes Consortium, Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman TC, Kellis M, Gelbart W et al. 2007. Nature. 450:203-218.

More information

General information about this species can be found in Wikipedia.

Statistics

Summary

Assemblydmoj_caf1, INSDC Assembly GCA_000005175.1, Jul 2008
Database version98.1
Base Pairs180,207,831
Golden Path Length193,826,310
Genebuild byFlyBase
Genebuild methodImport
Data sourceFlyBase

Gene counts

Coding genes13,425
Non coding genes1,090
Small non coding genes1,090
Pseudogenes165
Gene transcripts21,837

About this species