What's New in Release 36
- New data
- Alignment of Rfam 12.2 covariance models for all species
- Automated RNA gene annotation for 37 species (i.e. all species that have not been imported from FlyBase, VectorBase or WormBase)
- Updated data
Release 37 of Ensembl Genomes is scheduled for 22nd August 2017. For more details, see the release intentions.
Automated RNA gene annotation
Most gene sets that are imported into Ensembl Metazoa do not have RNA gene annotation, and in these cases we perform automated annotation. For release 36 of Ensembl Metazoa RNA genes were annotated on all species except those imported from FlyBase, VectorBase, and WormBase. The RNA gene annotation uses three sources:
- miRBase provide genomic locations of precursor microRNA genes for a subset of species (Apis mellifera, Heliconius melpomene, Nasonia vitripennis, Strongylocentrotus purpuratus).
- tRNAscan-SE is used to predict tRNA genes (with a score threshold of 65).
- Rfam covariance model alignments are used to annotate RNA genes of all classes except tRNA. Models are restricted to those that are taxonomically appropriate, and are filtered to prevent overlapping genes (on either strand).
RNA genes have been updated for 26 species (21 of which had out-dated RNA gene annotation from earlier Ensembl Metazoa releases, the remaining 5 having partial annotation from the geneset provider); and added to 11 species which previously had no RNA gene annotation.
The chart below shows the numbers of RNA genes annotated, broken down by biotype, across all 37 species (click image for a larger version).
Ensembl Metazoa recieves funding from the Infravec2, to serve as a data delivery mechanism for that project. Infravec2 is an international and interdisciplinary research project on insect vectors of human and animal disease, including mosquitoes, sandflies and other flies. This project has received funding from the European Unions Horizon 2020 research and innovation programme under grant agreement No 731060.