The Zoonomia Project is comparing the genomes of diverse mammals to understand the basis of remarkable phenotypes and the origins of disease. In addition to the 131 new assemblies produced, we have generated three types of comparative data.
*Please note that these files are quite large and will occupy a large amount of computer memory.
To access genome assemblies used in our Zoonomia alignment, please visit the “mammalian tree” and click on your species of interest.
Our 240-species Cactus genome alignment (file format: .hal) was made without reference to any single genome. As a result, it includes both regions shared across eutherian mammals and regions unique to specific lineages.
TOGA annotation of orthologous genes
Annotation of genes across the 240 Zoonomia species, plus many more, inferred using TOGA (Tool to infer Orthologs from Genome Alignments). Files are available in GTF and BED12 formats, with annotations referenced on human, mouse, chicken, and several other species.
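For readers unfamiliar with BED12, the sketch below shows one way to turn a single BED12 record into absolute exon coordinates. It assumes the standard UCSC 12-column layout (tab-separated, block starts relative to the feature start). The `Bed12` type, the `parse_bed12_line` function, and the example record are hypothetical names and values chosen for illustration; they are not taken from the Zoonomia files.

```python
from typing import List, NamedTuple, Tuple


class Bed12(NamedTuple):
    """Subset of a BED12 record, with exons as absolute half-open intervals."""
    chrom: str
    start: int
    end: int
    name: str
    strand: str
    exons: List[Tuple[int, int]]


def parse_bed12_line(line: str) -> Bed12:
    """Parse one tab-separated BED12 line (standard UCSC column order assumed)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 12:
        raise ValueError(f"expected 12 columns, found {len(fields)}")

    chrom = fields[0]
    start, end = int(fields[1]), int(fields[2])
    name, strand = fields[3], fields[5]
    block_count = int(fields[9])

    # blockSizes and blockStarts are comma-separated and may end with a comma.
    sizes = [int(x) for x in fields[10].split(",") if x]
    offsets = [int(x) for x in fields[11].split(",") if x]
    if len(sizes) != block_count or len(offsets) != block_count:
        raise ValueError("blockCount does not match blockSizes/blockStarts")

    # Block starts are relative to the feature start; convert to absolute.
    exons = [(start + off, start + off + size) for off, size in zip(offsets, sizes)]
    return Bed12(chrom, start, end, name, strand, exons)


# Hypothetical record, for illustration only.
example = "chr1\t1000\t5000\tGENE1.1\t0\t+\t1200\t4800\t0\t3\t200,300,500,\t0,1500,3500,"
record = parse_bed12_line(example)
print(record.name, record.exons)
```

Coordinates are kept 0-based and half-open, as in BED, so they would need adjusting before comparison with the 1-based, closed coordinates used in GTF.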
Conservation scores calculated from the Zoonomia alignment identify sites and regions under purifying selection. With 240 species, we find 3.1% of sites in the human genome to be under purifying selection using PhyloP, with a false discovery rate threshold of 5%.
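To illustrate what a false discovery rate threshold means for per-site scores, here is a minimal sketch under stated assumptions. It assumes that scores follow the common phyloP convention in which a positive score is the -log10 p-value for conservation, and it uses the Benjamini-Hochberg procedure as one widely used way to control the FDR. The text above does not say which procedure the project used, so this is not a description of the Zoonomia pipeline. The function names and the toy scores are hypothetical.

```python
from typing import List, Sequence


def conservation_pvalue(score: float) -> float:
    """Convert a phyloP-style score to a one-sided conservation p-value.

    Assumption: positive scores equal -log10(p) for conservation; sites with
    non-positive scores are treated as p = 1 for this one-sided test.
    """
    return 10.0 ** (-score) if score > 0 else 1.0


def benjamini_hochberg(pvalues: Sequence[float], fdr: float = 0.05) -> List[bool]:
    """Return a flag per p-value: True if significant at the given FDR."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])

    # Find the largest rank k (1-based) with p_(k) <= fdr * k / n.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= fdr * rank / n:
            cutoff_rank = rank

    # Every p-value ranked at or below the cutoff is called significant.
    significant = [False] * n
    for idx in order[:cutoff_rank]:
        significant[idx] = True
    return significant


# Toy scores for illustration only; not real Zoonomia values.
scores = [6.0, 3.0, 0.5, -1.2, 0.1]
flags = benjamini_hochberg([conservation_pvalue(s) for s in scores], fdr=0.05)
fraction = sum(flags) / len(flags)
print(flags, fraction)
```

The fraction of flagged sites in this toy list plays the role of the genome-wide percentage quoted above, although a real analysis runs over billions of sites rather than five.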