This is the preliminary display of the assembly of the sperm whale genome (Physeter_macrocephalus-2.0.2, GCA_000472045.1), produced in September 2013 by The Aquatic Genome Models Consortium. The assembly does not contain any assembled chromosomes or linkage groups and is instead made up of 11,711 unplaced scaffolds. It has a total sequence length of 2.28Gb with 47.03Mb of gaps, a scaffold N50 of 427.3kb and a contig N50 of 35.3kb. The N50 size is the length such that 50% of the assembled genome lies in blocks of the N50 size or longer.
What can I find? Protein-coding and non-coding genes, splice variants, cDNA and protein sequences, non-coding RNAs.
This preliminary gene annotation was mainly generated through protein sequence alignments. We aligned 10,150 dolphin and 20,345 human translations from Ensembl release 76, providing us with 8,916 and 16,329 gene models, respectively. Other predictions were made using vertebrate proteins. Finally, we also provide ab initio gene predictions and alignments of sequences from UniProt, UniGene and the ENA vertebrate RNA collection.
|Assembly:||PhyMac_2.0.2, Sep 2013|
|Golden Path Length:||2,280,711,356|
|Genscan gene predictions:||47,272|