This release features the second assembly of the olive baboon (Papio anubis) genome, Panu_2.0, provided by the Baylor College of Medicine in June 2012. The assembly has 20 autosomal chromosomes numbered 1-20 and a sex chromosome labelled X. The assembly contains a total sequence length of 2.95Gb with 55.1Mb of gaps. There are 72,500 scaffolds (63,229 unplaced) comprised of 198,931 contigs with a scaffold N50 of 528.9Kb and a contig N50 of 40.2Kb. The N50 size is the length such that 50% of the assembled genome lies in blocks of the N50 size or longer.
What can I find? Protein-coding and non-coding genes, splice variants, cDNA and protein sequences, non-coding RNAs.
Preliminary gene annotation in the olive baboon has been generated by alignments of nucleotide and protein sequences from a number different sources: 20,826 Ensembl human translations from the January 2013 genebuild (Ensembl release 70, GRCh37 assembly) and olive baboon-specific data consisting of 558 proteins obtained from UniprotKB and RefSeq and 462 RefSeq cDNAs. We successfully aligned 19,593 human translations (cut-off of percent identity > 80 and coverage > 50) giving 19,593 gene models. The olive baboon-specific alignments covered 535 proteins (no cut-off) and 462 cDNAs (no cut-off) and produced 641 and 525 gene models respectively.
In addition to these models, alignments of sequences from UniProt, UniGene, Genbank (olive baboon ESTs) and the ENA vertebrate RNA collection are also provided.
|Assembly:||Panu_2.0, Mar 2012|
|Golden Path Length:||2,948,380,710|
|Genscan gene predictions:||49,256|