XENLA Mayball

From TaejoonLab
Jump to: navigation, search

Statistics

cDNA sequences

  • Count: 35532
  • Mean(bp): 2649
  • Median(bp): 2245
  • >400 bp: 35273
  • >1000 bp: 31274
  • >10 kbp: 133

Protein sequences

  • Count: 35532
  • Mean(aa): 534
  • Median(aa): 425
  • >100 aa: 34571 (97.295)
  • >1000 aa: 3718
  • >3000 aa: 87

Files

http://genome.taejoonlab.org/pub/xenopus/annotation/MayBall_201305/

Official sequences

  • XENLA_2013mayball_cdna_longest.fa.gz - cDNA sequences (27M)
  • XENLA_2013mayball_prot_longest.fa.gz - protein sequences (11M)

Original sequences with scaffold coordinates

(If you are not interested in the location on JGIv71 scaffolds, you don't need to use these files. Sequences are identical to official sequences; only the header info is different.)

  • XENLA_2013mayball_prot_longest_coord.fa.gz - original protein sequences with scaffold coordinates on JGIv70, JGIv71 and NIGv2 (12M)
  • XENLA_2013mayball_cdna_longest_coord.fa.gz - original cDNA sequences with scaffold coordinates on JGIv70, JGIv71 and NIGv2 (28M)

Alignments

  • XENLA_2013mayball_cdna_longest.XENLA_JGIv91_dna_final.gmap.gff3.gz - GFF3 annotation file to JGIv91 genome, mapped by GMAP (12M)
  • XENLA_2013mayball_cdna_longest.XENLA_JGIv91_dna_final.gmap.psl.gz - GFF3 annotation file to JGIv91 genome, mapped by GMAP (5.7M)
  • XENLA_2013mayball_prot_longest.ens72.best_hits.gz - Summary of BLASTP hits to EnsEMBL v72 longest proteome (1.9M)