Human

Statistics about the GENCODE Release 35

The statistics derive from the gtf file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

General stats

Total No of Genes 60656
Protein-coding genes 19954
Long non-coding RNA genes 17957
Small non-coding RNA genes 7569
Pseudogenes 14767
- processed pseudogenes 10671
- unprocessed pseudogenes 3557
- unitary pseudogenes 235
- polymorphic pseudogenes 49
- pseudogenes 18
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 408
- pseudogenes 237
Total No of Transcripts 229580
Protein-coding transcripts 84485
- full length protein-coding 58390
- partial length protein-coding 26095
Nonsense mediated decay transcripts 16495
Long non-coding RNA loci transcripts 48684
 
Total No of distinct translations 62514
Genes that have more than one distinct translations 13697

Further details on this version's gene and transcript types

biotype genes transcripts
IG_C_gene 14 24
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_pseudogene 1 1
IG_V_gene 144 153
IG_V_pseudogene 188 188
lncRNA 16899 46977
miRNA 1881 1881
misc_RNA 2212 2212
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 90
nonsense_mediated_decay 0 16495
polymorphic_pseudogene 49 71
processed_pseudogene 10169 10171
processed_transcript 0 28672
protein_coding 19954 84485
pseudogene 18 37
retained_intron 0 28888
ribozyme 8 8
rRNA 47 47
rRNA_pseudogene 497 497
scaRNA 49 49
scRNA 1 1
snoRNA 943 943
snRNA 1901 1901
sRNA 5 5
TEC 1058 1150
TR_C_gene 6 6
TR_D_gene 4 4
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 106 107
TR_V_pseudogene 33 33
transcribed_processed_pseudogene 500 500
transcribed_unitary_pseudogene 138 145
transcribed_unprocessed_pseudogene 941 948
translated_processed_pseudogene 2 2
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 97 97
unprocessed_pseudogene 2615 2616
vault_RNA 1 1