Human

Statistics about the GENCODE Release 41

The statistics derive from the gtf file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

General stats

Total No of Genes 61852
Protein-coding genes 19370
- readthrough genes (not included) 647
Long non-coding RNA genes 19095
Small non-coding RNA genes 7566
Pseudogenes 14736
- processed pseudogenes 10662
- unprocessed pseudogenes 3573
- unitary pseudogenes 250
- pseudogenes 15
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 410
- pseudogenes 236
Total No of Transcripts 251236
Protein-coding transcripts 88780
- full length protein-coding 63370
- partial length protein-coding 25410
Nonsense mediated decay transcripts 20933
Long non-coding RNA loci transcripts 54291
 
Total No of distinct translations 65052
Genes that have more than one distinct translations 13614

Further details on this version's gene and transcript types

biotype genes transcripts
artifact 27 27
IG_C_gene 14 23
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_pseudogene 1 1
IG_V_gene 145 145
IG_V_pseudogene 187 187
lncRNA 18041 52586
miRNA 1879 1879
misc_RNA 2212 2212
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 104
nonsense_mediated_decay 0 20933
processed_pseudogene 10149 10151
processed_transcript 0 31113
protein_coding 20017 88780
protein_coding_LoF 0 73
pseudogene 15 15
retained_intron 0 33750
ribozyme 8 8
rRNA 47 47
rRNA_pseudogene 497 497
scaRNA 49 49
scRNA 1 1
snoRNA 942 942
snRNA 1901 1901
sRNA 5 5
TEC 1054 1144
TR_C_gene 6 6
TR_D_gene 4 4
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 107 107
TR_V_pseudogene 33 33
transcribed_processed_pseudogene 511 511
transcribed_unitary_pseudogene 152 154
transcribed_unprocessed_pseudogene 961 961
translated_processed_pseudogene 2 2
translated_unprocessed_pseudogene 3 3
unitary_pseudogene 98 97
unprocessed_pseudogene 2609 2610
vault_RNA 1 1