Human

Statistics about the GENCODE Release 45

The statistics derive from the gtf file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

General stats

Total No of Genes 63187
Protein-coding genes 19395
- readthrough genes (not included) 654
Long non-coding RNA genes 20424
Small non-coding RNA genes 7565
Pseudogenes 14719
- processed pseudogenes 10658
- unprocessed pseudogenes 3566
- unitary pseudogenes 258
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 411
- pseudogenes 237
Total No of Transcripts 252930
Protein-coding transcripts 89110
- full length protein-coding 64028
- partial length protein-coding 25082
Nonsense mediated decay transcripts 21427
Long non-coding RNA loci transcripts 59719
 
Total No of distinct translations 65357
Genes that have more than one distinct translations 13600

Further details on this version's gene and transcript types

biotype genes transcripts
artifact 19 19
IG_C_gene 14 23
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_pseudogene 1 1
IG_V_gene 145 145
IG_V_pseudogene 187 187
lncRNA 19370 57722
miRNA 1879 1879
misc_RNA 2208 2208
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 103
nonsense_mediated_decay 0 21427
processed_pseudogene 10143 10144
processed_transcript 0 10
protein_coding 20049 89110
protein_coding_CDS_not_defined 0 26477
protein_coding_LoF 0 74
retained_intron 0 34141
ribozyme 8 8
rRNA 47 47
rRNA_pseudogene 497 497
scaRNA 49 49
scRNA 1 1
snoRNA 942 942
snRNA 1901 1901
sRNA 5 5
TEC 1054 1141
TR_C_gene 6 6
TR_D_gene 5 5
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 107 107
TR_V_pseudogene 33 33
transcribed_processed_pseudogene 513 513
transcribed_unitary_pseudogene 158 158
transcribed_unprocessed_pseudogene 962 962
translated_processed_pseudogene 2 2
unitary_pseudogene 100 100
unprocessed_pseudogene 2604 2605
vault_RNA 4 4