Ensembl GTF file differences

FTP listing (release 85; the examples below use the release 84 files, which are named the same way)

http://ftp.ensembl.org/pub/release-85/gtf/homo_sapiens/

../
CHECKSUMS                                          10-Jul-2016 22:16                 221
Homo_sapiens.GRCh38.85.abinitio.gtf.gz             09-Jul-2016 23:45             3366550
Homo_sapiens.GRCh38.85.chr.gtf.gz                  09-Jul-2016 23:32            45754202
Homo_sapiens.GRCh38.85.chr_patch_hapl_scaff.gtf.gz 09-Jul-2016 23:39            49523538
Homo_sapiens.GRCh38.85.gtf.gz                      09-Jul-2016 23:32            45761783
README       

tl;dr

  • Homo_sapiens.GRCh38.84.gtf.gz: chromosomes and scaffolds
  • Homo_sapiens.GRCh38.84.chr.gtf.gz: reference chromosomes only (i.e. 1-22, X, Y, MT)
  • Homo_sapiens.GRCh38.84.chr_patch_hapl_scaff.gtf.gz: reference chromosomes, scaffolds, assembly patches, ...

Homo_sapiens.GRCh38.84.gtf.gz

Contains the comprehensive gene annotation on the primary assembly (chromosomes and scaffolds).

$ gunzip -c Homo_sapiens.GRCh38.84.gtf.gz | awk -F'\t' '{ print $1 }' | sort | uniq
#!genebuild-last-updated 2015-10
#!genome-build GRCh38.p5
#!genome-build-accession NCBI:GCA_000001405.20
#!genome-date 2013-12
#!genome-version GRCh38
1
10
11
12
13
14
15
16
17
18
19
2
20
21
22
3
4
5
6
7
8
9
GL000008.2
GL000009.2
GL000194.1
GL000195.1
GL000205.2
GL000213.1
GL000216.2
GL000218.1
GL000219.1
GL000220.1
GL000224.1
GL000225.1
KI270442.1
KI270706.1
KI270707.1
KI270708.1
KI270711.1
KI270713.1
KI270714.1
KI270721.1
KI270722.1
KI270723.1
KI270724.1
KI270726.1
KI270727.1
KI270728.1
KI270731.1
KI270733.1
KI270734.1
KI270741.1
KI270743.1
KI270744.1
KI270750.1
KI270752.1
MT
X
Y
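
The output above starts with the `#!` header lines because the awk command prints the first field of every line, comments included. A minimal sketch of a variant that skips them, assuming a hypothetical helper name `gtf_seqnames` (not part of any Ensembl tooling):

```shell
# Hypothetical helper: list the distinct sequence names (column 1) of a
# gzipped GTF file, skipping comment/header lines that start with "#".
gtf_seqnames() {
    gunzip -c "$1" | awk -F'\t' '!/^#/ { print $1 }' | sort -u
}

# Usage, with the file name from the section above:
#   gtf_seqnames Homo_sapiens.GRCh38.84.gtf.gz
```

`sort -u` is equivalent to `sort | uniq` here; the only behavioural change is the `!/^#/` filter.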

Homo_sapiens.GRCh38.84.chr.gtf.gz

Contains the comprehensive gene annotation on the reference chromosomes only.

$ gunzip -c Homo_sapiens.GRCh38.84.chr.gtf.gz | awk -F'\t' '{ print $1 }' | sort | uniq
#!genebuild-last-updated 2015-10
#!genome-build GRCh38.p5
#!genome-build-accession NCBI:GCA_000001405.20
#!genome-date 2013-12
#!genome-version GRCh38
1
10
11
12
13
14
15
16
17
18
19
2
20
21
22
3
4
5
6
7
8
9
MT
X
Y
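
To confirm that a file really is limited to the reference chromosomes, one option is to print any sequence name outside 1-22, X, Y and MT. This is only a sketch: the function name `non_reference_seqnames` is made up for illustration, and it expects one sequence name per line on stdin.

```shell
# Hypothetical check: read sequence names on stdin and print every name that
# is not a reference chromosome (1-22, X, Y, MT). Header lines starting with
# "#" are ignored. No output means only reference chromosomes were seen.
non_reference_seqnames() {
    awk '!/^#/ && $0 !~ /^([1-9]|1[0-9]|2[0-2]|X|Y|MT)$/'
}

# Usage, with the file name from the section above:
#   gunzip -c Homo_sapiens.GRCh38.84.chr.gtf.gz \
#     | awk -F'\t' '{ print $1 }' | sort -u | non_reference_seqnames
```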

Homo_sapiens.GRCh38.84.chr_patch_hapl_scaff.gtf.gz

Contains the comprehensive gene annotation on the reference chromosomes, scaffolds, assembly patches and alternate loci (haplotypes).

$ gunzip -c Homo_sapiens.GRCh38.84.chr_patch_hapl_scaff.gtf.gz | awk -F'\t' '{ print $1 }' | sort | uniq
#!genebuild-last-updated 2015-10
#!genome-build GRCh38.p5
#!genome-build-accession NCBI:GCA_000001405.20
#!genome-date 2013-12
#!genome-version GRCh38
1
10
11
12
13
14
15
16
17
18
19
2
20
21
22
3
4
5
6
7
8
9
CHR_HG126_PATCH
CHR_HG1342_HG2282_PATCH
CHR_HG1362_PATCH
CHR_HG142_HG150_NOVEL_TEST
CHR_HG151_NOVEL_TEST
CHR_HG1651_PATCH
CHR_HG1832_PATCH
CHR_HG2021_PATCH
CHR_HG2030_PATCH
CHR_HG2058_PATCH
CHR_HG2062_PATCH
CHR_HG2066_PATCH
CHR_HG2072_PATCH
CHR_HG2095_PATCH
CHR_HG2104_PATCH
CHR_HG2116_PATCH
CHR_HG2128_PATCH
CHR_HG2191_PATCH
CHR_HG2213_PATCH
CHR_HG2217_PATCH
CHR_HG2232_PATCH
CHR_HG2233_PATCH
CHR_HG2235_PATCH
CHR_HG2239_PATCH
CHR_HG2247_PATCH
CHR_HG2249_PATCH
CHR_HG2288_HG2289_PATCH
CHR_HG2290_PATCH
CHR_HG2291_PATCH
CHR_HG2334_PATCH
CHR_HG23_PATCH
CHR_HG26_PATCH
CHR_HG986_PATCH
CHR_HSCHR10_1_CTG1
CHR_HSCHR10_1_CTG2
CHR_HSCHR10_1_CTG3
CHR_HSCHR10_1_CTG4
CHR_HSCHR10_1_CTG6
CHR_HSCHR11_1_CTG1_2
CHR_HSCHR11_1_CTG5
CHR_HSCHR11_1_CTG6
CHR_HSCHR11_1_CTG7
CHR_HSCHR11_1_CTG8
CHR_HSCHR11_2_CTG1
CHR_HSCHR11_2_CTG1_1
CHR_HSCHR11_3_CTG1
CHR_HSCHR12_1_CTG1
CHR_HSCHR12_1_CTG2_1
CHR_HSCHR12_2_CTG1
CHR_HSCHR12_2_CTG2
CHR_HSCHR12_2_CTG2_1
CHR_HSCHR12_3_CTG2
CHR_HSCHR12_3_CTG2_1
CHR_HSCHR12_4_CTG2
CHR_HSCHR12_4_CTG2_1
CHR_HSCHR12_5_CTG2
CHR_HSCHR12_5_CTG2_1
CHR_HSCHR12_6_CTG2_1
CHR_HSCHR13_1_CTG1
CHR_HSCHR13_1_CTG3
CHR_HSCHR13_1_CTG5
CHR_HSCHR13_1_CTG7
CHR_HSCHR13_1_CTG8
CHR_HSCHR14_1_CTG1
CHR_HSCHR14_2_CTG1
CHR_HSCHR14_3_CTG1
CHR_HSCHR14_7_CTG1
CHR_HSCHR15_1_CTG1
CHR_HSCHR15_1_CTG3
CHR_HSCHR15_1_CTG8
CHR_HSCHR15_2_CTG3
CHR_HSCHR15_2_CTG8
CHR_HSCHR15_3_CTG3
CHR_HSCHR15_3_CTG8
CHR_HSCHR15_4_CTG8
CHR_HSCHR15_5_CTG8
CHR_HSCHR15_6_CTG8
CHR_HSCHR16_1_CTG1
CHR_HSCHR16_1_CTG3_1
CHR_HSCHR16_2_CTG3_1
CHR_HSCHR16_3_CTG1
CHR_HSCHR16_4_CTG1
CHR_HSCHR16_4_CTG3_1
CHR_HSCHR16_5_CTG1
CHR_HSCHR16_CTG2
CHR_HSCHR17_10_CTG4
CHR_HSCHR17_1_CTG1
CHR_HSCHR17_1_CTG2
CHR_HSCHR17_1_CTG4
CHR_HSCHR17_1_CTG5
CHR_HSCHR17_1_CTG9
CHR_HSCHR17_2_CTG1
CHR_HSCHR17_2_CTG2
CHR_HSCHR17_2_CTG4
CHR_HSCHR17_2_CTG5
CHR_HSCHR17_3_CTG2
CHR_HSCHR17_3_CTG4
CHR_HSCHR17_4_CTG4
CHR_HSCHR17_5_CTG4
CHR_HSCHR17_6_CTG4
CHR_HSCHR17_7_CTG4
CHR_HSCHR17_8_CTG4
CHR_HSCHR17_9_CTG4
CHR_HSCHR18_1_CTG1_1
CHR_HSCHR18_1_CTG2_1
CHR_HSCHR18_2_CTG1_1
CHR_HSCHR18_2_CTG2
CHR_HSCHR18_2_CTG2_1
CHR_HSCHR18_3_CTG2_1
CHR_HSCHR18_5_CTG1_1
CHR_HSCHR18_ALT21_CTG2_1
CHR_HSCHR18_ALT2_CTG2_1
CHR_HSCHR19KIR_ABC08_A1_HAP_CTG3_1
CHR_HSCHR19KIR_ABC08_AB_HAP_C_P_CTG3_1
CHR_HSCHR19KIR_ABC08_AB_HAP_T_P_CTG3_1
CHR_HSCHR19KIR_FH05_A_HAP_CTG3_1
CHR_HSCHR19KIR_FH05_B_HAP_CTG3_1
CHR_HSCHR19KIR_FH06_A_HAP_CTG3_1
CHR_HSCHR19KIR_FH06_BA1_HAP_CTG3_1
CHR_HSCHR19KIR_FH08_A_HAP_CTG3_1
CHR_HSCHR19KIR_FH08_BAX_HAP_CTG3_1
CHR_HSCHR19KIR_FH13_A_HAP_CTG3_1
CHR_HSCHR19KIR_FH13_BA2_HAP_CTG3_1
CHR_HSCHR19KIR_FH15_A_HAP_CTG3_1
CHR_HSCHR19KIR_FH15_B_HAP_CTG3_1
CHR_HSCHR19KIR_G085_A_HAP_CTG3_1
CHR_HSCHR19KIR_G085_BA1_HAP_CTG3_1
CHR_HSCHR19KIR_G248_A_HAP_CTG3_1
CHR_HSCHR19KIR_G248_BA2_HAP_CTG3_1
CHR_HSCHR19KIR_GRC212_AB_HAP_CTG3_1
CHR_HSCHR19KIR_GRC212_BA1_HAP_CTG3_1
CHR_HSCHR19KIR_LUCE_A_HAP_CTG3_1
CHR_HSCHR19KIR_LUCE_BDEL_HAP_CTG3_1
CHR_HSCHR19KIR_RP5_B_HAP_CTG3_1
CHR_HSCHR19KIR_RSH_A_HAP_CTG3_1
CHR_HSCHR19KIR_RSH_BA2_HAP_CTG3_1
CHR_HSCHR19KIR_T7526_A_HAP_CTG3_1
CHR_HSCHR19KIR_T7526_BDEL_HAP_CTG3_1
CHR_HSCHR19LRC_COX1_CTG3_1
CHR_HSCHR19LRC_COX2_CTG3_1
CHR_HSCHR19LRC_LRC_I_CTG3_1
CHR_HSCHR19LRC_LRC_J_CTG3_1
CHR_HSCHR19LRC_LRC_S_CTG3_1
CHR_HSCHR19LRC_LRC_T_CTG3_1
CHR_HSCHR19LRC_PGF1_CTG3_1
CHR_HSCHR19LRC_PGF2_CTG3_1
CHR_HSCHR19_1_CTG2
CHR_HSCHR19_1_CTG3_1
CHR_HSCHR19_2_CTG2
CHR_HSCHR19_2_CTG3_1
CHR_HSCHR19_3_CTG2
CHR_HSCHR19_3_CTG3_1
CHR_HSCHR19_4_CTG2
CHR_HSCHR19_4_CTG3_1
CHR_HSCHR19_5_CTG2
CHR_HSCHR1_1_CTG11
CHR_HSCHR1_1_CTG3
CHR_HSCHR1_1_CTG31
CHR_HSCHR1_1_CTG32_1
CHR_HSCHR1_2_CTG3
CHR_HSCHR1_2_CTG31
CHR_HSCHR1_2_CTG32_1
CHR_HSCHR1_3_CTG3
CHR_HSCHR1_3_CTG31
CHR_HSCHR1_3_CTG32_1
CHR_HSCHR1_4_CTG3
CHR_HSCHR1_4_CTG31
CHR_HSCHR1_5_CTG32_1
CHR_HSCHR1_ALT2_1_CTG32_1
CHR_HSCHR20_1_CTG1
CHR_HSCHR20_1_CTG2
CHR_HSCHR20_1_CTG3
CHR_HSCHR20_1_CTG4
CHR_HSCHR21_2_CTG1_1
CHR_HSCHR21_3_CTG1_1
CHR_HSCHR21_4_CTG1_1
CHR_HSCHR21_5_CTG2
CHR_HSCHR21_6_CTG1_1
CHR_HSCHR21_8_CTG1_1
CHR_HSCHR22_1_CTG1
CHR_HSCHR22_1_CTG2
CHR_HSCHR22_1_CTG3
CHR_HSCHR22_1_CTG4
CHR_HSCHR22_1_CTG5
CHR_HSCHR22_1_CTG6
CHR_HSCHR22_1_CTG7
CHR_HSCHR22_2_CTG1
CHR_HSCHR22_3_CTG1
CHR_HSCHR22_4_CTG1
CHR_HSCHR22_5_CTG1
CHR_HSCHR22_6_CTG1
CHR_HSCHR22_7_CTG1
CHR_HSCHR2_1_CTG1
CHR_HSCHR2_1_CTG15
CHR_HSCHR2_1_CTG5
CHR_HSCHR2_1_CTG7
CHR_HSCHR2_1_CTG7_2
CHR_HSCHR2_2_CTG1
CHR_HSCHR2_2_CTG15
CHR_HSCHR2_2_CTG7
CHR_HSCHR2_2_CTG7_2
CHR_HSCHR2_3_CTG1
CHR_HSCHR2_3_CTG15
CHR_HSCHR2_3_CTG7_2
CHR_HSCHR2_4_CTG1
CHR_HSCHR3_1_CTG1
CHR_HSCHR3_1_CTG2_1
CHR_HSCHR3_1_CTG3
CHR_HSCHR3_2_CTG2_1
CHR_HSCHR3_2_CTG3
CHR_HSCHR3_3_CTG1
CHR_HSCHR3_3_CTG3
CHR_HSCHR3_4_CTG2_1
CHR_HSCHR3_4_CTG3
CHR_HSCHR3_5_CTG2_1
CHR_HSCHR3_5_CTG3
CHR_HSCHR3_6_CTG3
CHR_HSCHR3_7_CTG3
CHR_HSCHR3_8_CTG3
CHR_HSCHR3_9_CTG3
CHR_HSCHR4_1_CTG12
CHR_HSCHR4_1_CTG4
CHR_HSCHR4_1_CTG6
CHR_HSCHR4_1_CTG9
CHR_HSCHR4_2_CTG12
CHR_HSCHR4_2_CTG4
CHR_HSCHR4_3_CTG12
CHR_HSCHR4_4_CTG12
CHR_HSCHR4_5_CTG12
CHR_HSCHR4_6_CTG12
CHR_HSCHR4_7_CTG12
CHR_HSCHR4_8_CTG12
CHR_HSCHR4_9_CTG12
CHR_HSCHR5_1_CTG1
CHR_HSCHR5_1_CTG1_1
CHR_HSCHR5_1_CTG5
CHR_HSCHR5_2_CTG1
CHR_HSCHR5_2_CTG1_1
CHR_HSCHR5_2_CTG5
CHR_HSCHR5_3_CTG1
CHR_HSCHR5_3_CTG5
CHR_HSCHR5_4_CTG1
CHR_HSCHR5_4_CTG1_1
CHR_HSCHR5_5_CTG1
CHR_HSCHR5_6_CTG1
CHR_HSCHR5_7_CTG1
CHR_HSCHR6_1_CTG10
CHR_HSCHR6_1_CTG2
CHR_HSCHR6_1_CTG3
CHR_HSCHR6_1_CTG4
CHR_HSCHR6_1_CTG5
CHR_HSCHR6_1_CTG6
CHR_HSCHR6_1_CTG7
CHR_HSCHR6_1_CTG8
CHR_HSCHR6_1_CTG9
CHR_HSCHR6_8_CTG1
CHR_HSCHR6_MHC_APD_CTG1
CHR_HSCHR6_MHC_COX_CTG1
CHR_HSCHR6_MHC_DBB_CTG1
CHR_HSCHR6_MHC_MANN_CTG1
CHR_HSCHR6_MHC_MCF_CTG1
CHR_HSCHR6_MHC_QBL_CTG1
CHR_HSCHR6_MHC_SSTO_CTG1
CHR_HSCHR7_1_CTG1
CHR_HSCHR7_1_CTG4_4
CHR_HSCHR7_1_CTG6
CHR_HSCHR7_1_CTG7
CHR_HSCHR7_2_CTG1
CHR_HSCHR7_2_CTG4_4
CHR_HSCHR7_2_CTG6
CHR_HSCHR7_2_CTG7
CHR_HSCHR7_3_CTG6
CHR_HSCHR8_1_CTG1
CHR_HSCHR8_1_CTG6
CHR_HSCHR8_1_CTG7
CHR_HSCHR8_2_CTG1
CHR_HSCHR8_2_CTG7
CHR_HSCHR8_3_CTG1
CHR_HSCHR8_3_CTG7
CHR_HSCHR8_4_CTG1
CHR_HSCHR8_4_CTG7
CHR_HSCHR8_5_CTG1
CHR_HSCHR8_5_CTG7
CHR_HSCHR8_6_CTG1
CHR_HSCHR8_7_CTG1
CHR_HSCHR8_8_CTG1
CHR_HSCHR8_9_CTG1
CHR_HSCHR9_1_CTG1
CHR_HSCHR9_1_CTG2
CHR_HSCHR9_1_CTG3
CHR_HSCHR9_1_CTG4
CHR_HSCHR9_1_CTG5
CHR_HSCHR9_1_CTG6
CHR_HSCHRX_1_CTG3
CHR_HSCHRX_2_CTG12
CHR_HSCHRX_2_CTG3
GL000008.2
GL000009.2
GL000194.1
GL000195.1
GL000205.2
GL000213.1
GL000216.2
GL000218.1
GL000219.1
GL000220.1
GL000224.1
GL000225.1
KI270442.1
KI270706.1
KI270707.1
KI270708.1
KI270711.1
KI270713.1
KI270714.1
KI270721.1
KI270722.1
KI270723.1
KI270724.1
KI270726.1
KI270727.1
KI270728.1
KI270731.1
KI270733.1
KI270734.1
KI270741.1
KI270743.1
KI270744.1
KI270750.1
KI270752.1
MT
X
Y
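
A long list like this is easier to compare across files once the names are grouped. The sketch below counts names per group, assuming the naming patterns visible in the listings above: plain 1-22, X, Y, MT for reference chromosomes, GL/KI accessions for scaffolds, and a `CHR_` prefix for patches and alternate loci. The function name and group labels are hypothetical.

```shell
# Hypothetical summary: read sequence names on stdin (one per line) and count
# how many fall into each group. The patterns are assumptions taken from the
# names shown in the listings, not an official Ensembl classification.
classify_seqnames() {
    awk '
        /^#/                             { next }                       # header lines
        /^([1-9]|1[0-9]|2[0-2]|X|Y|MT)$/ { n["chromosome"]++; next }
        /^(GL|KI)[0-9]+\.[0-9]+$/        { n["scaffold"]++; next }
        /^CHR_/                          { n["patch_or_alt"]++; next }
                                         { n["other"]++ }
        END {
            printf "chromosome\t%d\n",   n["chromosome"]
            printf "scaffold\t%d\n",     n["scaffold"]
            printf "patch_or_alt\t%d\n", n["patch_or_alt"]
            printf "other\t%d\n",        n["other"]
        }'
}

# Usage, with the file name from the section above:
#   gunzip -c Homo_sapiens.GRCh38.84.chr_patch_hapl_scaff.gtf.gz \
#     | awk -F'\t' '{ print $1 }' | sort -u | classify_seqnames
```

Applied to the names listed above, it should report 25 chromosomes, 34 scaffolds and 296 patch or alternate-locus sequences.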