@dakl
Last active November 23, 2015 14:30
Merge VCFs from 19 TCGA projects to a pancan19 VCF
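#!/bin/bash
# Merge the per-project TCGA VCFs in $WORKDIR into one pancan19 VCF with GATK CombineVariants.
REF=/proj/b2010040/private/nobackup/autoseqer-genome/genome/human_g1k_v37_decoy.fasta
# Not referenced below; GATK expects the .dict to sit alongside the FASTA.
DICT=/proj/b2010040/private/nobackup/autoseqer-genome/genome/human_g1k_v37_decoy.dict
GATKJAR=/home/daniel.klevebring/projects/tcga-maf-to-vcf/GenomeAnalysisTK.jar
WORKDIR=$HOME/Crisp/dakl/tcga-maf-to-vcf
mkdir -p "$WORKDIR"
cd "$WORKDIR" || exit 1
PANCAN19VCF=tcga-pancan19.vcf.gz
# Build the "-V file" argument list from every tcga-*vcf.gz in this directory,
# skipping any pancan output left over from a previous run.
VCFSTOMERGE=()
while IFS= read -r vcf; do
    VCFSTOMERGE+=(-V "$vcf")
done < <(find . -maxdepth 1 -name 'tcga-*vcf.gz' ! -name '*pancan*')
# Stop early instead of calling GATK with no inputs.
if [ ${#VCFSTOMERGE[@]} -eq 0 ]; then
    echo "No tcga-*vcf.gz files found in $WORKDIR" >&2
    exit 1
fi
java -jar "$GATKJAR" -T CombineVariants -R "$REF" --genotypemergeoption UNIQUIFY "${VCFSTOMERGE[@]}" -o "$PANCAN19VCF"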
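The script merges whatever matching files it finds, so a missing project VCF would go unnoticed. A minimal sketch of a pre-flight check follows, assuming one VCF per project and the count of 19 from the title. The function names `list_project_vcfs` and `check_vcf_count` are hypothetical and not part of the original script.

```shell
#!/bin/bash
# Hypothetical pre-flight check; the function names and the expected count
# are assumptions for illustration, not taken from the original script.

# Print the per-project VCFs in a directory, one per line, excluding pancan output.
list_project_vcfs() {
    find "$1" -maxdepth 1 -name 'tcga-*vcf.gz' ! -name '*pancan*' | sort
}

# Return 0 only if the directory holds exactly the expected number of inputs.
check_vcf_count() {
    local dir=$1 expected=$2 found
    found=$(list_project_vcfs "$dir" | wc -l)
    found=$((found))   # normalise the padding some wc implementations add
    if [ "$found" -ne "$expected" ]; then
        echo "expected $expected VCFs in $dir, found $found" >&2
        return 1
    fi
}
```

It could be called just before the java command, for example `check_vcf_count "$WORKDIR" 19 || exit 1`.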