@brentp
Created July 17, 2012 20:44
BED file of transcription start sites (TSS)
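# Window around each TSS: UPSTREAM bp before the site, INSTREAM bp into the transcript.
UPSTREAM=400
INSTREAM=100
ORG=hg18
# Pull transcript coordinates and gene symbols from the public UCSC MySQL server.
mysql --user genome --host genome-mysql.cse.ucsc.edu -NAD "$ORG" -e \
"select chrom, txStart, txEnd, X.geneSymbol, strand from knownGene as K, kgXref as X WHERE txStart != txEnd AND X.kgID = K.name" \
| awk -v ups="$UPSTREAM" -v ins="$INSTREAM" 'BEGIN{OFS=FS="\t"}
# minus strand: TSS is txEnd, upstream is toward higher coordinates
$5 == "-" { print $1,$3-ins,$3+ups,$4 }
# plus strand: TSS is txStart, upstream is toward lower coordinates (clamped at 0)
$5 == "+" { s=$2-ups; if (s<0) s=0; print $1,s,$2+ins,$4 }' \
| sort -k1,1 -k2,2n \
| bedtools merge -i - -nms > tss.bed
# -nms (merge names) is from older bedtools; newer releases use: -c 4 -o distinct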
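
The strand handling is the easiest part of this pipeline to get wrong, so here is a minimal sketch for checking it offline, without the MySQL query or bedtools. It assumes "upstream" means toward lower coordinates on the + strand and toward higher coordinates on the - strand, and that negative starts should be clamped to 0. The function name tss_windows and the three input records are made up for illustration; they are not part of the gist or of any real annotation.

```shell
#!/bin/sh
# Hypothetical helper: tss_windows UPSTREAM INSTREAM
# stdin:  chrom, txStart, txEnd, name, strand (tab-separated)
# stdout: chrom, start, end, name for the window around each TSS
tss_windows() {
    awk -v ups="$1" -v ins="$2" 'BEGIN { OFS = FS = "\t" }
    # plus strand: TSS is txStart, upstream is toward lower coordinates
    $5 == "+" { lo = $2 - ups; hi = $2 + ins }
    # minus strand: TSS is txEnd, upstream is toward higher coordinates
    $5 == "-" { lo = $3 - ins; hi = $3 + ups }
    $5 == "+" || $5 == "-" {
        if (lo < 0) lo = 0   # BED starts cannot be negative
        print $1, lo, hi, $4
    }'
}

# Made-up records, not real gene annotations.
printf 'chr1\t1000\t5000\tGENE_A\t+\nchr1\t8000\t9000\tGENE_B\t-\nchr2\t50\t700\tGENE_C\t+\n' \
    | tss_windows 400 100
```

With 400 bp upstream and 100 bp instream, GENE_A (+) should give 600 to 1100, GENE_B (-) should give 8900 to 9400, and GENE_C, whose txStart is only 50, should start at 0 instead of -350.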