Skip to content

Instantly share code, notes, and snippets.

View inambioinfo's full-sized avatar

Inamul Hasan Madar inambioinfo

View GitHub Profile
@inambioinfo
inambioinfo / rna_seq_variant_pipeline.sh
Created August 8, 2018 04:11 — forked from PoisonAlien/rna_seq_variant_pipeline.sh
RNA seq Variant calling pipeline according to gatk best practices
#!/bin/bash
#
# AUTHOR: Anand M.
# RNA-Seq variant calling pieline accoring to GATK Best practices.
# https://www.broadinstitute.org/gatk/guide/article?id=3891
#
# Call with following arguments
# bash rna_seq_variant_pipeline.sh <Input_Reads1.fq.gz> <Input_Reads2.fq.gz> <output_basename>
#
# Assumes STAR aligner is under path
@inambioinfo
inambioinfo / SJtoBED12.bash
Created June 11, 2018 19:44 — forked from fabiolib/SJtoBED12.bash
Short bash script to convert STAR SJ.out.tab into BED format similar to TopHat junctions.bed
#!/bin/bash
##run from results directory
##Output BED file is sorted for indexing and loading into IGV
##awk for converting SJ.out.tab to bed12 format
##based on code originally published by frymor at http://seqanswers.com/forums/showthread.php?t=62896
for sj in */*SJ.out.tab
do
echo ${sj}
echo "Converting..."
awk \
@inambioinfo
inambioinfo / rnaseq.md
Created April 24, 2018 07:44 — forked from alyssafrazee/rnaseq.md
rna-seq workflow (draft)

Analyzing RNA-seq data with the "Tuxedo" tools

Introduction/tl;dr: I wrote this post as a reference for a few new graduate students in my department that are getting started with RNA-seq data analysis. It begins with an informal, big-picture overview of RNA-seq data analysis, and the general flow of the post outlines one standard RNA-seq workflow, but I wanted to give general audiences a "heads-up" that the post goes into quite a bit of nitty gritty detail that's specific to our department's computing setup.

preliminaries: what's RNA-seq?

RNA-seq is a high-throughput technology used to measure gene expression in cell populations. For a super bare-bones picture of what gene expression is, please enjoy this ASCII art I made to illustrate the process:

[DNA]            ACGTAGGT{CGTATTT}AGCGT{AGCGCCCGA}TTACA
                                    |                   
@inambioinfo
inambioinfo / EnsemblGTFtoAnnotation.sh
Created January 25, 2017 07:33 — forked from decodebiology/EnsemblGTFtoAnnotation.sh
Convert EnsEMBL GTF to Annotation table (Geneid, GeneSymbol, GeneWiseChrLocation, GeneClass, Strand)
echo `date`;
dir=(`dirname $1`)
base=(`basename $1`)
echo "1/7 Preparing files";
cat $1 | sed 's/ "/\t/g' | sed 's/"; /\t/g' | cut -f1,4,5,7,10,12,16,18,22 | sed 's/";//' | awk '!x[$6]++' | awk '{print $6"\t"$0;}' > $dir/enst_annotation.tmp;