Skip to content

Instantly share code, notes, and snippets.

View mtw's full-sized avatar

Michael T. Wolfinger mtw

View GitHub Profile
@mtw
mtw / PhD Advertisement.pdf
Last active February 4, 2021 21:41
2021-02-03 Two PhD positions @TBI
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@mtw
mtw / extract_pp.sh
Created October 26, 2013 22:05
Extract properly-paired reads and their mates (ie flags 99/147/163/83) from paired-end BAM files
#!/bin/bash
samtools=`which samtools`
bn=$(basename $1 .bam)
echo processing $bn
$samtools view -h -b -f99 $1 > $bn.flag99.bam
$samtools view -h -b -f147 $1 > $bn.flag147.bam
$samtools view -h -b -f163 $1 > $bn.flag163.bam
$samtools view -h -b -f83 $1 > $bn.flag83.bam
@mtw
mtw / gist:3927eaebad6192bca9dc26c9e7b339a0
Created February 7, 2018 14:13
A minimalistic Perl6 Stockholm format parser
#!/usr/bin/env perl6
# a minimal Stockholm alignment format parser that reads
# ONLY single-line alignments and dumps each sequence (without additional
# gap characters) to a file
use Grammar::Tracer;
grammar StockholmParser {
token TOP { <header>+ <alignment> <consensus> <sep>? <.eol> }
@mtw
mtw / README
Last active January 29, 2018 12:07
RNA bioinformatics PhD position with Zasha Weinberg
Zasha Weinberg is looking for a PhD candidate in RNA bioinformatics
@mtw
mtw / bam2fastq_cutadapt.sh
Last active February 7, 2017 22:56
Raw NGS data is often shipped as unaligned BAM files. This shell script converts raw reads to FASTQ format and trims Illumina adapters with cutadapt. Quality Control is performed with FastQC.
#!/bin/bash
cutadapt=`which cutadapt`
fastqc=`which fastqc`
bam2fastq=`which bam2fastq`
gzip=`which gzip`
origdir="."
results="cutadapt"
fastqcdir="${results}/FastQC"
adapter5="CTACACTCTTTCCCTACACGACGCTCTTCCGATCT"
@mtw
mtw / cuff_me.sh
Last active January 3, 2016 23:49
Simple wrapper script for cufflinks
#!/bin/bash
bamdir="./"
cufflinks=`which cufflinks`
annotation="foo.gtf"
threads=40
label_RABT="CUFFRABT"
label_denovo="CUFFDENOVO"
for BAM in $(ls $bamdir/*.bam)
do
@mtw
mtw / dsm.opt
Created December 18, 2013 22:53
IBM Tivoli Storage Manager config files
dateformat 4
@mtw
mtw / dexseq_count_em.sh
Last active December 28, 2015 08:09
Wrapper for DEXseq read counting
#!/bin/bash
samdir="./"
dexseq_flat_gff="my.DEXSeq.gff"
outdir="./dexseq-count"
dexseq_count="python /home/foo/bin/dexseq_count.py"
samtools=`which samtools`
if ! [ -d "$outdir" ];
then
mkdir -p $outdir
@mtw
mtw / cutadapt_em_PE.sh
Last active December 28, 2015 06:28
Wrapper script for paired-end cutadapt
#!/bin/bash
cutadapt=`which cutadapt`
fastqc=`which fastqc`
origdir=".."
results="."
fastqcdir="${results}/FastQC"
adapter5="CTACACTCTTTCCCTACACGACGCTCTTCCGATCT"
adapter3="GATCGGAAGAGCACACGTCTGAACTCCAGTCAC"
inprefix="C"
@mtw
mtw / htseq_count_em.sh
Last active December 26, 2015 08:19
Wrapper for htseq-count
#!/bin/bash
samdir="./"
annotation="my.gtf"
htseqdir="./htseq-count"
htseqcount=`which htseq-count`
samtools=`which samtools`
feature="gene"
idattr="gene_name"
ss="reverse"