Skip to content

Instantly share code, notes, and snippets.

View geniusphil's full-sized avatar
🖖
Focusing

Philippe Lin geniusphil

🖖
Focusing
View GitHub Profile
## Hadoop For NGS Information
### Alignment
* [Spark BWA](https://github.com/citiususc/SparkBWA)
* [BigBWA](https://github.com/citiususc/BigBWA)
### Variant Calling
* [Vardoop](https://github.com/zcai/Vardoop)
* [gatk](https://github.com/broadinstitute/gatk)
### Annotation
library(ggplot2)
qual <- read.table("NA12878_QUAL.txt", header = FALSE)
hist(qual$V1,
main="NA12878_QUAL",
xlab="QUAL",
xlim=c(0,4000),
breaks=100)
@geniusphil
geniusphil / vcftools_install.sh
Last active August 26, 2019 15:57
Installation vcftools on MacOS
# Check package
brew update
brew upgrade
brew install automake
brew install zlib
brew install pkg-config
# Download vcftools
git clone https://github.com/vcftools/vcftools.git
cd vcftools
@geniusphil
geniusphil / gist:75dca4b6c9395e9fb4e63fd78f27f468
Last active September 6, 2020 11:08
digi_plus_talent_note

Analysis flow

  1. bwa mem alignment
  2. samtools convert sam to bam
  3. sambamba sort
  4. sambamba mark duplication (markdup)
  5. GATK Haplotype Caller
  6. GATK GVCFs -> final vcf file
  7. Annovar -- vcf to annovar input
  8. Annovar -- table_annovar