Skip to content

Instantly share code, notes, and snippets.

View srividya22's full-sized avatar

Srividya Ramakrishnan srividya22

  • John Hopkins University
  • Baltimore
View GitHub Profile
@srividya22
srividya22 / BCFtools cheat sheet
Created June 4, 2021 16:18 — forked from elowy01/BCFtools cheat sheet
BCFtools cheat sheet
*bcftools filter
*Filter variants per region (in this example, print out only variants mapped to chr1 and chr2)
qbcftools filter -r1,2 ALL.chip.omni_broad_sanger_combined.20140818.snps.genotypes.hg38.vcf.gz
*printing out info for only 2 samples:
bcftools view -s NA20818,NA20819 filename.vcf.gz
*printing stats only for variants passing the filter:
bcftools view -f PASS filename.vcf.gz
@srividya22
srividya22 / BCFtools cheat sheet
Created June 4, 2021 16:18 — forked from elowy01/BCFtools cheat sheet
BCFtools cheat sheet
*bcftools filter
*Filter variants per region (in this example, print out only variants mapped to chr1 and chr2)
qbcftools filter -r1,2 ALL.chip.omni_broad_sanger_combined.20140818.snps.genotypes.hg38.vcf.gz
*printing out info for only 2 samples:
bcftools view -s NA20818,NA20819 filename.vcf.gz
*printing stats only for variants passing the filter:
bcftools view -f PASS filename.vcf.gz
@srividya22
srividya22 / README.md
Created May 29, 2018 22:08 — forked from jdblischak/README.md
snakemake_vmem_usage

Testing Snakemake virtual memory usage

John Blischak 2014-05-14

Multiple users have observed that submitting jobs via Snakemake requires much more memory than is necessary to run the command (e.g. mailing list post, [Bitbucket issue][issue]).

@srividya22
srividya22 / README.md
Created May 29, 2018 22:08 — forked from jdblischak/README.md
snakemake_vmem_usage

Testing Snakemake virtual memory usage

John Blischak 2014-05-14

Multiple users have observed that submitting jobs via Snakemake requires much more memory than is necessary to run the command (e.g. mailing list post, [Bitbucket issue][issue]).

@srividya22
srividya22 / xargs.sh
Created January 5, 2018 17:13
xargs cheatsheet
# turn a find or cut (cut delimiter, get first column) output into a list
/etc find . -name "*bash*" | xargs
cut -d, -f1 file.csv | xargs
# find a file and grep for a word in the file
find . -name "*.java" | xargs grep "Stock"
# handeling filenames which have WHITESPACE
ls *txt | xargs -d '\n' grep "cost"
@srividya22
srividya22 / xargs.sh
Created January 5, 2018 17:13
xargs cheatsheet
# turn a find or cut (cut delimiter, get first column) output into a list
/etc find . -name "*bash*" | xargs
cut -d, -f1 file.csv | xargs
# find a file and grep for a word in the file
find . -name "*.java" | xargs grep "Stock"
# handeling filenames which have WHITESPACE
ls *txt | xargs -d '\n' grep "cost"
@srividya22
srividya22 / maker_genome_annotation.md
Created September 8, 2017 19:20 — forked from darencard/maker_genome_annotation.md
In-depth description of running MAKER for genome annotation.

Genome Annotation using MAKER

MAKER is a great tool for annotating a reference genome using empirical and ab initio gene predictions. GMOD, the umbrella organization that includes MAKER, has some nice tutorials online for running MAKER. However, these were quite simplified examples and it took a bit of effort to wrap my head completely around everything. Here I will describe a de novo genome annotation for Boa constrictor in detail, so that there is a record and that it is easy to use this as a guide to annotate any genome.

Software & Data

Software prerequisites:

  1. RepeatModeler and RepeatMasker with all dependencies (I used NCBI BLAST) and RepBase (version used was 20150807).
  2. MAKER MPI version 2.31.8 (though any other version 2 releases should be okay).
  3. [Augustus](http://bio
@srividya22
srividya22 / maker_genome_annotation.md
Created September 8, 2017 19:20 — forked from darencard/maker_genome_annotation.md
In-depth description of running MAKER for genome annotation.

Genome Annotation using MAKER

MAKER is a great tool for annotating a reference genome using empirical and ab initio gene predictions. GMOD, the umbrella organization that includes MAKER, has some nice tutorials online for running MAKER. However, these were quite simplified examples and it took a bit of effort to wrap my head completely around everything. Here I will describe a de novo genome annotation for Boa constrictor in detail, so that there is a record and that it is easy to use this as a guide to annotate any genome.

Software & Data

Software prerequisites:

  1. RepeatModeler and RepeatMasker with all dependencies (I used NCBI BLAST) and RepBase (version used was 20150807).
  2. MAKER MPI version 2.31.8 (though any other version 2 releases should be okay).
  3. [Augustus](http://bio
@srividya22
srividya22 / rename_genes_in_maker_gff.pl
Created August 24, 2017 21:40 — forked from avrilcoghlan/rename_genes_in_maker_gff.pl
Perl script that renames genes in the maker gff files so that they have unique names.
#!/usr/bin/env perl
=head1 NAME
rename_genes_in_maker_gff.pl
=head1 SYNOPSIS
rename_genes_in_maker_gff.pl input_gff output_gff outputdir species
where input_gff is the input gff file,
@srividya22
srividya22 / rename_genes_in_maker_gff.pl
Created August 24, 2017 21:40 — forked from avrilcoghlan/rename_genes_in_maker_gff.pl
Perl script that renames genes in the maker gff files so that they have unique names.
#!/usr/bin/env perl
=head1 NAME
rename_genes_in_maker_gff.pl
=head1 SYNOPSIS
rename_genes_in_maker_gff.pl input_gff output_gff outputdir species
where input_gff is the input gff file,