Skip to content

Instantly share code, notes, and snippets.

View gisleDK's full-sized avatar

Gisle Vestergaard gisleDK

View GitHub Profile
@gisleDK
gisleDK / read_count.sh
Created September 6, 2018 09:57
Script for summarizing fastq read counts
#!/bin/sh
# Script for merging fastq read counts
# Written by Gisle Vestergaard
# gisle@bio.ku.dk
[ $# != 2 ] && { echo "Usage: read_counter.sh <directory containing .fastq files DIR> <output filename FILE>"; exit 1; }
READDIR=$1
OUTPUT=$2
cd $READDIR
[ -e $OUTPUT ] && { echo "output file exists"; exit 1; }
@gisleDK
gisleDK / prodigal_hmmsearch_merger.rb
Created June 15, 2018 08:42
Script for adding hmmsearch results <domtblout> to Prodigal style gff files allowing artemis etc. visualization
#!/bin/ruby
# Script for adding hmmsearch results to Prodigal style gff files
# Written by Gisle Vestergaard
# gisle@bio.ku.dk
prodigal_file = ARGV[0]
hmmsearch_file = ARGV[1]
hash_id = {}
File.open(hmmsearch_file) do |ios|
@gisleDK
gisleDK / process_mcl.rb
Created June 15, 2018 08:29
Script for organizing clusters from mcl Markov Clustering
#!/bin/ruby
# Script for organizing clusters from mcl
# Written by Gisle Vestergaard
# gisle@bio.ku.dk
ARGF.each_with_index do |line, count|
plus = count+1
fields = line.chomp.split("\t")
puts plus.to_s + "\t" + line
end
@gisleDK
gisleDK / merge_runs.rb
Last active January 9, 2018 09:49
Merging several DADA2 denoised run feature tables and representative sequences for Qiime version 2017.12
#!/usr/bin/env ruby
# Qiime2: Merging several DADA2 denoised run feature tables and representative sequences for Qiime version 2017.12
# Feature tables must be name *_table.qza and representative sequences *_rep-seqs.qza
# V1.00 Written by Gisle Vestergaard
require 'pp'
require 'optparse'
tables = []
seqs = []
@gisleDK
gisleDK / merge_runs.zsh
Last active May 29, 2018 13:00
Qiime2: Merging several DADA2 denoised run feature tables and representative sequences for Qiime version 2017.7
#!/bin/zsh
# Script for merging multiple DADA2 denoised Illumina runs
# Provide file "runs.list" containing list of runs
# File names has to be <run>_table.qza and <run>_rep-seqs.qza
# Written by Gisle Vestergaard
# Email: gisle@bio.ku.dk
zmodload zsh/mapfile
runs=( ${(f)mapfile[$PWD/runs.list]} )
@gisleDK
gisleDK / test.fastq
Created May 11, 2017 08:20
test.fastq
m54141_170208_104237/9699895/ccs
CTCAGGGTTGCAGCGAAATTTCAAGCGCCACTTCCGGGAATCTCAGAGCGATCTCGAAAGGAAGGGTACTTGTCTCTTAAGTCGCGGTAGGCCTTCATATAGCGACCTGCCTGCGCATCATCCACACAGGCGGACGCTCCAGAACTTTATTCTGTGCCGCTTGCAATAAACGAGGAACATTCTGAGCCGATCCAACTGTCATGTAGGTTATGTCTCCATCTCAAAAAGCAACCCCTATCGCATTCTACGATTTGTGGCGTCACTATCGCTGTAGATTGGCAAAGCTGGGTTGTTATTGCTAAGACTGTTGTTGTAGATACTTCACCCCAACCTCAGGTTGGTTATCTTTAAGCATTGAGTATAAATTAGTAGTATAAATCCTTAAGAAACTTTAAAAAAGGTTTAGAATTAGTGTGGTGAGTGGGAGACTAGTATGTTTCTAAGACTTGCAGACCAACATCGTCAATTTGTTCGAGATCTAGTAATGAGCCTTCGAGCTATGGCAATTGTCCTAGAGCGTCGAGGCTATATTGCTTCTTGCTACAACACCTGCGGCAGCCAAATGAACAGCGCTTCATTCATGGCAAGCCTAGGGGATAACCATCTAATCCGGTTTCTGGTGTCAGATTATGGGATTACCTGGACTGAAATGCGGGATGATCGAGAACTGATGAAGCTGGAAGGGGCCGAGGCAATTAGCCAGCTTCAAGAGTTAGCTGACTTAGTTAAATATCACAGTGAGAGCTTTGAAGAGCAGCCAGTTCTGATAAATTAAGCCTAACGAGTTTAATTCAGAGTCACTACTTTAGGAACAGTCACGAGCCGGACTCATGTCGTAATTATCGTGAGTGATAGGGTTATAGTTGGAGATGGCCCTTAGACCCAACTGTATTTCATGACTGCTTTGATTCTGGATGGCAACATCAACGTATCGAGACAAACCGATTCGACTGCATTGTGTTACGCAGGGAGACGGTGATTTAGTTCTGCT
@gisleDK
gisleDK / cogdiamond.rb
Created April 27, 2017 13:13
Diamond output run against EGGNOG databases and give COG information
#!/usr/bin/env ruby2.2
#
# This program take output from Diamond output run against EGGNOG databases and give COG information
# V1.00 Written by Gisle Vestergaard
require 'pp'
require 'optparse'
ARGV << '-h' if ARGV.empty?
options = {}
@gisleDK
gisleDK / test.npo
Created December 22, 2016 21:55
Troubleshooting Nonpareil
# @impl: Nonpareil
# @ksize: 32644
# @version: 3.00
# @maxL: 100
# @L: 88.496
# @R: 500
# @overlap: 50.00
# @divide: 0.70
0 0.00000 0.00000 0.00000 0.00000 0.00000
2 0.00000 0.00000 0.00000 0.00000 0.00000
@gisleDK
gisleDK / flankgenbank.rb
Last active December 14, 2016 15:07
Extract flanking genes and genes of interest as well as finding co-occurrences. Extracts round-robin, as in correctly extracts flanking genes near start/stop of linear genome
#!/usr/bin/env ruby2.2
require 'pp'
require 'optparse'
ARGV << "-h" if ARGV.empty?
cmd_init = File.basename($0) + " " + ARGV.join(" ")
options = {}
@gisleDK
gisleDK / qsub_blat.rb
Created September 12, 2014 10:49
Ruby script to blat search fasta sequence using qsub
#!/usr/bin/env ruby
# qsub parametres:
# Specify Ruby
#$ -S /usr/bin/ruby
# name for job
#$ -N blat
# pe request for slots, means processors, with
# minimum to maximum.
# e.g. 2 means two processors
# 2- means 2 upto number of processor restricted by pe