Skip to content

Instantly share code, notes, and snippets.

View ctb's full-sized avatar

C. Titus Brown ctb

View GitHub Profile
#! /usr/bin/env python
"""
Usage:
python -m screed db <sequencefile>
python extract-screed.py <list of names> <sequencefile>
"""
import argparse, screed, sys
def main():
@ctb
ctb / README.md
Last active April 26, 2018 20:23

subtract

Subtract signatures from each other

@ctb
ctb / README.md
Last active September 21, 2017 12:22

Simple demonstration code to load sequences, compute signature, and do a search against an SBT.

sourmash compute --scaled 10000 -f data/GCF*.fna.gz
sourmash index -k 31 test.sbt GCF*.sig

./build-and-search.py test.sbt data/GCF_000005845.2_ASM584v2_genomic.fna.gz
@ctb
ctb / README.md
Last active September 7, 2017 12:10

banded kraken - example usage

Prepare the database:

mkdir ecoli_many_sigs
cd ecoli_many_sigs
curl -O -L https://github.com/dib-lab/sourmash/raw/master/data/eschericia-sigs.tar.gz
tar xzf eschericia-sigs.tar.gz
cd ../
@ctb
ctb / .gitignore
Last active September 3, 2017 16:13
*~
*.pyc
.sbt.delmont31/
.sbt.subs1/
.sbt.subs2/
.sbt.tully31/
delmont-genome-sigs.tar.gz
delmont-genome-sigs/
delmont31.sbt.json
subs1.sbt.json
@ctb
ctb / README.md
Last active September 2, 2017 17:34

reverse indexing foo for sourmash signatures

foo!

@ctb
ctb / README.md
Created May 29, 2017 15:04
NCBI taxdump parsing by @ctb

Some scripts for parsing NCBI taxonomy information => sourmash.

@ctb
ctb / README.md
Last active April 12, 2017 16:00

multik analysis of strain variation

@ctb
ctb / README.md
Last active April 6, 2017 08:06
benchmarking RAM allocation against file load time