A. Murat Eren (Meren) meren
meren /
Created August 11, 2020 17:27
Get gene calls and their sequences from an anvi'o contigs database
from anvio.dbops import ContigsSuperclass
# if your args object already contains a `contigs_db` entry in its
# namespace, you don't need the following two lines and
# you can pass it directly to the ContigsSuperclass.
import argparse
args = argparse.Namespace(contigs_db="INFANT-GUT-TUTORIAL/SPLITAH/E_facealis/CONTIGS.db")
# get an instance of the contigs super:
contigs_db = ContigsSuperclass(args)
meren /
Last active July 21, 2020 12:58
An update to Antti Karkman's GFF parser. Find the latest version here:
#!/usr/bin/env python
## Antti Karkman
## University of Gothenburg
## 2017
import gffutils
import argparse
#!/usr/bin/env python
# lousy script that works with the outputs of `anvi-export-collection`
# to reconstruct the fate of contigs for a given algorithm and bin and
# spit out some text to be visualized on
import sys
from collections import OrderedDict
# -*- coding: utf-8 -*-
# pylint: disable=line-too-long
# v.140713
"""A very lightweight FASTA I/O library"""
import io
import sys
import gzip
import numpy
import hashlib
#!/usr/bin/env python
# Click 'Download > Multiple-file JSON' from NCBI search results page,
# unzip it, run this script in it without any parameters, get the
# markdown formatted table.
import json
import glob
# poor man's whatever:
FROM ubuntu:xenial
ENV DEBIAN_FRONTEND noninteractive
RUN apt-key adv --keyserver --recv-keys E084DAB9
RUN apt-get update
RUN apt-get install locales
RUN locale-gen en_US.UTF-8
>>> import itertools
>>> n = 21
>>> (n * n - n) / 2
>>> l = ['Cys', 'Asp', 'Ser', 'Gln', 'Lys', 'Trp', 'Pro', 'Thr', 'Ile', 'Ala', 'Phe', 'Gly', 'His', 'STP', 'Leu', 'Arg', 'Met', 'Glu', 'Asn', 'Tyr', 'Val']
>>> len(l)
>>> len(set(sorted([''.join(sorted([t[0], t[1]])) for t in list(itertools.product(l, l)) if t[0] != t[1]])))
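The session above counts unordered pairs of distinct items among 21 labels in two ways: with the closed form (n * n - n) / 2 and by enumerating `itertools.product`. As a minimal sketch (not part of the original gist), the same count can be cross-checked with `itertools.combinations`, which yields each unordered pair exactly once. The variable names below are illustrative.

```python
import itertools

# The same 21 labels used in the session above (20 amino acids plus STP).
labels = ['Cys', 'Asp', 'Ser', 'Gln', 'Lys', 'Trp', 'Pro', 'Thr', 'Ile', 'Ala',
          'Phe', 'Gly', 'His', 'STP', 'Leu', 'Arg', 'Met', 'Glu', 'Asn', 'Tyr', 'Val']

n = len(labels)

# Closed form: number of unordered pairs of distinct items.
closed_form = (n * n - n) // 2

# Direct enumeration: combinations() never repeats a pair in reverse order.
enumerated = len(list(itertools.combinations(labels, 2)))

# The product-based approach from the session: build all ordered pairs,
# drop self-pairs, then collapse (a, b) and (b, a) by sorting before joining.
via_product = len({''.join(sorted(t))
                   for t in itertools.product(labels, labels)
                   if t[0] != t[1]})

print(n, closed_form, enumerated, via_product)
```

All three approaches should agree; `combinations` simply avoids building the full 21 x 21 product first.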
meren /
Last active June 19, 2020 23:57
Generate a report for primer matching statistics (here you will find the Python program, two example input files (nifH gene primers and full-length nifH genes), and an example terminal output). You will need to have anvi'o installed on your system to run it. For questions: meren at or tdelmont at
# run this script like this:
# python primers.txt sequences.txt
# it will generate a report file called `primer_matches_report.txt`
# example primers file (TAB-delimited, no spaces):
# name pair direction sequence
meren / external-genomes.txt
Last active June 8, 2017 00:01
Phylogenomic tutorial helper files
name contigs_db_path
Bacteroides_fragilis_2334 Bacteroides_fragilis_2334.db
Bacteroides_fragilis_2346 Bacteroides_fragilis_2346.db
Bacteroides_fragilis_2347 Bacteroides_fragilis_2347.db
Escherichia_albertii_6917 Escherichia_albertii_6917.db
Escherichia_coli_6920 Escherichia_coli_6920.db
Escherichia_coli_9038 Escherichia_coli_9038.db
Prevotella_dentalis_19591 Prevotella_dentalis_19591.db
Prevotella_denticola_19594 Prevotella_denticola_19594.db
Prevotella_intermedia_19600 Prevotella_intermedia_19600.db
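A file like the one above can be read into a name-to-path mapping with the standard library. This is a minimal sketch, assuming the file is TAB-delimited with the header columns `name` and `contigs_db_path` as shown; the function name `read_external_genomes` is hypothetical and not part of anvi'o.

```python
import csv
import io


def read_external_genomes(handle):
    """Return {name: contigs_db_path} from a TAB-delimited file handle.

    Assumes a header line with the columns `name` and `contigs_db_path`.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    return {row["name"]: row["contigs_db_path"] for row in reader}


# Two rows taken from the listing above, written out with explicit TABs.
example = (
    "name\tcontigs_db_path\n"
    "Bacteroides_fragilis_2334\tBacteroides_fragilis_2334.db\n"
    "Escherichia_coli_6920\tEscherichia_coli_6920.db\n"
)

genomes = read_external_genomes(io.StringIO(example))
for name, path in genomes.items():
    print(name, path)
```

For a file on disk, the same function can be called as `read_external_genomes(open("external-genomes.txt", newline=""))`.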