Skip to content

Instantly share code, notes, and snippets.

View nylander's full-sized avatar

Johan Nylander nylander

View GitHub Profile
@nylander
nylander / db.fas
Last active December 3, 2020 09:23
Example files for blast
>PIS68081.1 anaerobic sulfatase maturase [Vibrio parahaemolyticus 1911C]
MQITQSPQFNGKASKRLHVMAKPIGAACNIDCTYCYYLSKQDLLEYKKGCSPMMDEATLEAYIKQYIEGQ
NTPEIVFSWQGGEPTMLGLDYFKKIVEFQAKYQPEGVKISNDLQTNGTLLNDEWCEFLSENHFLVGLSID
GPEMLHNAYRTNRAGRGTFKQVMNAVELLHKHQVNFATLTCVNNLTSRNPLEVYRFLRDEVRSPQMQFIP
IVEQKTFRTTAPQKWQPQEQLKQGDKRLIPGHKDSVMEPWCVADEAWGNFLIAVFDEWIKNDIGKVFVQY
FEASVEAWMGRANPLCTLGEVCGKGLAMEPNGDVFACDHYVYPEYKIGNIHHDKLDELAYSKEQQKFGFA
KSRLLTQQCRDCEYQFACYGECPKNRFIRTRDGEPGLNYLCAGWKKFFAHADKAMAYILRATGNVVAHGK
YSDSVVRQESEQRPVFETKF
>PIS68069.1 anaerobic sulfatase maturase [Vibrio parahaemolyticus 1911C]
MKDAYVRRFHVMAKPGGAKCNIDCQYCFYLHKENLLHQEKQPKMDDATLEAFVKSYIESQDGEEIVFSWQ
@nylander
nylander / acc2sci
Created November 21, 2017 08:12
Get taxonomy from genbank accessions
#!/bin/bash
## Fetch scientific name for Genbank accession number
## Version: Mon 20 nov 2017 13:35:49
## By: {Andreas.Kahari,Johan.Nylander}@nbis.se
## Usage: ./acc2sci.sh infile-with-accessions
## Notes: Reads from file or STDIN, prints to STDOUT.
## No detailed error checking! If any of the Accessions
## provided as queries are incorrect, the script will
## exit with msg: "Failure of post to find data to load.."
@nylander
nylander / extract_masked.pl
Created February 13, 2017 09:04
Extract masked positions in fasta file. Question on [biosupport.se](https://biosupport.se/p/327/)
#!/usr/bin/env perl
use strict;
use warnings;
=pod
=head1
FILE: extract_masked.pl
@nylander
nylander / gist:de0fe4f009314c1272523eb11a179910
Last active February 13, 2017 09:06
Pitfall in get_taxonids(), Bio::DB::Taxonomy
#!/usr/bin/perl
#===============================================================================
=pod
=head1
FILE: cgt.pl
USAGE: ./cgt.pl
DESCRIPTION: The intention of the original code was to search genbank