Taylor Reiter taylorreiter

## download_ncbi_fungi.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                taylorreiter
                / download_ncbi_fungi.md
            
            
              Created
              October 20, 2017 20:22
            
              
                Download NCBI fungal genomes from refseq and genbank
              
          
    Create a virtual environment and install ncbi-genome-download
cd ~
python2.7 -m virtualenv ncbi-genome-downloadEnv
source ncbi-genome-downloadEnv/bin/activate
cd ncbi-genome-downloadEnv
git clone https://github.com/kblin/ncbi-genome-download.git
cd ncbi-genome-download
pip install .


## new-repo-from-prev.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                taylorreiter
                / new-repo-from-prev.md
            
            
              Created
              September 1, 2017 20:38
            
              
                code used to create new repo from previous
              
          
    An example of how to create a new git repository from a previous git repository
git clone https://github.com/taylorreiter/IGI_preproposal.git
cd IGI_preproposal/computational_pilot
git filter-branch --subdirectory-filter computational_pilot -- --all
# renamed folder cultivar_seq
git init
git add .
git commit -m "First commit"
git remote add origin https://github.com/taylorreiter/cultivar_seq.git


## mounting_ec2_volume.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                taylorreiter
                / mounting_ec2_volume.md
            
            
              Created
              April 22, 2017 23:05
            
          
    Make a new volume from the EC2 GUI.
Format the new volume
sudo mkfs -t ext4 /dev/xvdg

Mount the new volume
sudo mount /dev/xvdg /mnt2/


## blog-setup.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                taylorreiter
                / blog-setup.md
            
            
              Last active
              April 25, 2017 03:37
            
              
                Setup instructions for blog
              
          
    Followed from
https://www.dataquest.io/blog/how-to-setup-a-data-science-blog/
Set things up
cd ~/Desktop
mkdir jupyter-blog
cd jupyter-blog
nano .gitignore
# add contents from this file https://github.com/github/gitignore/blob/master/Python.gitignore


## kraken_mircea.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                taylorreiter
                / kraken_mircea.md
            
            
              Last active
              April 21, 2017 16:04
            
              
                kraken_mircea.md
              
          
    Kraken is broken something something NCBI numbers something something. Use perl scripts that supposedly dealt with the issue
(note that I was able to get the fungal one to work with the same loop etc, where the only difference was that only fungi was included)
http://www.opiniomics.org/building-a-kraken-database-with-new-ftp-structure-and-no-gi-numbers/
As of September 2016, someone commented that this method works, but something went wrong for me.
Ran on r4.8xlarge.
Get the sequences (note the script filters for complete genomes)

  
## kaiju_mircea.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                taylorreiter
                / kaiju_mircea.md
            
            
              Last active
              May 22, 2017 01:50
            
          
    Downloaded the trimmed, abundance filtered reads for the mircea data set. Ran kaiju as indicated below, using databases built here (https://gist.github.com/taylorreiter/2511b0c6e904b455e7002742c5da1492).
kaijudb -e

~/kaiju/bin/kaiju -t ~/kaijudb_e/nodes.dmp -f ~/kaijudb_e/kaiju_db_nr_euk.fmi -i SRR606249.pe.qc.fq.gz.abundtrim -v -o kaijudb_e_SRR606249.pe.qc.fq.gz.abundtrim.out

Then added taxonomy to kaiju names
~/kaiju/bin/addTaxonNames -t ~/kaijudb_e/nodes.dmp -n ~/kaijudb_e/names.dmp -i kaijudb_e_SRR606249.pe.qc.fq.gz.abundtrim.out -o kaijudb_e_SRR606249.pe.qc.fq.gz.abundtrim.out.names


## kaijudb_builds.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                taylorreiter
                / kaijudb_builds.md
            
            
              Last active
              May 21, 2017 22:03
            
          
    Download Kaiju software
cd ~
git clone https://github.com/bioinformatics-centre/kaiju.git
cd kaiju/src
make

From kaiju github:

  
## sourmash_plotting_twiddly_output
# Code to visualize numpy output from sourmash compare
# NB output was saved to csv file from python using the command:
          # sourmash compare -o yeast_syrah_L2 ./*sig  # completed in terminal
          # import numpy
          # from sourmash_lib import fig
          # import pylab
          # D, labels = fig.load_matrix_and_labels('yeast_syrah_L2')
          # D = (D + D.T) / 2.0
          # numpy.savetxt("L2_numpy.csv", D, delimiter=",")
# Orrrrrr
	# Code to visualize numpy output from sourmash compare
	# NB output was saved to csv file from python using the command:
	# sourmash compare -o yeast_syrah_L2 ./*sig # completed in terminal
	# import numpy
	# from sourmash_lib import fig
	# import pylab
	# D, labels = fig.load_matrix_and_labels('yeast_syrah_L2')
	# D = (D + D.T) / 2.0
	# numpy.savetxt("L2_numpy.csv", D, delimiter=",")
	# Orrrrrr