Chunlei Wu newgene

## mygene_use_https.py
In [1]: import mygene
In [2]: mg = mygene.MyGeneInfo()

# by default, it uses http:
In [3]: mg.url
Out[3]: 'http://mygene.info/v3'

# switch to use https
In [4]: mg.url = 'https://mygene.info/v3'

## shell_aliases_flake8.sh
# Just a short-hand to type less
alias f8='flake8'

# run "hgf" anywhere in a mercurial repo, it will check all changed *.py with flake8
alias hgf='tmp_cwd=`pwd` ; cd `hg root`; hgs -nmd |grep "\.py$" |xargs flake8; cd $tmp_cwd; unset tmp_cwd'

# run "gf" anywhere in a git repo, it will check all changed *.py with flake8
alias gf='tmp_cwd=`pwd` ; cd `git rev-parse --show-toplevel`; git diff --name-only |grep "\.py$" |xargs flake8; cd $tmp_cwd; unset tmp_cwd'


## metadata_mygene_info.json
{
  "app_revision": "193:7417080ffb37",
  "available_fields": [
    "accession",
    "alias",
    "biocarta",
    "chr",
    "end",
    "ensemblgene",
    "ensemblprotein",

## gene_object_mygene_info.json
{
    "_id": "1017",
    "_timestamp": "2014-08-25T00:00:00",
    "accession": {
        "genomic": [
            "ABBA01008397",
            "AC025162",
            "AC034102",
            "AC_000144",
            "AF512553",

## merged_variant_json_doc
{
    "_id": "15:g.33905410A>G",
    "mutdb": {
        "chromEnd": 33905410,
        "dbsnp_id": "rs2229116",
        "allele2": "G",
        "uniprot_id": "VAR_011405",
        "allele1": "A",
        "mutpred_score": 0.384,
        "cosmic_id": null,

## mygene_gene_object_mapping.json
{
  "properties": {
    "AnimalQTLdb": {
      "type": "string",
      "index": "no",
      "include_in_all": false
    },
    "FLYBASE": {
      "type": "string",
      "index_name": "flybase",

## 1_readme.md

      
              2 files
            
          
              1 fork
            
          
              0 comments
            
          
              0 stars
            
          
                newgene
                / 1_readme.md
            
            
              Last active
              August 29, 2015 13:57
            
              
                Clonify_contest.md
              
          
    Readme for this contest

General context of this contest

Human immune system produces a vast variety of antibodies in order to respond to the external stimuli. Next-generation sequencing technology allows researchers to obtain the sequences of all antibodies from a single person. Clustering these antibody sequences allows us to understand how an antibody is produced. However, the number of antibody sequences from a single sample can be up to 1 million scale. Clustering with such a big scale poses a big computation challenge.
Current algorithm

The current algorithm for clustering antibody sequences computes a pairwise distance matrix, and then perform a hierarchical clustering to group sequences into clusters. This algorithm is implemented in Python as provided clonify_contest.py script.

  
## id_mapping_mygene.ipynb

      
              1 file
            
          
              1 fork
            
          
              0 comments
            
          
              0 stars
            
          
                newgene
                / id_mapping_mygene.ipynb
            
            
              Last active
              November 16, 2020 16:04
            
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
	In [1]: import mygene
	In [2]: mg = mygene.MyGeneInfo()

	# by default, it uses http:
	In [3]: mg.url
	Out[3]: 'http://mygene.info/v3'

	# switch to use https
	In [4]: mg.url = 'https://mygene.info/v3'
	# Just a short-hand to type less
	alias f8='flake8'

	# run "hgf" anywhere in a mercurial repo, it will check all changed *.py with flake8
	alias hgf='tmp_cwd=`pwd` ; cd `hg root`; hgs -nmd \|grep "\.py$" \|xargs flake8; cd $tmp_cwd; unset tmp_cwd'

	# run "gf" anywhere in a git repo, it will check all changed *.py with flake8
	alias gf='tmp_cwd=`pwd` ; cd `git rev-parse --show-toplevel`; git diff --name-only \|grep "\.py$" \|xargs flake8; cd $tmp_cwd; unset tmp_cwd'
	{
	"app_revision": "193:7417080ffb37",
	"available_fields": [
	"accession",
	"alias",
	"biocarta",
	"chr",
	"end",
	"ensemblgene",
	"ensemblprotein",
	{
	"_id": "1017",
	"_timestamp": "2014-08-25T00:00:00",
	"accession": {
	"genomic": [
	"ABBA01008397",
	"AC025162",
	"AC034102",
	"AC_000144",
	"AF512553",
	{
	"_id": "15:g.33905410A>G",
	"mutdb": {
	"chromEnd": 33905410,
	"dbsnp_id": "rs2229116",
	"allele2": "G",
	"uniprot_id": "VAR_011405",
	"allele1": "A",
	"mutpred_score": 0.384,
	"cosmic_id": null,
	{
	"properties": {
	"AnimalQTLdb": {
	"type": "string",
	"index": "no",
	"include_in_all": false
	},
	"FLYBASE": {
	"type": "string",
	"index_name": "flybase",