Rebecca Bilbro rebeccabilbro

## DDRL_EntResLab.ipynb

      
            
            
Last active February 22, 2016 01:22
            
          
      
    
## top-data-science-questions.md

      
            
            
Created February 25, 2016 22:32

Initial outline for Brittne's blog post
              
          
Title: The top n questions data scientists ask

Introduction
- Data science doesn’t start with data; it starts with a problem…
- The pipeline model is useful, but data scientists progress through a series of questions. What are those questions?

Scoping


## nist_clustering.ipynb

      
            
            
Created April 23, 2016 17:51
            
          
      
    
## hullclass.ipynb

      
            
            
Created September 27, 2016 16:13
            
          
      
    
## elastic_indexer.py
from elasticsearch.helpers import bulk
from elasticsearch import Elasticsearch

class ElasticIndexer(object):
    """
    Create an ElasticSearch instance, and given a list of documents,
    index the documents into ElasticSearch.
    """
    def __init__(self):
        self.elastic_search = Elasticsearch()
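
The preview stops before the indexing method, so the following is only a minimal sketch of how a list of documents might be turned into actions for `elasticsearch.helpers.bulk`. The helper names `make_actions` and `index_documents`, the default index name `"documents"`, and the use of the list position as `_id` are assumptions for illustration, not part of the original gist.

```python
def make_actions(documents, index="documents"):
    """Yield one bulk action dict per document (index name is hypothetical)."""
    for doc_id, doc in enumerate(documents):
        yield {
            "_index": index,   # target index for this document
            "_id": doc_id,     # assumption: use the list position as the id
            "_source": doc,    # the document body itself
        }


def index_documents(client, documents, index="documents"):
    """Send all documents to Elasticsearch in a single bulk request."""
    # Imported lazily so make_actions can be used without the client installed.
    from elasticsearch.helpers import bulk

    # bulk() returns a tuple of (number of successes, list of errors).
    return bulk(client, make_actions(documents, index=index))
```

With a running cluster, `index_documents(Elasticsearch(), docs)` would be the call site; `make_actions` can be inspected on its own to check the payload before anything is sent.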

## get_hobbies.py
import os

from sklearn.utils import Bunch  # sklearn.datasets.base was removed in newer scikit-learn releases
from yellowbrick.download import download_all

# The path to the test data sets
FIXTURES = os.path.join(os.getcwd(), "data")

# Dataset loading mechanisms
datasets = {

## doctor.go
package main

import (
    "fmt"
    "log"

    "github.com/shirou/gopsutil/mem"
    "github.com/shirou/gopsutil/cpu"
    "github.com/shirou/gopsutil/disk"
    "github.com/shirou/gopsutil/host"

## get_walking_data.py
import os
import zipfile
import requests
import pandas as pd

WALKING_DATASET = (
    "https://archive.ics.uci.edu/ml/machine-learning-databases/00286/User%20Identification%20From%20Walking%20Activity.zip",
)

def download_data(path='data', urls=WALKING_DATASET):
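
The body of `download_data` is cut off in the preview. One plausible shape, given the imports above, is to fetch each URL with `requests` and unpack the returned zip archive into `path`. The sketch below assumes exactly that; the `extract_archive` helper, the 30-second timeout, and the returned list of member names are illustrative choices rather than the author's code.

```python
import io
import os
import zipfile

WALKING_DATASET = (
    "https://archive.ics.uci.edu/ml/machine-learning-databases/00286/User%20Identification%20From%20Walking%20Activity.zip",
)


def extract_archive(content, path="data"):
    """Unpack an in-memory zip archive into `path` and return its member names."""
    os.makedirs(path, exist_ok=True)
    with zipfile.ZipFile(io.BytesIO(content)) as archive:
        archive.extractall(path)
        return archive.namelist()


def download_data(path="data", urls=WALKING_DATASET):
    """Fetch each URL and unpack the zip archive it returns."""
    # Imported here so extract_archive can be used without requests installed.
    import requests

    extracted = []
    for url in urls:
        response = requests.get(url, timeout=30)  # timeout value is an assumption
        response.raise_for_status()               # fail loudly on HTTP errors
        extracted.extend(extract_archive(response.content, path))
    return extracted
```

Keeping the extraction step separate from the network call makes it possible to check the unpacking logic against a locally built archive.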

## kimchi.py
# kimchi.py
# For converting Python 2 pickles to Python 3

import os
import dill
import pickle
import argparse


def convert(old_pkl):
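
The body of `convert` is not shown. A common way to carry a Python 2 pickle over to Python 3 is to load it with `encoding="latin1"`, so that Python 2 byte strings decode without errors, and then write it back out with a current protocol. The sketch below uses only the standard `pickle` module (the original also imports `dill`, which is not used here); the function name `convert_pickle` and the `_py3.pkl` suffix are assumptions.

```python
import os
import pickle


def convert_pickle(old_pkl, encoding="latin1"):
    """Re-save a Python 2 pickle so Python 3 can load it without extra arguments.

    Returns the path of the new file (naming scheme is hypothetical).
    """
    new_pkl = os.path.splitext(old_pkl)[0] + "_py3.pkl"

    # latin1 maps every byte to a code point, so 8-bit Python 2 strings
    # always decode, although non-Latin-1 text may need another encoding.
    with open(old_pkl, "rb") as infile:
        obj = pickle.load(infile, encoding=encoding)

    with open(new_pkl, "wb") as outfile:
        pickle.dump(obj, outfile, protocol=pickle.HIGHEST_PROTOCOL)

    return new_pkl
```

Objects that reference classes or modules that were renamed between Python 2 and 3 may still fail to load; this sketch only addresses the string-encoding issue.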

## classifier_comparison.py
#!/usr/bin/python
# -*- coding: utf-8 -*-
# plot_classifier_comparison.py
"""
A comparison of several classifiers in scikit-learn on synthetic datasets.
The point of this example is to illustrate the nature of decision boundaries
of different classifiers.

Particularly in high-dimensional spaces, data can more easily be separated
linearly and the simplicity of classifiers such as naive Bayes and linear SVMs