CeShine Lee ceshine

## frozen_lake_q_learning.py
import gym, random

LEARNING_RATE = 0.1
DISCOUNT = 0.99

class qTable:
    """
    Implements a table tracking the estimated values
    for state action pairs in an MDP.
    """

## NIPS2016.md

      
              1 file
            
          
              1 fork
            
          
              0 comments
            
          
              12 stars
            
          
                artsobolev
                / NIPS2016.md
            
            
              Last active
              May 27, 2017 15:24
            
          
    Note: I'm updating this gist as I encounter new reviews, so make sure you're reading the latest revision!
Just as the previous year I collected (and keep doing so) links to various summaries and takeaways from this year's NIPS.

NIPS 2016 Symposium on People and machines: Public views on machine learning, and what this means for machine learning researchers. (Notes and panel discussion) by /u/gcr
NIPS 2016 summary, wrap up, and links to slides by /u/beamsearch
Post NIPS Reflections by Neil Lawrence
[Some general take aways from #NIPS2016](https://medium.com/@IgorCarron/some-general-take-aways-from-ni


## HMM.jl
module HMM

using Distributions

import Distributions.rand
import Distributions.fit

immutable HiddenMarkovModel{TP, K}
    theta::Vector{TP}
    A::Matrix{Float64}

## add_weather.py
train.visit_date = pd.to_datetime(train.visit_date)
test.visit_date = pd.to_datetime(test.visit_date)

def add_weather(dataset):
    print('Adding weather...')
    air_nearest = pd.read_csv(
        '../../data/raw/weather/air_store_info_with_nearest_active_station.csv')
    unique_air_store_ids = list(dataset.air_store_id.unique())

    weather_dir = '../../data/raw/weather/1-1-16_5-31-17_Weathe

## sample.py
#!/usr/bin/env python
# sample: Output lines from stdin to stdout with a given probability,
# for a given duration, and with a given delay between lines.
#
# Example usage: seq 100 | sample -r 20% -d 1000
#
# Dependency: Python 2.5
#
# Original Author: http://jeroenjanssens.com
# Original Script: https://github.com/jeroenjanssens/data-science-at-the-command-line/blob/master/tools/sample

## plot_loss+sample.py
#https://gist.github.com/stared/dfb4dfaf6d9a8501cd1cc8b8cb806d2e
class PlotLosses(keras.callbacks.Callback):

    def __init__(self,imgs):
        super(PlotLosses, self).__init__()
        self.imgs=imgs

    def on_train_begin(self, logs={}):
        self.i = 0
        self.x = []

## SparkR-datatable-aggr100M.txt
data.table vs SparkR

group-by aggregate on 100M records (1M groups)


data.table 6.5 sec (without key) / 1.3 sec (with key) - all 1 core

SparkR cached 200 sec (8 cores)
30x / 150x   ( 240x / 1200x per core)

## python_in_visual_studio_code.md

      
              1 file
            
          
              1 fork
            
          
              0 comments
            
          
              2 stars
            
          
                ceshine
                / python_in_visual_studio_code.md
            
            
              Last active
              August 5, 2019 09:30
            
              
                How To Develop Python Programs in Visual Studio Code
              
          
    How To Develop Python Programs in Visual Studio Code

Prerequisites

You have to already have these in your system:

Python. Version 3.6+ is highly recommended.

Official CPython from python.org
Anaconda


(Recommended) Miniconda


## spacy_sentencizer.ipynb

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                ceshine
                / spacy_sentencizer.ipynb
            
            
              Created
              August 14, 2019 04:46
            
              
                Customizing Spacy's Statistical Sentence Segmenter with Custom Rules
              
          
      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## run_tf_glue.ipynb

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              2 stars
            
          
                ceshine
                / run_tf_glue.ipynb
            
            
              Last active
              February 23, 2020 18:50
            
              
                Train huggingface/transformers BERT model on Colab CPU with TF 2.1
              
          
      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
	import gym, random

	LEARNING_RATE = 0.1
	DISCOUNT = 0.99

	class qTable:
	"""
	Implements a table tracking the estimated values
	for state action pairs in an MDP.
	"""
	module HMM

	using Distributions

	import Distributions.rand
	import Distributions.fit

	immutable HiddenMarkovModel{TP, K}
	theta::Vector{TP}
	A::Matrix{Float64}
	train.visit_date = pd.to_datetime(train.visit_date)
	test.visit_date = pd.to_datetime(test.visit_date)

	def add_weather(dataset):
	print('Adding weather...')
	air_nearest = pd.read_csv(
	'../../data/raw/weather/air_store_info_with_nearest_active_station.csv')
	unique_air_store_ids = list(dataset.air_store_id.unique())

	weather_dir = '../../data/raw/weather/1-1-16_5-31-17_Weathe
	#!/usr/bin/env python
	# sample: Output lines from stdin to stdout with a given probability,
	# for a given duration, and with a given delay between lines.
	#
	# Example usage: seq 100 \| sample -r 20% -d 1000
	#
	# Dependency: Python 2.5
	#
	# Original Author: http://jeroenjanssens.com
	# Original Script: https://github.com/jeroenjanssens/data-science-at-the-command-line/blob/master/tools/sample
	#https://gist.github.com/stared/dfb4dfaf6d9a8501cd1cc8b8cb806d2e
	class PlotLosses(keras.callbacks.Callback):

	def __init__(self,imgs):
	super(PlotLosses, self).__init__()
	self.imgs=imgs

	def on_train_begin(self, logs={}):
	self.i = 0
	self.x = []
	data.table vs SparkR

	group-by aggregate on 100M records (1M groups)


	data.table 6.5 sec (without key) / 1.3 sec (with key) - all 1 core

	SparkR cached 200 sec (8 cores)
	30x / 150x ( 240x / 1200x per core)