Skip to content

Instantly share code, notes, and snippets.

View walterreade's full-sized avatar

Walter Reade walterreade

View GitHub Profile

Walter Reade Publications

Peer Reviewed

  • Patton, P. T., et al., (2023). A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species. Methods in Ecology and Evolution, 00, 1– 15. [Open Access]
  • Wayment-Steele, H.K., Kladwang, W., Watkins, A.M. et al. "Deep learning models for predicting RNA degradation via dual crowdsourcing" Nat Mach Intell 4, 1174–1184 (2022). [Abstract] [pre-print]
  • Cheeseman, T., Southerland, K., Park, J. et al., "Advanced image recognition: a fully automated, high-accuracy photo-identification matching system for humpback whales", Mammalian Biology (2021) [Abstract]
  • Bratholm, L.A, et al., "A community-powered search of machine learning strategy space to find NMR property prediction models," PLOS ONE, July 20, 2021,
@walterreade
walterreade / virtualenv.txt
Last active October 9, 2022 06:41
Setting up standard virtualenv
### for brand-new only
sudo apt-get update
sudo apt-get install htop
sudo apt-get install build-essential
wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh
rm Miniconda3-latest-Linux-x86_64.sh
@walterreade
walterreade / xgb_hyperopt.py
Created January 16, 2016 23:02
XGBoost Hyperopt Gridsearch
# http://www.dataiku.com/blog/2015/08/24/xgboost_and_dss.html
import dataiku
import pandas as pd, numpy as np
from dataiku import pandasutils as pdu
from sklearn.metrics import roc_auc_score
import xgboost as xgb
from hyperopt import hp, fmin, tpe, STATUS_OK, Trials
train = dataiku.Dataset("train").get_dataframe()
@walterreade
walterreade / xgb_aws.txt
Created September 8, 2015 15:31
XGBoost on AWS
sudo apt-get install make
sudo apt-get update
sudo apt-get install gcc
sudo apt-get install g++
sudo apt-get install git
sudo git clone https://github.com/dmlc/xgboost
cd xgboost
./build.sh
cd python-package
python setup.py install
@walterreade
walterreade / YeoJohnson.py
Created June 9, 2017 14:52 — forked from mesgarpour/YeoJohnson.py
Yeo-Johnson Transformation
#!/usr/bin/env python
# -*- coding: UTF-8 -*-
import warnings
import numpy as np
import pandas as pd
import sys
__author__ = "Mohsen Mesgarpour"
__copyright__ = "Copyright 2016, https://github.com/mesgarpour"
__credits__ = ["Mohsen Mesgarpour"]
@walterreade
walterreade / elastic_transform.py
Created July 6, 2016 13:52 — forked from fmder/elastic_transform.py
Elastic transformation of an image in Python
import numpy
from scipy.ndimage.interpolation import map_coordinates
from scipy.ndimage.filters import gaussian_filter
def elastic_transform(image, alpha, sigma, random_state=None):
"""Elastic deformation of images as described in [Simard2003]_.
.. [Simard2003] Simard, Steinkraus and Platt, "Best Practices for
Convolutional Neural Networks applied to Visual Document Analysis", in
'''This script goes along the blog post
"Building powerful image classification models using very little data"
from blog.keras.io.
It uses data that can be downloaded at:
https://www.kaggle.com/c/dogs-vs-cats/data
In our setup, we:
- created a data/ folder
- created train/ and validation/ subfolders inside data/
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
# -*- coding: utf-8 -*-
"""
Regularized Tree Ensemble
@author: gert.jacobusse@rogatio.nl
@license: FreeBSD
Originally posted:
https://www.kaggle.com/c/bnp-paribas-cardif-claims-management/forums/t/20207/why-every-good-script-is-using-extratreeclassifier-one-way-or-the-other/115621