Skip to content

Instantly share code, notes, and snippets.

@yngtodd
yngtodd / auto-sklearn.txt
Last active July 7, 2021 23:13
Installing Auto-Sklearn and Pyrfr on Mac OSX.
# Create a conda environment
conda create -n bayes
source activate bayes
# Get a new version of gcc from anaconda
conda install gcc
# Get Auto-Sklearn and the terribly stubborn pyrfr libraries
curl https://raw.githubusercontent.com/automl/auto-sklearn/master/requirements.txt | xargs -n 1 -L 1 pip install
@debasishg
debasishg / gist:8172796
Last active May 10, 2024 13:37
A collection of links for streaming algorithms and data structures

General Background and Overview

  1. Probabilistic Data Structures for Web Analytics and Data Mining : A great overview of the space of probabilistic data structures and how they are used in approximation algorithm implementation.
  2. Models and Issues in Data Stream Systems
  3. Philippe Flajolet’s contribution to streaming algorithms : A presentation by Jérémie Lumbroso that visits some of the hostorical perspectives and how it all began with Flajolet
  4. Approximate Frequency Counts over Data Streams by Gurmeet Singh Manku & Rajeev Motwani : One of the early papers on the subject.
  5. [Methods for Finding Frequent Items in Data Streams](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.187.9800&rep=rep1&t