@cedrickchee
Created June 11, 2018 15:32
"Building the Software 2.0 Stack" talk: https://vimeo.com/272696002

My Notes for the Talk

A talk by Andrej Karpathy at the Train AI 2018 conference (machine learning for a human world).

Training Datasets

The part on building and managing datasets is especially interesting. We don't often get to hear about these problems.

Software 2.0 Integrated Development Environments (IDEs)

What will IDEs, including code editors, look like?

  • Show a full inventory or statistics of the current dataset.
  • Create or edit annotation layers for any datapoint.
  • Flag, escalate, and resolve discrepancies between multiple labels.
  • Flag and escalate datapoints that are likely to be mislabeled.
  • Display predictions on an arbitrary set of test datapoints.
  • Autosuggest datapoints that should be labeled.
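
To make a few of these features concrete, here is a minimal Python sketch of three of them: a dataset inventory, flagging datapoints whose labels disagree, and suggesting what to label next. None of this comes from the talk. The function names, the dictionary layout, and the sample data are all hypothetical, and ranking by predictive entropy (uncertainty sampling) is just one plausible way to pick datapoints to label.

```python
import math
from collections import Counter


def dataset_inventory(annotations):
    """Count how often each label appears across all datapoints."""
    return Counter(label for labels in annotations.values() for label in labels)


def flag_disagreements(annotations):
    """Return the ids of datapoints whose annotators did not all agree."""
    return sorted(dp for dp, labels in annotations.items() if len(set(labels)) > 1)


def suggest_for_labeling(predictions, k=1):
    """Return the k datapoint ids with the highest predictive entropy.

    Assumption: `predictions` maps an unlabeled datapoint id to a list of
    class probabilities produced by the current model.
    """
    def entropy(probs):
        return -sum(p * math.log(p) for p in probs if p > 0)

    ranked = sorted(predictions, key=lambda dp: entropy(predictions[dp]), reverse=True)
    return ranked[:k]


# Hypothetical data: two annotators per image, plus model outputs on unlabeled images.
annotations = {
    "img_001": ["car", "car"],
    "img_002": ["car", "truck"],
    "img_003": ["pedestrian", "pedestrian"],
}
predictions = {
    "img_101": [0.98, 0.01, 0.01],
    "img_102": [0.40, 0.35, 0.25],
}

print(dataset_inventory(annotations))
print(flag_disagreements(annotations))
print(suggest_for_labeling(predictions, k=1))
```

In this sketch, `img_002` is flagged because its two labels differ, and `img_102` is suggested for labeling because the model's prediction for it is the least certain.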