Skip to content

Instantly share code, notes, and snippets.

@liopic
Created October 14, 2019 21:14
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save liopic/bf1d068791e8681826d9b7a3ea2327a1 to your computer and use it in GitHub Desktop.
Save liopic/bf1d068791e8681826d9b7a3ea2327a1 to your computer and use it in GitHub Desktop.
PyCon DE & PyData Berlin 2019

PyCon DE & PyData Berlin, notes

Lots of lots of people!

Algo.Rules - Ethics in code

  • List of rules

Airflow for beginners

  • Operator (worker), DAG (instructions), Task (job), Connection (credentials), Hooks (common interfaces to external services, Slack Hook), Variables (envs), XComs (small messages between Tasks)
  • github.com/karpenkovarya/airflow_for_beginners

Practical DevOps

  • Focus on a ML workflos/Pipeline
  • MLOps: source control, create enviroments, use pipelines, CI+CD
  • Work with docker images
  • ml.azure.com
  • bit.ly/PyConDE-mlops

James Powell TBC

  • meta-programming
  • DRY or punish your coworkers
  • Why not repeat yourself
  • decorators
  • using classes hierachy
  • metaclass
  • init_subclass
  • template = dedent(), exec(template, var)

Why you should not train your own BERT model

Deployment oriented mindset for building ML models

  • research enviroment (jupyter) vs development env
    • results inconstintency, features inc...
  • pay attention: feat calculation
  • you can limit data sources
  • features: calculation, test business values

kendro

  • Use tags to define parts of the process: feature retrieving, engineering, etc
  • Notebook(Exploration) -> script
  • Looking for data column lineage

python 2020

  • Languages fight in different uses/platforms
  • Compiler to webassembly?

control experiments

  • Gradient (saas, CI addon to check experiments)
  • sacred (defined in code) + omniboard
  • mlflow (in code) enforces local env,
  • dvc

Data Literacy for managers

your specs are broken

  • Swagger 2.0
  • bravado* and pyramid-swagger

skorch

  • paralell with dask
  • save state
  • feature names

scikit-learn

  • ColumnTransformer
  • permutation_importance

Reinforcement learning

  • Real tests are costly

interpretable machine learning

  • eli5
  • partial dependent plots
  • shap

event sourcing

  • Tech debt factory

lightning

  • Pyqtgraph
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment