- scholars do not agree on definition
- scrape manifests and speeches, label by expert
- correlate populism x time in office x exit strategies
- don't evaluate countries bellow democratic threshold (outliers)
- "Populists in Europe"
- used: TF-IDF, scattertext, gensim, pyLDA
- cat and mouse problem - populists can avoid looking like populists
- python library populus for unit testing smart contracts written in Solidity
- remix.ethereum.org - Solidity IDE
- metamask.io - dApps in browser
- bunch.ai - Culture analytics
- natural time representation parser
- underthink
- overexpect
- outsource
- wire all together
- generic entity types
- prodigy annotation (and modeling) tool
- "time to first evidence" concept
- Bayes approach
- user segmentation
- new contacts -> watchdogs
- watchdogs ->buyers
- another talk last year https://www.youtube.com/watch?v=v7MBunqwBSY https://www.slideshare.net/FlorianWilhelm2/which-car-fits-my-life-pydata-berlin-2017
- Spark > Apache Hive
- Arrow eliminates data conversion by providing shared data structure with bindings
- 200x performance in piping data
- data pipelines blog https://www.inovex.de/blog/
- feature preparation pipeline: TFX, Kubeflow, TF.Transform
- Dask is better if one uses only python
- no such thing as full-stack
- only marginalizing/compromising technologies (DB design, security, etc)
- SRE (Site Reliability Engineer)
- Product Readiness Level (from NASA's TRL)
- https://pydata.org/berlin2018/schedule/presentation/16/
- delta encoding, etc.
- Be clear when communicating, no buzzwords
- Define what is desired, eg. Clustering by attributes
- Daimond.ai
- prof. Jens Dittrich on YouTube
- df.info()
- ExtensionDtype
- ExtensionArray
- user defined functions
- more native types
- efficient memory, I/O,
- acceleration with just decorator for for-loops
- jitclass for data store
- custom data types to avoid py objects
- similar: cyberpandas, geopandas
- Open Source AI conversational framework
- very nice
- just see the presentation