Skip to content

Instantly share code, notes, and snippets.

@Luke-SNAW
Created October 31, 2021 23:49
Show Gist options
  • Save Luke-SNAW/789602d71953c4da17de99f1d40ddd8e to your computer and use it in GitHub Desktop.
Save Luke-SNAW/789602d71953c4da17de99f1d40ddd8e to your computer and use it in GitHub Desktop.
[An Architecture for Life] You have three functions. MuZero calls them h (representation), g (dynamics), and f (prediction). #bio #ML

https://geohot.github.io//blog/jekyll/update/2021/10/29/an-architecture-for-life.html

A representation function takes in the sensor data and transforms the manifold to something more learnable, referred to as the “hidden state”. It may discard non task relevant information. This can be a VAE, GAN, pretrained ImageNet, custom trained conv net, or in the case of GPT-3, handcoded as BPE.

A dynamics function deals with long term temporal correspondences. It functions as both a summarizer of the past and a predictor of the future, in the same way a Kalman filter does. This can be a couple dense layers, a transformer, or an RNN/GRU/LSTM. I guess it even could be a Kalman filter itself.

A prediction function tells you how to act in a state. It can do this by directly outputting an action or by outputting a value of this state allowing you to search.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment