You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Here are some things you can do with Gists in GistBox.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
1. Written Memories: Understanding, Deriving and Extending the LSTM : http://r2rt.com/written-memories-understanding-deriving-and-extending-the-lstm.html
2. RECURRENT NEURAL NETWORKS TUTORIAL, PART 1 – INTRODUCTION TO RNNS : http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/
3. The Unreasonable Effectiveness of Recurrent Neural Networks: http://karpathy.github.io/2015/05/21/rnn-effectiveness/
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
What is the Universal Approximation Theorem?
The Universal Approximation Theorem (UAT) states that a feedforward neural network with a single hidden layer, using a suitable activation function, can approximate any continuous function defined on a compact domain (like the unit cube $[0,1]^n$) as closely as we wish, provided we have enough hidden units. In other words, such networks are universal approximators of continuous functions.
Framework for Zero-Shot Learning with Large Language Models (LLMs)
This document outlines essential prompting techniques for leveraging zero-shot learning capabilities of Large Language Models (LLMs). These methods allow you to perform a wide variety of tasks without requiring prior task-specific training data.
1. Natural Language Descriptions
Overview
Describe the task or concept in clear, natural language so the model understands what to do.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Understanding Proofs - 2 - Noether's Theorem: Symmetry, Invariances and Deep Learning
Jan 7,2025
Introduction: The Deep Connection Between Symmetry and Invariances.
In the landscape of modern science, few principles have proven as universally powerful as Emmy Noether's Theorem. Published in 1918, this remarkable insight connects symmetries in physical systems to their invariances (conservation laws). Today, over a century later, we're discovering that these same principles govern not just the physical world, but also the behavior of artificial neural networks and deep learning systems.
This comprehensive exploration will bridge the gap between classical physics and cutting-edge artificial intelligence, revealing how Noether's insights illuminate both fields. We'll begin with fundamental mathematical principles, progress through classical applications, and ultimately reveal how these same concepts manifest in modern deep learning architectures.