Skip to content

Instantly share code, notes, and snippets.

@winding-lines
Last active November 24, 2018 14:34
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save winding-lines/9f961ccab0e82b5d546ddcb36adf983d to your computer and use it in GitHub Desktop.
Save winding-lines/9f961ccab0e82b5d546ddcb36adf983d to your computer and use it in GitHub Desktop.
RL Acronyms

arxiv POMCPOW - partially observable Monte Carlo planning with observation widening PFT-DPW - particle filter trees with double progressive widening

cmu PBVI - point based value iteration

rhoPOMDP - POMDP with belief dependent rewards nips

[Somani et al. 2013] Somani, A.; Ye, N.; Hsu, D.; and Lee, W. S. 2013. DESPOT: Online POMDP planning with regularization. In Advances in Neural Information Processing Systems (NIPS), 1772–1780. DESPOT - determinized sparse partially observable tree

sarsop 2008 SARSOP - Successive Approximations of the Reachable Space under Optimal Policies

ABT - Adaptive belief tree

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment