[HN Gopher] Mathematical Foundations of Reinforcement Learning
___________________________________________________________________
Mathematical Foundations of Reinforcement Learning
Author : ibobev
Score : 122 points
Date : 2025-03-10 18:27 UTC (4 hours ago)
(HTM) web link (github.com)
| dualofdual wrote:
| The best lectures on Reinforcement Learning and related topics
| are by Dimitris Bertsekas:
| https://web.mit.edu/dimitrib/www/home.html
| rybthrow2 wrote:
| The lectures by David Silver of DeepMind, of AlphaGo fame, are good
| too: https://www.youtube.com/watch?v=2pWv7GOvuf0
| esafak wrote:
| His books tend to be dry and geared towards researchers, in my
| opinion. He has a new one on RL:
| https://web.mit.edu/dimitrib/www/RLCOURSECOMPLETE%202ndEDITI...
| joe_lin wrote:
| I'm looking for content (researcher myself) -- mainly on the
| application side. Should I start with this one? Or anything
| else?
|
| Very curious about RL for LLMs for example (using data from
| real use).
| esafak wrote:
| I have not read it but it looks like a comprehensive
| reference. For a more applied treatment see _Foundations of
| Deep Reinforcement Learning_:
| https://slm-lab.gitbook.io/slm-lab/publications-and-talks/in...
|
| Neither covers LLMs. I don't follow the literature closely,
| so I can only suggest you read papers:
| https://github.com/WindyLab/LLM-RL-Papers
| forkerenok wrote:
| Would you mind explicitly indicating whether you have reviewed
| the submitted materials? And if so, why is it inferior to the
| material you linked?
|
| Not trying to catch you, genuine interest.
| richard___ wrote:
| No. They are outdated and focused on strange things. You won't
| understand PPO from his textbooks.
| lemonlym wrote:
| Another great resource on RL is Mykel Kochenderfer's suite of
| textbooks: https://algorithmsbook.com/
| noobly wrote:
| Are these books all about RL? I've got the decision one; I didn't
| think the others had anything to do with RL.
| kristjansson wrote:
| Also worth mentioning Murphy's WIP textbook[0] focused entirely
| on RL, which is an outgrowth of his excellent ML textbooks.
|
| [0]: https://arxiv.org/abs/2412.05265
| ivanbelenky wrote:
| Awesome resource. In case someone is interested, I implemented
| most of Sutton's book here: https://github.com/ivanbelenky/RL
___________________________________________________________________
(page generated 2025-03-10 23:00 UTC)