Decade* of DRL
Inspired by the wonderful blog post “The Decade of Deep Learning” by Leo Gao, I wrote one about Deep Reinforcement Learning.
One landmark paper per year:
- 2013 — DQN
- 2014 — Deterministic policy gradient (DPG)
- 2015 — DDPG
- 2016 — AlphaGo
- 2017 — PPO
- 2018 — SAC
- 2019 — Dreamer
- 2020 — CURL
- 2021 — Decision Transformer
- 2022 — InstructGPT (RLHF)
- 2023 — TD-MPC2
- 2024 — AlphaProof
- 2025 — DeepSeek-R1
You can read the full post here: schwinger.dev/posts/decade-of-drl
What would your list be?
submitted by /u/Ill-Accident-836