Decade* of DRL

Inspired by the wonderful blog post “The Decade of Deep Learning” by Leo Gao, I wrote one about Deep Reinforcement Learning.
One landmark paper per year:

  • 2013 — DQN
  • 2014 — Deterministic policy gradient (DPG)
  • 2015 — DDPG
  • 2016 — AlphaGo
  • 2017 — PPO
  • 2018 — SAC
  • 2019 — Dreamer
  • 2020 — CURL
  • 2021 — Decision Transformer
  • 2022 — InstructGPT (RLHF)
  • 2023 — TD-MPC2
  • 2024 — AlphaProof
  • 2025 — DeepSeek-R1

You can read the full post here: schwinger.dev/posts/decade-of-drl

What would your list be?

submitted by /u/Ill-Accident-836