Decade* of DRL

digitado ⋅ 4 de May de 2026

Inspired by the wounderful blogpost “The Decade of Deep Learning” by Leo Gao, I wrote one about Deep Reinforcement Learning.
One landmark paper per year:

2013 — DQN
2014 — Deterministic policy gradient (DPG)
2015 — DDPG
2016 — AlphaGo
2017 — PPO
2018 — SAC
2019 — Dreamer
2020 — CURL
2021 — Decision Transformer
2022 — InstructGPT (RLHF)
2023 — TD-MPC2
2024 — AlphaProof
2025 — DeepSeek-R1

You can read the full blog under this link: schwinger.dev/posts/decade-of-drl

What would be your list?

submitted by /u/Ill-Accident-836
[link] [comments]

Like 0

Liked Liked