Has there been a followup to “A Closer Look at Deep Policy Gradients” for recent on-policy PG methods?

paper: https://arxiv.org/pdf/1811.02553

I checked connected papers and didn’t find any recent papers on the questions/issues raised in this paper. They seem pretty insightful to me, so I’m debating at looking at whether more recent methods have alleviated the issues, and if so, why.

submitted by /u/icantclosemytub
[link] [comments]

Liked Liked