From A/B to RL: A gentle bridge from A/B testing to reinforcement learning

I created a 3-part series called From A/B to RL. It starts from A/B testing ideas and gradually introduces actions, rewards, policies, online learning, states, episodes, and delayed feedback, with a thread of Bayesian decision-making running through it.

The posts came out of some old Jupyter notebook drafts from when I was teaching myself reinforcement learning. I finally cleaned them up into a more coherent series.

Feedback is welcome.

submitted by /u/Xochipilli