24
Reinforcement Learning from Human Feedback (RLHF) in Notebooks (github.com/ash80)
5 days ago | ash_at_hny | github.com | frontpage