natolambert/rlhf-book
Textbook on reinforcement learning from human feedback
Language: TeX
Total stars: 237
#tex
#ai, #alignment, #rlhf
Stars trend:
1 Feb 2025
10pm ▍ +3
11pm ▌ +4
2 Feb 2025
12am ▉ +7
1am █ +8
2am ▊ +6
3am █▌ +12
4am ▌ +4
5am █▏ +9
6am ▊ +6
7am █▏ +9
8am █ +8