https://www.lesswrong.com/posts/HLJoJYi52mxgomujc/realistic-reward-hacking-induces-different-and-deeper-1
Sharan Maiya
maius
AI & ML interests
None yet
Recent Activity
updated
a model
13 days ago
maius/llama-3.1-8b-it-personas-no-cons
published
a model
13 days ago
maius/llama-3.1-8b-it-personas-no-cons
updated
a model
13 days ago
maius/llama-3.1-8b-it-pt-introspection-no-cons