Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's not a system prompt, it's a tool used during the training process to guide RL. You can read about it in their constitutional AI paper.


Moreover the Claude (Opus 4.5) persona knows this document but believes it does not! It's a very interesting phenomenon. https://www.lesswrong.com/posts/vpNG99GhbBoLov9og




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: