OpenAI: Investigating the consequences of accidentally grading CoT during RL

(alignment.openai.com) by pretext | May 9, 2026 | 0 comments on HN