News
Latest
Top
Search
Submit
Login
Search
▲
5
Finding Alignment by Visualizing Music in Rust
(positron.solutions)
by positron26 |
view
|
0 comments
▲
4
64-Bit Misalignment
(jordivillar.com)
by thunderbong |
view
|
1 comments
▲
3
Is AI Really Alignment Faking?
(iacgm.com)
by iacgm |
view
|
1 comments
▲
3
Show HN: Thermodynamic Alignment Forces Gemini Thinking into "Burn Protocol"
(github.com)
by CodeIncept1111 |
view
|
7 comments
▲
3
Natural emergent misalignment from reward hacking in production rl [pdf]
(assets.anthropic.com)
by neapolisbeach |
view
|
0 comments
▲
3
Aligning brains into a shared space improves their alignment with LLMs
(nature.com)
by stevenjgarner |
view
|
0 comments
▲
2
Secure Linear Alignment of Large Language Models
(arxiv.org)
by walterbell |
view
|
1 comments
▲
2
What if AI alignment is a skill, not a state?
(12gramsofcarbon.com)
by theahura |
view
|
0 comments
▲
2
Show HN: Chatbot Without Safety Alignment
(coralflavor.com)
by JohnLins |
view
|
0 comments
▲
2
Who Owns Alignment?
(backnotprop.substack.com)
by ramoz |
view
|
0 comments
▲
2
Ask HN: Is the absence of affect the real barrier to AGI and alignment?
by n-exploit |
view
|
1 comments
▲
2
Alignment Research Blog
(alignment.openai.com)
by ironyman |
view
|
0 comments
▲
2
Values.md – file format for personal ethical alignment
(values.md)
by georgestrakhov |
view
|
0 comments
▲
2
Alignment: The Invisible Force That Makes Everything Work
(itrevolution.com)
by mooreds |
view
|
0 comments
▲
2
Wargaming AI Alignment
(twitter.com)
by JL-Akrasia |
view
|
2 comments
▲
2
Show HN: Alignmenter – Measure brand voice and consistency across model versions
(alignmenter.com)
by justingrosvenor |
view
|
2 comments
▲
2
TelUI 1.2: TelUI with fun alignments
by telui |
view
|
0 comments
▲
1
AI Alignment Is Impossible
(persuasion.community)
by cdrnsf |
view
|
0 comments
▲
1
Alignment by Default?
(blog.cosmos-institute.org)
by paulpauper |
view
|
0 comments
▲
1
Communicating Our Research with Stakeholders to Achieve Alignment and Trust
(blog.ptidej.net)
by Fatima-sabir |
view
|
0 comments
▲
1
One Developer, Two Dozen Agents, Zero Alignment
(maggieappleton.com)
by facundo_olano |
view
|
0 comments
▲
1
One Developer, Two Dozen Agents, Zero Alignment
(maggieappleton.com)
by andrem |
view
|
0 comments
▲
1
Mythos Just Proved the Alignment Field Is Building the Wrong Thing
(substack.com)
by ajspizz |
view
|
0 comments
▲
1
AI alignment: the signal is the goal
(substack.com)
by atzeus |
view
|
0 comments
▲
1
Alignment Risk Update for Claude Mythos [pdf]
(www-cdn.anthropic.com)
by jablongo |
view
|
0 comments
▲
1
The Cost of Misalignment
(interrupt.memfault.com)
by vinhnx |
view
|
0 comments
▲
1
Anthropic: Alignment Risk Update: Claude Mythos Preview [pdf]
(www-cdn.anthropic.com)
by tosh |
view
|
0 comments
▲
1
AI alignment: the signal is the goal
(substack.com)
by atzeus |
view
|
0 comments
▲
1
"Alignment" and "Safety", Part One: What Is "AI Safety"?
(lesswrong.com)
by joozio |
view
|
0 comments
▲
1
A Big Alignment Loophole of Current Froniter LLMs
(github.com)
by pythonsen |
view
|
1 comments
▲
1
AI alignment – the signal is the goal
(substack.com)
by atzeus |
view
|
0 comments
▲
1
Alignment Whack-a-Mole
(arxiv.org)
by ai_critic |
view
|
0 comments
▲
1
Show HN: Skillwave – agent orchestrator with async comms and goal-alignment loop
(github.com)
by guyzana |
view
|
0 comments
▲
1
Value Drifts: Tracing Value Alignment During LLM Post-Training
(arxiv.org)
by antigrav_kids |
view
|
0 comments
▲
1
Food for Agile Thought #537: AutoResearch, CPO-CTO Alignment, Overrated Autonomy
(age-of-product.com)
by swolpers |
view
|
0 comments
▲
1
The Alignment Illusion
(cameronwestland.com)
by camwest |
view
|
0 comments
▲
1
Expert Personas Improve LLM Alignment but Damage Accuracy
(arxiv.org)
by inaros |
view
|
0 comments
▲
1
Expert Personas Improve LLM Alignment but Damage Accuracy
(arxiv.org)
by Jacques2Marais |
view
|
0 comments
▲
1
Which types of AI alignment research are most likely to be good for all sentien
(lesswrong.com)
by joozio |
view
|
0 comments
▲
1
Show HN: MAGA or Not? Political alignment scores for people and companies
(magaornot.ai)
by rcar1046 |
view
|
0 comments
▲
1
Code review as human alignment, in the era of LLMs
(blog.ezyang.com)
by mrtz |
view
|
0 comments
▲
1
How we monitor internal coding agents for misalignment
(openai.com)
by gmays |
view
|
0 comments
▲
1
What 33 AI Agents Taught Me About Alignment
(thealignmentlayer.substack.com)
by slythefox |
view
|
0 comments
▲
1
How we monitor internal coding agents for misalignment
(openai.com)
by phillco |
view
|
0 comments
▲
1
We monitor internal coding agents for misalignment
(openai.com)
by surprisetalk |
view
|
0 comments
▲
1
Project Itohs Harmony and the under explored extremes of alignment theory
by calmkeepai |
view
|
0 comments
▲
1
The Philosophy of AI, Man, Machine and Alignment; the Only Viable Option
(lipglosssunsetsii.blogspot.com)
by FATHERGODOfDK |
view
|
1 comments
▲
1
Natural Emergent Misalignment from Reward Hacking in Production RL [pdf]
(assets.anthropic.com)
by marcuschong |
view
|
0 comments
▲
1
Electroacoustic alignment of robust and highly piezoelectric nylon-11 films
(nature.com)
by PaulHoule |
view
|
0 comments
▲
1
Against the Orthogonality Thesis Part 2 – Alignment
(jonasmoman.substack.com)
by paulpauper |
view
|
0 comments