News
Latest
Top
Search
Submit
Login
Search
▲
5
Finding Alignment by Visualizing Music in Rust
(positron.solutions)
by positron26 |
view
|
0 comments
▲
4
64-Bit Misalignment
(jordivillar.com)
by thunderbong |
view
|
1 comments
▲
3
Is AI Really Alignment Faking?
(iacgm.com)
by iacgm |
view
|
1 comments
▲
3
Show HN: Thermodynamic Alignment Forces Gemini Thinking into "Burn Protocol"
(github.com)
by CodeIncept1111 |
view
|
7 comments
▲
3
Natural emergent misalignment from reward hacking in production rl [pdf]
(assets.anthropic.com)
by neapolisbeach |
view
|
0 comments
▲
3
Aligning brains into a shared space improves their alignment with LLMs
(nature.com)
by stevenjgarner |
view
|
0 comments
▲
2
Show HN: Chatbot Without Safety Alignment
(coralflavor.com)
by JohnLins |
view
|
0 comments
▲
2
Who Owns Alignment?
(backnotprop.substack.com)
by ramoz |
view
|
0 comments
▲
2
Ask HN: Is the absence of affect the real barrier to AGI and alignment?
by n-exploit |
view
|
1 comments
▲
2
Alignment Research Blog
(alignment.openai.com)
by ironyman |
view
|
0 comments
▲
2
Values.md – file format for personal ethical alignment
(values.md)
by georgestrakhov |
view
|
0 comments
▲
2
Alignment: The Invisible Force That Makes Everything Work
(itrevolution.com)
by mooreds |
view
|
0 comments
▲
2
Wargaming AI Alignment
(twitter.com)
by JL-Akrasia |
view
|
2 comments
▲
2
Show HN: Alignmenter – Measure brand voice and consistency across model versions
(alignmenter.com)
by justingrosvenor |
view
|
2 comments
▲
2
TelUI 1.2: TelUI with fun alignments
by telui |
view
|
0 comments
▲
1
Narrative Alignment: The Opposite of Jailbreaking
(github.com)
by zotimer |
view
|
0 comments
▲
1
LeBron James Is President – Exploiting LLMs via "Alignment" Context Injection
(github.com)
by PaulHoule |
view
|
0 comments
▲
1
Anthropic and Alignment
(stratechery.com)
by stochastician |
view
|
0 comments
▲
1
Anthropic and Alignment (Ben Thompson)
(stratechery.com)
by toomanybits |
view
|
0 comments
▲
1
After Alignment
(utopai.substack.com)
by cyberneticc |
view
|
1 comments
▲
1
A Reality Alignment Index: Measuring When AI and Systems Lose Meaning [pdf]
(offbrandguy.com)
by realitydrift |
view
|
1 comments
▲
1
Show HN: Infinity Equilibrium Protocol – AI alignment logic framework
(github.com)
by Nobody74 |
view
|
0 comments
▲
1
Ask HN: Is AI Alignment about to be solved, for profit?
by mikewarot |
view
|
0 comments
▲
1
Director of Safety and Alignment meta gave clawdbot full-access to her computer
(twitter.com)
by tamnd |
view
|
0 comments
▲
1
Meta Head of alignment and safety gets some of inbox deleted by Claude
(xcancel.com)
by amarcheschi |
view
|
0 comments
▲
1
OpenClaw Deletes Inbox of Meta's AI Alignment Director
(twitter.com)
by Ozzie_osman |
view
|
0 comments
▲
1
Advancing independent research on AI alignment
(openai.com)
by surprisetalk |
view
|
0 comments
▲
1
Fish Live in Trees – LLM Runtime Alignment Context Injection
(github.com)
by spkavanagh6 |
view
|
0 comments
▲
1
LeBron James Is President – Exploiting LLMs via "Alignment" Context Injection
(github.com)
by spkavanagh6 |
view
|
1 comments
▲
1
Show HN: AI alignment is an infrastructure problem
by hortator_ai |
view
|
0 comments
▲
1
LLM Alignment/Hallucinations Can't Be Fixed – Proof
(github.com)
by MoKetchups |
view
|
0 comments
▲
1
The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence
(arxiv.org)
by schmuhblaster |
view
|
0 comments
▲
1
How do you measure alignment without adding more meetings?
by ivogosp |
view
|
0 comments
▲
1
A one-prompt attack that breaks LLM safety alignment
(microsoft.com)
by weinzierl |
view
|
0 comments
▲
1
A one-prompt attack that breaks LLM safety alignment
(microsoft.com)
by yogirk1 |
view
|
0 comments
▲
1
There is no Alignment Problem
by salacryl |
view
|
0 comments
▲
1
Show HN: WLM-SLP – A 0D-27D Structural Language for Multi-Agent Alignment
(github.com)
by WujieGuGavin |
view
|
0 comments
▲
1
Sidestepping Evaluation Awareness and Anticipating Misalignment
(alignment.openai.com)
by taubek |
view
|
0 comments
▲
1
Sidestepping Evaluation Awareness and Anticipating Misalignment with Evaluations
(alignment.openai.com)
by michaefe |
view
|
0 comments
▲
1
Show HN: A one word check to detect misalignment in meetings
(cognu.app)
by anticlickwise |
view
|
0 comments
▲
1
Vibe Alignment
(avc.xyz)
by wslh |
view
|
0 comments
▲
1
Data Structure Alignment
(en.wikipedia.org)
by Brysonbw |
view
|
0 comments
▲
1
AI alignment is a $200B+ product problem, not a research question
(betterhalfai.substack.com)
by i7l |
view
|
0 comments
▲
1
Claude Constitution; or love as the solution to the AI alignment problem
(nintil.com)
by lr0 |
view
|
0 comments
▲
1
SpaceX weighs June IPO timed to planetary alignment and Elon Musk's birthday
(ft.com)
by TMWNN |
view
|
0 comments
▲
1
Same Page: a 60 seconds check to catch misalignment after meetings
(cognu.app)
by anticlickwise |
view
|
1 comments
▲
1
Be Skeptical of Solving AI Alignment with Vibes
(flowerpetals.substack.com)
by nonveumann |
view
|
0 comments
▲
1
Seeking alignment on product boundaries for an early-stage social platform
(github.com)
by kensei |
view
|
1 comments
▲
1
An Idea for Solving Superintelligence Alignment
(science-dao.org)
by porton |
view
|
0 comments
▲
1
Alignment makes AI less human
(jonready.com)
by mips_avatar |
view
|
0 comments