News
Latest
Top
Search
Submit
Login
Search
▲
5
Finding Alignment by Visualizing Music in Rust
(positron.solutions)
by positron26 |
view
|
0 comments
▲
4
64-Bit Misalignment
(jordivillar.com)
by thunderbong |
view
|
1 comments
▲
3
Is AI Really Alignment Faking?
(iacgm.com)
by iacgm |
view
|
1 comments
▲
3
Show HN: Thermodynamic Alignment Forces Gemini Thinking into "Burn Protocol"
(github.com)
by CodeIncept1111 |
view
|
7 comments
▲
3
Natural emergent misalignment from reward hacking in production rl [pdf]
(assets.anthropic.com)
by neapolisbeach |
view
|
0 comments
▲
3
Aligning brains into a shared space improves their alignment with LLMs
(nature.com)
by stevenjgarner |
view
|
0 comments
▲
2
Show HN: Chatbot Without Safety Alignment
(coralflavor.com)
by JohnLins |
view
|
0 comments
▲
2
Who Owns Alignment?
(backnotprop.substack.com)
by ramoz |
view
|
0 comments
▲
2
Ask HN: Is the absence of affect the real barrier to AGI and alignment?
by n-exploit |
view
|
1 comments
▲
2
Alignment Research Blog
(alignment.openai.com)
by ironyman |
view
|
0 comments
▲
2
Values.md – file format for personal ethical alignment
(values.md)
by georgestrakhov |
view
|
0 comments
▲
2
Alignment: The Invisible Force That Makes Everything Work
(itrevolution.com)
by mooreds |
view
|
0 comments
▲
2
Wargaming AI Alignment
(twitter.com)
by JL-Akrasia |
view
|
2 comments
▲
2
Show HN: Alignmenter – Measure brand voice and consistency across model versions
(alignmenter.com)
by justingrosvenor |
view
|
2 comments
▲
2
TelUI 1.2: TelUI with fun alignments
by telui |
view
|
0 comments
▲
1
LeBron James Is President – Exploiting LLMs via "Alignment" Context Injection
(github.com)
by spkavanagh6 |
view
|
1 comments
▲
1
Show HN: AI alignment is an infrastructure problem
by hortator_ai |
view
|
0 comments
▲
1
LLM Alignment/Hallucinations Can't Be Fixed – Proof
(github.com)
by MoKetchups |
view
|
0 comments
▲
1
The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence
(arxiv.org)
by schmuhblaster |
view
|
0 comments
▲
1
How do you measure alignment without adding more meetings?
by ivogosp |
view
|
0 comments
▲
1
A one-prompt attack that breaks LLM safety alignment
(microsoft.com)
by weinzierl |
view
|
0 comments
▲
1
A one-prompt attack that breaks LLM safety alignment
(microsoft.com)
by yogirk1 |
view
|
0 comments
▲
1
There is no Alignment Problem
by salacryl |
view
|
0 comments
▲
1
Show HN: WLM-SLP – A 0D-27D Structural Language for Multi-Agent Alignment
(github.com)
by WujieGuGavin |
view
|
0 comments
▲
1
Sidestepping Evaluation Awareness and Anticipating Misalignment
(alignment.openai.com)
by taubek |
view
|
0 comments
▲
1
Sidestepping Evaluation Awareness and Anticipating Misalignment with Evaluations
(alignment.openai.com)
by michaefe |
view
|
0 comments
▲
1
Show HN: A one word check to detect misalignment in meetings
(cognu.app)
by anticlickwise |
view
|
0 comments
▲
1
Vibe Alignment
(avc.xyz)
by wslh |
view
|
0 comments
▲
1
Data Structure Alignment
(en.wikipedia.org)
by Brysonbw |
view
|
0 comments
▲
1
AI alignment is a $200B+ product problem, not a research question
(betterhalfai.substack.com)
by i7l |
view
|
0 comments
▲
1
Claude Constitution; or love as the solution to the AI alignment problem
(nintil.com)
by lr0 |
view
|
0 comments
▲
1
SpaceX weighs June IPO timed to planetary alignment and Elon Musk's birthday
(ft.com)
by TMWNN |
view
|
0 comments
▲
1
Same Page: a 60 seconds check to catch misalignment after meetings
(cognu.app)
by anticlickwise |
view
|
1 comments
▲
1
Be Skeptical of Solving AI Alignment with Vibes
(flowerpetals.substack.com)
by nonveumann |
view
|
0 comments
▲
1
Seeking alignment on product boundaries for an early-stage social platform
(github.com)
by kensei |
view
|
1 comments
▲
1
An Idea for Solving Superintelligence Alignment
(science-dao.org)
by porton |
view
|
0 comments
▲
1
Alignment makes AI less human
(jonready.com)
by mips_avatar |
view
|
0 comments
▲
1
PAZ O.S. – A "Bio-Civic" Alignment Framework for Ethical LLMs
(github.com)
by PiSounds |
view
|
1 comments
▲
1
Training large language models on narrow tasks can lead to broad misalignment
(nature.com)
by petemetefete |
view
|
0 comments
▲
1
Training large language models on narrow tasks can lead to broad misalignment
(nature.com)
by thebeardisred |
view
|
0 comments
▲
1
Training large language models on narrow tasks can lead to broad misalignment
(nature.com)
by Anon84 |
view
|
0 comments
▲
1
Nature: Training LLM's on narrow tasks can lead to broad misalignment
(nature.com)
by trajektorie |
view
|
0 comments
▲
1
A dynamic definition of being "right": error as a temporary misalignment
by DELTA-X |
view
|
0 comments
▲
1
Show HN: PAlignPrims – C++ library for sequence alignment beyond bioinformatics
(github.com)
by offbynull |
view
|
0 comments
▲
1
Silent Worker Teaching Method – AI alignment without modifying weights
(github.com)
by Hope_Genom |
view
|
0 comments
▲
1
The Simulation Gambit: A Game-Theoretic Argument for ASI Alignment
(darayat.substack.com)
by mentalmaths |
view
|
0 comments
▲
1
Thai researcher documents cultural erasure in AI alignment
(zenodo.org)
by luciusrockwing |
view
|
1 comments
▲
1
Ilion Stateless AI Identity Framework for Semantic Alignment and Moral Integrity
(ilion-project.org)
by ilion_identity |
view
|
1 comments
▲
1
Ask HN: The Alignment Tax
by brihati |
view
|
0 comments
▲
1
I Fixed My Coworker's Alignment Problem
(hallofdreams.org)
by TheCog |
view
|
0 comments