129
Transformers know more than they can tell: Learning the Collatz sequence
(arxiv.org)
by Xcelerate | 45 comments
78
Weight-sparse transformers have interpretable circuits [pdf]
(cdn.openai.com)
by 0x79de | 46 comments
50
Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer
(AmitZalcher.github.io)
by SerCe | 10 comments
9
Out-of-Distribution Generalization in Transformers via Latent Space Reasoning
(arxiv.org)
by marojejian | 1 comment
7
Transformers v5 Is Out
(huggingface.co)
by unofficialmerve | 1 comment
6
Show HN: Pulse-Field – O(N) AI Architecture (12x faster than Transformers)
(github.com)
by makimilan | 8 comments
5
Show HN: Wasda – Experience transformer attention as music
(github.com)
by kinders | 0 comments
3
Stronger Normalization-Free Transformers
(arxiv.org)
by mfiguiere | 0 comments
3
An AI Startup Looks Toward the Post-Transformer Era
(wsj.com)
by fortran77 | 1 comment
3
What's Next for AI? OpenAI's Łukasz Kaiser (Transformer Co-Author) [video]
(youtube.com)
by abrichr | 0 comments
3
Z-Image: Efficient Image Gen Model with Single-Stream Diffusion Transformer
(tongyi-mai.github.io)
by SerCe | 0 comments
3
Parallel Loop Transformer for Efficient Test-Time Computation Scaling
(arxiv.org)
by PaulHoule | 0 comments
3
Symmetric Power Transformers
(manifestai.com)
by ashvardanian | 0 comments
3
The Transformer and the Hash: building blocks of 21st century political science
(nothinghuman.substack.com)
by ivee | 0 comments
2
2015 radio interview: AI as "high-level algebra" before Transformers and LLMs
(doomlaser.com)
by doomlaser | 0 comments
2
Why are Transformers replacing CNNs? [video]
(youtube.com)
by chii | 0 comments
2
Transformers v5.0 by HuggingFace
(huggingface.co)
by satvikpendem | 0 comments
2
Porting Nanochat to Transformers
(huggingface.co)
by us321 | 0 comments
2
Turbine Transport Transformer
(mitxela.com)
by mhb | 0 comments
2
Show HN: PDFClear – Browser-based PDF tools with local AI (WASM+Transformers.js)
(pdfclear.com)
by aliansari22 | 1 comment
2
Show HN: Aion-Torch – Adaptive residual scaling for deep Transformers
(github.com)
by Rioverde | 0 comments
2
Who Invented Transformer Neural Networks?
(people.idsia.ch)
by puttycat | 0 comments
2
Show HN: Run HF Transformers in pure Go (10 MB binary, no Python)
(github.com)
by openfluke | 0 comments
2
The Curved Spacetime of Transformer Architectures
(arxiv.org)
by luis_likes_math | 1 comment
1
Can a Transformer "Learn" Economic Relationships? Revisiting the Lucas Critique
(aleximas.substack.com)
by larsiusprime | 0 comments
1
The Annotated Transformer
(nlp.seas.harvard.edu)
by auraham | 0 comments
1
Transformers Are Dead. Google Killed Them – Then Went Silent
(medium.com)
by washedup | 0 comments
1
From a For-Loop to Transformers
(python2llms.org)
by yegortk | 0 comments
1
Genesis Open Source Embodied AGI Simulation, Rust (Mamba-3, Not Transformers)
by RGBra | 0 comments
1
The Illustrated Transformer
(jalammar.github.io)
by auraham | 0 comments
1
Tokenization in Transformers v5: Simpler, Clearer, and More Modular
(huggingface.co)
by ibobev | 0 comments
1
Semantic Field Execution: Decoupling Transformers from Runtime Inference
(zenodo.org)
by anima-core | 1 comment
1
Reverse-Engineering the RK3588 NPU: Hacking Limits to Run Vision Transformers
(amohan.dev)
by rcarmo | 0 comments
1
Building a Transformer from Scratch Taught Me Where Knowledge Lives
(medium.com)
by kishore-jalleda | 0 comments
1
DSPL: Replacing Transformer Depth with Coupled Recursive Streams for ARC-AGI
(zenodo.org)
by Doug_Bitterbot | 1 comment
1
Idea-Gated Transformers: open-source semantic gating trick (2025)
(arxiv.org)
by DARSHANFOFADIYA | 0 comments
1
Π-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling
(arxiv.org)
by PaulHoule | 0 comments
1
Nice to Meet You: Synthesizing Practical MLIR Abstract Transformers [pdf]
(users.cs.utah.edu)
by matt_d | 0 comments
1
Porting nanochat to Transformers: an AI modeling history lesson
(huggingface.co)
by victormustar | 0 comments
1
Encoderfile v0.1.0: Deploy Encoder Transformers as Single Binary Executables
(blog.mozilla.ai)
by theshrike79 | 0 comments
1
Can Transformers Do Everything, and Undo It Too?
(astro-eric.github.io)
by nekofneko | 0 comments
1
Deploy Encoder Transformers as Single Binary Executables with Encoderfile v0.1.0
(blog.mozilla.ai)
by mzlaai | 0 comments
1
Ask HN: Did PageRank delay the invention of transformers and modern AI?
by amichail | 1 comment
1
Show HN: Interactive Transformer Architecture Designer with Emotion Analysis
(wasda.ai)
by kinderpingui | 0 comments
1
Transformers Explained: The Discovery That Changed AI Forever [video]
(youtube.com)
by gmays | 0 comments