▲
129
Transformers know more than they can tell: Learning the Collatz sequence
(arxiv.org)
by Xcelerate | 45 comments
▲
78
Weight-sparse transformers have interpretable circuits [pdf]
(cdn.openai.com)
by 0x79de | 46 comments
▲
50
Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer
(AmitZalcher.github.io)
by SerCe | 10 comments
▲
9
Out-of-Distribution Generalization in Transformers via Latent Space Reasoning
(arxiv.org)
by marojejian | 1 comment
▲
7
Transformers v5 Is Out
(huggingface.co)
by unofficialmerve | 1 comment
▲
6
Show HN: Pulse-Field – O(N) AI Architecture (12x faster than Transformers)
(github.com)
by makimilan | 8 comments
▲
5
Show HN: Wasda – Experience transformer attention as music
(github.com)
by kinders | 0 comments
▲
4
Show HN: MacMind – A transformer neural network in HyperCard on a 1989 Macintosh
(github.com)
by hammer32 | 0 comments
▲
3
Ask HN: Analog Model of Transformers
by JPLeRouzic | 0 comments
▲
3
Stronger Normalization-Free Transformers
(arxiv.org)
by mfiguiere | 0 comments
▲
3
An AI Startup Looks Toward the Post-Transformer Era
(wsj.com)
by fortran77 | 1 comment
▲
3
What's Next for AI? OpenAI's Łukasz Kaiser (Transformer Co-Author) [video]
(youtube.com)
by abrichr | 0 comments
▲
3
Z-Image: Efficient Image Gen Model with Single-Stream Diffusion Transformer
(tongyi-mai.github.io)
by SerCe | 0 comments
▲
3
Parallel Loop Transformer for Efficient Test-Time Computation Scaling
(arxiv.org)
by PaulHoule | 0 comments
▲
3
Symmetric Power Transformers
(manifestai.com)
by ashvardanian | 0 comments
▲
3
The Transformer and the Hash: building blocks of 21st century political science
(nothinghuman.substack.com)
by ivee | 0 comments
▲
2
Do Transformers Need Three Projections? Systematic Study of QKV Variants
(arxiv.org)
by Anon84 | 0 comments
▲
2
Sparser, Faster, Lighter Transformer Language Models
(pub.sakana.ai)
by hardmaru | 0 comments
▲
2
LingBot-Map: Streaming 3D reconstruction with geometric context transformer
(technology.robbyant.com)
by nateb2022 | 0 comments
▲
2
Transformer from scratch HTTPS://github.com/Eamon2009/Transformer-language-model
by Eamon_Sippy | 0 comments
▲
2
What would you do if you have AI software that may be transformers alternative?
by adinhitlore | 1 comment
▲
2
A biologically inspired cognitive architecture without Transformers
(github.com)
by Brain_cognitive | 1 comment
▲
2
Securing America's grid: a strategic transformer reserve
(breakingdefense.com)
by jrpt | 0 comments
▲
2
Get to Grips with Transformers and LLMs
(i-programmer.info)
by aquastorm | 0 comments
▲
2
2015 radio interview: AI as "high-level algebra" before Transformers and LLMs
(doomlaser.com)
by doomlaser | 0 comments
▲
2
Why are Transformers replacing CNNs? [video]
(youtube.com)
by chii | 0 comments
▲
2
Transformers v5.0 by HuggingFace
(huggingface.co)
by satvikpendem | 0 comments
▲
2
Porting Nanochat to Transformers
(huggingface.co)
by us321 | 0 comments
▲
2
Turbine Transport Transformer
(mitxela.com)
by mhb | 0 comments
▲
2
Show HN: PDFClear – Browser-based PDF tools with local AI (WASM+Transformers.js)
(pdfclear.com)
by aliansari22 | 1 comment
▲
2
Show HN: Aion-Torch – Adaptive residual scaling for deep Transformers
(github.com)
by Rioverde | 0 comments
▲
2
Who Invented Transformer Neural Networks?
(people.idsia.ch)
by puttycat | 0 comments
▲
2
Show HN: Run HF Transformers in pure Go (10 MB binary, no Python)
(github.com)
by openfluke | 0 comments
▲
2
The Curved Spacetime of Transformer Architectures
(arxiv.org)
by luis_likes_math | 1 comment
▲
1
Inside The Transformer: The Life of a Token
(aleksagordic.com)
by thunderbong | 0 comments
▲
1
Adaptive Low-Rank Transformer with Dynamic Expert Routing for Continual Learning
(zenodo.org)
by jballanc | 0 comments
▲
1
Transformers Are Inherently Succinct
(openreview.net)
by brandonb | 0 comments
▲
1
Transformer Golf – The Unrolled Transformer
(github.com)
by brianjmingus | 0 comments
▲
1
Serving Transformers: Lessons from the Trenches – Stanford CS25 Transformers [video]
(youtube.com)
by matt_d | 0 comments
▲
1
Nemotron 3 Ultra: Open MoE Hybrid Mamba-Transformer for Agentic Reasoning [pdf]
(research.nvidia.com)
by victormustar | 0 comments
▲
1
Elon's Trillions (text-transformer-generated)
(pastebin.com)
by joebig | 0 comments
▲
1
Lattice Deduction Transformers
(arxiv.org)
by 44za12 | 0 comments
▲
1
Building a Recurrent-Depth Transformer for Security Research on a 2013 MacBook
(github.com)
by btthomas | 0 comments
▲
1
Déjà View: Looping Transformers for Multi-View 3D Reconstruction
(research.nvidia.com)
by theschwa | 0 comments
▲
1
TCNs as Alternative to Transformers?
by adinhitlore | 0 comments
▲
1
Delayed Tensor Parallelism for Faster Transformer Inference
(blog.kog.ai)
by matt_d | 0 comments
▲
1
AVTR-1: Open-weight real-time flow-matching transformer for audio-driven avatars
(github.com)
by hexfaker | 0 comments
▲
1
The Transformer: The Life of a Token
(aleksagordic.com)
by ai-epiphany | 0 comments
▲
1
Show HN: NeuroFlow 55.8x video inference speedup for Vision Transformers PyTorch
(github.com)
by ynnk | 0 comments
▲
1
LT2: Linear-Time Looped Transformers
(charlesdddd.github.io)
by matt_d | 0 comments