News
Latest
Top
Search
Submit
Login
Search
▲
539
LLM from scratch, part 28 – training a base model from scratch on an RTX 3090
(gilesthomas.com)
by gpjt |
view
|
121 comments
▲
69
Training LLMs for honesty via confessions
(arxiv.org)
by arabello |
view
|
58 comments
▲
26
Training Foundation Models on a Full-Stack AMD Platform
(arxiv.org)
by ngaut |
view
|
1 comments
▲
20
Pre-training under infinite compute
(arxiv.org)
by SweetSoftPillow |
view
|
0 comments
▲
16
Training Qwen 4B to Beat Large Models on Work Tasks
(neurometric.substack.com)
by robmay |
view
|
0 comments
▲
11
OpenAI acquired AI training monitor Neptune
(neptune.ai)
by stared |
view
|
15 comments
▲
10
Ask HN: What did onboarding training look like in OS kernel teams?
by markus_zhang |
view
|
2 comments
▲
9
Trump Signs Epstein Files Bill After Fight Straining Party Unity
(bloomberg.com)
by wslh |
view
|
2 comments
▲
8
Open-source VR framework for training rats to play DOOM
(ratsplaydoom.com)
by k0ba |
view
|
0 comments
▲
6
The Kenyan workers training China's AI models
(restofworld.org)
by poisonborz |
view
|
0 comments
▲
6
The 1B Token Challenge: Finding the Perfect Pre-Training Mix
(huggingface.co)
by codelion |
view
|
0 comments
▲
6
Putin Is Turning Eighth-Grade Classrooms into Army Training Grounds
(wsj.com)
by pinewurst |
view
|
0 comments
▲
5
Dating apps are training us to want the wrong people
(bloomberg.com)
by barishnamazov |
view
|
2 comments
▲
5
Amazon Found 'High Volume' of Child Sex Abuse Material in AI Training Data
(bloomberg.com)
by speckx |
view
|
0 comments
▲
5
Don't spill your guts to your chatbot friend – it'll hoover up info for training
(theregister.com)
by Bender |
view
|
1 comments
▲
5
Inside A Texas Church's Training Academy for Christians Running for Office
(fortworthreport.org)
by beardyw |
view
|
3 comments
▲
5
Meta Downloaded 2,400 'Adult Movies' and Says Personal Use, Not Training AI
(vice.com)
by felineflock |
view
|
2 comments
▲
5
Her Research Could Improve Training for Service Dogs
(nytimes.com)
by mikhael |
view
|
0 comments
▲
5
Meta Says Porn Stash Was for 'Personal Use,' Not Training AI Models
(gizmodo.com)
by alphabettsy |
view
|
5 comments
▲
4
China's Tech Giants Take AI Model Training Offshore to Tap Nvidia Chips
(ft.com)
by skx001 |
view
|
0 comments
▲
4
AWS's Project Rainier: the most powerful computer for training AI
(aboutamazon.com)
by kristianp |
view
|
0 comments
▲
3
Show HN: Autonomous recovery for distributed training jobs
(docs.tensorpool.dev)
by tsvoboda |
view
|
2 comments
▲
3
Show HN: Per-instance TSP Solver with No Pre-training (1.66% gap on d1291)
by jivaprime |
view
|
0 comments
▲
3
Show HN: A repo to turn any model into a reasoning model without training
(github.com)
by Dl1683 |
view
|
0 comments
▲
3
OpenAI to acquire Neptune, a startup that helps with AI model training
(cnbc.com)
by pseudolus |
view
|
1 comments
▲
3
OpenAI Agrees to Acquire Neptune to Improve AI Model Training
(bloomberg.com)
by world2vec |
view
|
2 comments
▲
3
China's tech giants take AI model training offshore to tap Nvidia chips
(ft.com)
by alecco |
view
|
1 comments
▲
3
Google denies analyzing your emails for AI training
(zdnet.com)
by nreece |
view
|
0 comments
▲
3
Training Agents Inside of Scalable World Models
(danijar.com)
by CharlesW |
view
|
0 comments
▲
3
Ask HN: How do you handle logging and evaluation when training ML models?
by calepayson |
view
|
2 comments
▲
3
Open source x 3: GRPO training with OpenEnv, vLLM, and Oumi
(github.com)
by stefanwebb |
view
|
0 comments
▲
3
Miro: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency
(arxiv.org)
by PaulHoule |
view
|
0 comments
▲
3
Don't Fight the Weights: Learn to Spot Contexts That Go Against Training
(dbreunig.com)
by dbreunig |
view
|
0 comments
▲
3
EU Violates Case Law in Proposed GDPR Big Tech AI Training Carve-Out
(noyb.eu)
by piltdownman |
view
|
3 comments
▲
3
Show HN: I built a tool to create custom OCR APIs in minutes, no training needed
(struxs.com)
by great_domino |
view
|
2 comments
▲
3
Deep Learning Without Training
(zenodo.org)
by car |
view
|
1 comments
▲
3
The 1B Token Challenge: Finding the Perfect Pre-Training Mix
(huggingface.co)
by codelion |
view
|
0 comments
▲
3
Show HN: Torque – A declarative, typesafe DSL for LLM training datasets (MIT)
(github.com)
by michalwarda |
view
|
1 comments
▲
2
Show HN: Orion – Native Training LLMs on the Apple Neural Engine Without CoreML
(github.com)
by mechramc |
view
|
1 comments
▲
2
GitHub – Maderix/ANE: Training Neural Networks on Apple Neural Engine
(github.com)
by bilsbie |
view
|
0 comments
▲
2
Training microgpt 19,000x faster on M5 Mac
(github.com)
by easygenes |
view
|
1 comments
▲
2
Ask HN: Is LLM training infra still broken enough to build a company around?
by harsh020 |
view
|
0 comments
▲
2
Fair Use Paradox: Training and Distillation
(jasonwillems.com)
by jayw_lead |
view
|
0 comments
▲
2
Show HN: Coco Ear Training – a research-backed ear training app for musicians
(cocomusic.app)
by marksantonocito |
view
|
0 comments
▲
2
This doctor is training AI to do her job. And it's a booming business
(cnn.com)
by hhs |
view
|
0 comments
▲
2
Xbox cancel French localizations as voice actors refuse AI training clauses
(jeuxonline.info)
by WhereIsTheTruth |
view
|
0 comments
▲
2
Speed-based cognitive training reverses 10 years of brain aging
(medium.com)
by smanuel |
view
|
0 comments
▲
2
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
(huggingface.co)
by ibobev |
view
|
0 comments
▲
2
Best AI Training Platforms of 2026: Ranked and Reviewed
(aitrainer.work)
by xceladonx |
view
|
0 comments
▲
2
Starlink updates Privacy Policy to allow AI model training with personal data
(coywolf.com)
by speckx |
view
|
0 comments