▲
539
LLM from scratch, part 28 – training a base model from scratch on an RTX 3090
(gilesthomas.com)
by gpjt |
view
|
121 comments
▲
69
Training LLMs for honesty via confessions
(arxiv.org)
by arabello |
view
|
58 comments
▲
26
Training Foundation Models on a Full-Stack AMD Platform
(arxiv.org)
by ngaut |
view
|
1 comment
▲
20
Pre-training under infinite compute
(arxiv.org)
by SweetSoftPillow |
view
|
0 comments
▲
11
OpenAI acquired AI training monitor Neptune
(neptune.ai)
by stared |
view
|
15 comments
▲
10
Ask HN: What did onboarding training look like in OS kernel teams?
by markus_zhang |
view
|
2 comments
▲
9
Trump Signs Epstein Files Bill After Fight Straining Party Unity
(bloomberg.com)
by wslh |
view
|
2 comments
▲
8
Open-source VR framework for training rats to play DOOM
(ratsplaydoom.com)
by k0ba |
view
|
0 comments
▲
6
The Kenyan workers training China's AI models
(restofworld.org)
by poisonborz |
view
|
0 comments
▲
6
The 1B Token Challenge: Finding the Perfect Pre-Training Mix
(huggingface.co)
by codelion |
view
|
0 comments
▲
6
Putin Is Turning Eighth-Grade Classrooms into Army Training Grounds
(wsj.com)
by pinewurst |
view
|
0 comments
▲
5
Don't spill your guts to your chatbot friend – it'll hoover up info for training
(theregister.com)
by Bender |
view
|
1 comment
▲
5
Inside A Texas Church's Training Academy for Christians Running for Office
(fortworthreport.org)
by beardyw |
view
|
3 comments
▲
5
Meta Downloaded 2,400 'Adult Movies' and Says Personal Use, Not Training AI
(vice.com)
by felineflock |
view
|
2 comments
▲
5
Her Research Could Improve Training for Service Dogs
(nytimes.com)
by mikhael |
view
|
0 comments
▲
5
Meta Says Porn Stash Was for 'Personal Use,' Not Training AI Models
(gizmodo.com)
by alphabettsy |
view
|
5 comments
▲
4
China's Tech Giants Take AI Model Training Offshore to Tap Nvidia Chips
(ft.com)
by skx001 |
view
|
0 comments
▲
4
AWS's Project Rainier: the most powerful computer for training AI
(aboutamazon.com)
by kristianp |
view
|
0 comments
▲
3
Show HN: A repo to turn any model into a reasoning model without training
(github.com)
by Dl1683 |
view
|
0 comments
▲
3
OpenAI to acquire Neptune, a startup that helps with AI model training
(cnbc.com)
by pseudolus |
view
|
1 comment
▲
3
OpenAI Agrees to Acquire Neptune to Improve AI Model Training
(bloomberg.com)
by world2vec |
view
|
2 comments
▲
3
China's tech giants take AI model training offshore to tap Nvidia chips
(ft.com)
by alecco |
view
|
1 comment
▲
3
Google denies analyzing your emails for AI training
(zdnet.com)
by nreece |
view
|
0 comments
▲
3
Training Agents Inside of Scalable World Models
(danijar.com)
by CharlesW |
view
|
0 comments
▲
3
Ask HN: How do you handle logging and evaluation when training ML models?
by calepayson |
view
|
2 comments
▲
3
Open source x 3: GRPO training with OpenEnv, vLLM, and Oumi
(github.com)
by stefanwebb |
view
|
0 comments
▲
3
Miro: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency
(arxiv.org)
by PaulHoule |
view
|
0 comments
▲
3
Don't Fight the Weights: Learn to Spot Contexts That Go Against Training
(dbreunig.com)
by dbreunig |
view
|
0 comments
▲
3
EU Violates Case Law in Proposed GDPR Big Tech AI Training Carve-Out
(noyb.eu)
by piltdownman |
view
|
3 comments
▲
3
Show HN: I built a tool to create custom OCR APIs in minutes, no training needed
(struxs.com)
by great_domino |
view
|
2 comments
▲
3
Deep Learning Without Training
(zenodo.org)
by car |
view
|
1 comment
▲
3
The 1B Token Challenge: Finding the Perfect Pre-Training Mix
(huggingface.co)
by codelion |
view
|
0 comments
▲
3
Show HN: Torque – A declarative, typesafe DSL for LLM training datasets (MIT)
(github.com)
by michalwarda |
view
|
1 comment
▲
2
Show HN: Build ML training datasets from large-scale satellite/aerial imagery
(github.com)
by noahgolmant |
view
|
0 comments
▲
2
Agents Training Agents: A practical architecture for autonomous self-improvement
(techlife.blog)
by tsenturk |
view
|
3 comments
▲
2
Fair Use Paradox: If Training on Public Data Is Fair Use, Why Not Distillation?
(jasonwillems.com)
by jayw_lead |
view
|
1 comment
▲
2
Training LLMs for Honesty via Confessions [pdf]
(cdn.openai.com)
by goplayoutside |
view
|
0 comments
▲
2
Awesome-distributed-ML – A curated list for distributed [faster] LLM training
(github.com)
by peter_d_sherman |
view
|
0 comments
▲
2
Training open source LLMs at ESE Kongress 2025
(collabora.com)
by losgehts |
view
|
0 comments
▲
2
Training Foundation Models on a Full-Stack AMD Platform
(arxiv.org)
by srameshc |
view
|
0 comments
▲
2
OptiLLM: Accuracy improvements on reasoning tasks with zero training
(github.com)
by rzk |
view
|
0 comments
▲
2
Next general training environment for superintelligence?
(shash42.substack.com)
by shash42 |
view
|
1 comment
▲
2
Recap: Reproducing Copyrighted Data from LLMs Training with an Agentic Pipeline
(arxiv.org)
by PaulHoule |
view
|
0 comments
▲
2
AI data centers are straining power grids, environmental resources and markets
(bloomberg.com)
by zerosizedweasle |
view
|
0 comments
▲
2
Cyberattack Prevention via Anomaly Detection Ensembles and Diverse Training Sets
(mdpi.com)
by PaulHoule |
view
|
0 comments
▲
2
Built an Energy-Based Model without training – just GloVe and GPT embeddings
(github.com)
by kinders |
view
|
0 comments
▲
2
Show HN: PaceGuru – Visualizing data and guiding personalized training
(paceguru.app)
by laihj |
view
|
1 comment
▲
2
Thinking through how pretraining vs. RL learn
(dwarkesh.com)
by gwintrob |
view
|
0 comments
▲
2
Quantifying Long-Range Information for Long-Context LLM Pretraining Data
(arxiv.org)
by PaulHoule |
view
|
0 comments
▲
2
Rating the Rater: Training Tasteful Generative Models
(markusstrasser.org)
by eatitraw |
view
|
0 comments