Search | News by Netwrck

LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

(gilesthomas.com) by gpjt | view | 121 comments

Training LLMs for honesty via confessions

(arxiv.org) by arabello | view | 58 comments

Grove: Distributed ML Training over AirDrop

(swarnimjain.com) by swar_ja | view | 1 comment

Training Foundation Models on a Full-Stack AMD Platform

(arxiv.org) by ngaut | view | 1 comment

Pre-training under infinite compute

(arxiv.org) by SweetSoftPillow | view | 0 comments

Training Qwen 4B to Beat Large Models on Work Tasks

(neurometric.substack.com) by robmay | view | 0 comments

Field Notes from a Year of Opsec Training

(eff.org) by hn_acker | view | 0 comments

OpenAI acquired AI training monitor Neptune

(neptune.ai) by stared | view | 15 comments

Ask HN: What did onboarding training look like in OS kernel teams?

by markus_zhang | view | 2 comments

Trump Signs Epstein Files Bill After Fight Straining Party Unity

(bloomberg.com) by wslh | view | 2 comments

Open-source VR framework for training rats to play DOOM

(ratsplaydoom.com) by k0ba | view | 0 comments

Encyclopedia Britannica sues OpenAI over AI training

(reuters.com) by thm | view | 0 comments

The Kenyan workers training China's AI models

(restofworld.org) by poisonborz | view | 0 comments

The 1B Token Challenge: Finding the Perfect Pre-Training Mix

(huggingface.co) by codelion | view | 0 comments

Putin Is Turning Eighth-Grade Classrooms into Army Training Grounds

(wsj.com) by pinewurst | view | 0 comments

Dating apps are training us to want the wrong people

(bloomberg.com) by barishnamazov | view | 2 comments

Amazon Found 'High Volume' of Child Sex Abuse Material in AI Training Data

(bloomberg.com) by speckx | view | 0 comments

Don't spill your guts to your chatbot friend – it'll hoover up info for training

(theregister.com) by Bender | view | 1 comment

Inside A Texas Church's Training Academy for Christians Running for Office

(fortworthreport.org) by beardyw | view | 3 comments

Meta Downloaded 2,400 'Adult Movies' and Says Personal Use, Not Training AI

(vice.com) by felineflock | view | 2 comments

Her Research Could Improve Training for Service Dogs

(nytimes.com) by mikhael | view | 0 comments

Meta Says Porn Stash Was for 'Personal Use,' Not Training AI Models

(gizmodo.com) by alphabettsy | view | 5 comments

Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training

(github.com) by xlayn | view | 0 comments

China's Tech Giants Take AI Model Training Offshore to Tap Nvidia Chips

(ft.com) by skx001 | view | 0 comments

AWS's Project Rainier: the most powerful computer for training AI

(aboutamazon.com) by kristianp | view | 0 comments

OpenAI faked inability to search training data, hid billions of logs, NYT says

(arstechnica.com) by cdrnsf | view | 0 comments

Meta to start capturing employee mouse movements, keystrokes for AI training

(tech.yahoo.com) by devonnull | view | 0 comments

Meta capturing employee mouse movements, keystrokes for AI training data

(economictimes.indiatimes.com) by dlx | view | 0 comments

Skilled older workers turn to AI training to stay afloat

(theguardian.com) by billybuckwheat | view | 0 comments

CDox: A Google Docs style editor with no AI training or data extraction

(cdox.ca) by jethronethro | view | 0 comments

Ask HN: Do you care if coding agents use your generated code for training?

by general_reveal | view | 2 comments

Show HN: Autonomous recovery for distributed training jobs

(docs.tensorpool.dev) by tsvoboda | view | 2 comments

Show HN: Per-instance TSP Solver with No Pre-training (1.66% gap on d1291)

by jivaprime | view | 0 comments

Show HN: A repo to turn any model into a reasoning model without training

(github.com) by Dl1683 | view | 0 comments

OpenAI to acquire Neptune, a startup that helps with AI model training

(cnbc.com) by pseudolus | view | 1 comment

OpenAI Agrees to Acquire Neptune to Improve AI Model Training

(bloomberg.com) by world2vec | view | 2 comments

China's tech giants take AI model training offshore to tap Nvidia chips

(ft.com) by alecco | view | 1 comment

Google denies analyzing your emails for AI training

(zdnet.com) by nreece | view | 0 comments

Training Agents Inside of Scalable World Models

(danijar.com) by CharlesW | view | 0 comments

Ask HN: How do you handle logging and evaluation when training ML models?

by calepayson | view | 2 comments

Open source x 3: GRPO training with OpenEnv, vLLM, and Oumi

(github.com) by stefanwebb | view | 0 comments

Miro: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

(arxiv.org) by PaulHoule | view | 0 comments

Don't Fight the Weights: Learn to Spot Contexts That Go Against Training

(dbreunig.com) by dbreunig | view | 0 comments

EU Violates Case Law in Proposed GDPR Big Tech AI Training Carve-Out

(noyb.eu) by piltdownman | view | 3 comments

Show HN: I built a tool to create custom OCR APIs in minutes, no training needed