Search | News by Netwrck

Study identifies weaknesses in how AI systems are evaluated

(oii.ox.ac.uk) by pseudolus | view | 192 comments

Anthropic raises $30B in Series G funding at $380B post-money valuation

(anthropic.com) by ryanhn | view | 409 comments

Adk-go: code-first Go toolkit for building, evaluating, and deploying AI agents

(github.com) by maxloh | view | 24 comments

Fast Lua runtime written in Rust

(astra.arkforge.net) by akagusu | view | 55 comments

Agent-o-rama: build, trace, evaluate, and monitor LLM agents in Java or Clojure

(blog.redplanetlabs.com) by yayitswei | view | 5 comments

Client-side GPU load balancing with Redis and Lua

(galileo.ai) by lneiman | view | 11 comments

Lua 5.5.0 (rc1) has been released for testing

(lua.org) by dottrap | view | 2 comments

SpaceX in Talks for Share Sale That Would Boost Valuation to $800B

(wsj.com) by bko | view | 127 comments

3.5B Accounts: Complete WhatsApp Directory Retrieved and Evaluated

(heise.de) by therealmarv | view | 1 comments

Silverbullet: Personal productivity platform built with Markdown and Lua

(github.com) by nateb2022 | view | 6 comments

Evaluating Multilingual, Context-Aware Guardrails: A Humanitarian LLM Use Case

(blog.mozilla.ai) by benbreen | view | 1 comments

Evaluating Uniform Memory Access Mode on AMD's Turin

(chipsandcheese.com) by zdw | view | 3 comments

Cursor Raises Funds at $29.3B Valuation

(bloomberg.com) by blahgeek | view | 7 comments

Luarrow – True pipeline operators and elegant Haskell-style function compositio

(github.com) by todsacerdoti | view | 4 comments

3.5B Accounts: Complete WhatsApp Directory Retrieved and Evaluated

(heise.de) by doener | view | 0 comments

Show HN: AA-Briefcase: a frontier knowledge work evaluation

(artificialanalysis.ai) by declanjackson | view | 2 comments

Michael Burry slams Tesla valuation, warns of 'ridiculous' dilution

(electrek.co) by breve | view | 4 comments

Deep Dive into G-Eval: How LLMs Evaluate Themselves

(medium.com) by zlatkov | view | 6 comments

Dash0 raises $110M Series B at $1B valuation

(dash0.com) by fred_ | view | 0 comments

Retracted: Safety Evaluation and Risk Assessment of the Herbicide Roundup

(sciencedirect.com:5037) by mindracer | view | 1 comments

AgentLens: The Future of Evaluation Is Agentic

(contextual.ai) by shikib | view | 2 comments

Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult

(simonwillison.net) by janpio | view | 2 comments

Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult

(simonwillison.net) by jonesn11 | view | 1 comments

AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Language Models

(arxiv.org) by declanjackson | view | 1 comments

Port the Lua REPL to the RP2350 (Chinese)

(ruanx.net) by uneven9434 | view | 0 comments

The mind-boggling valuations of AI companies

(theguardian.com) by devonnull | view | 0 comments

Kalshi raises $1B at $11B valuation, doubling value in under two months

(techcrunch.com) by ryan_j_naughton | view | 0 comments

Kalshi Reaches $11B Valuation as App Takes over America

(businesswire.com) by serial_dev | view | 7 comments

Browserbench.ai is launched to evaluate browser runtimes for AI Agents

(browserbench.ai) by idanraman | view | 3 comments

Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult

(simonw.substack.com) by hackthegibson2 | view | 1 comments

Public IPv4 addresses are now valuable loan collateral and can be worth millions

(tomshardware.com) by llamasushi | view | 0 comments

Ramp raises at $32B valuation

(ramp.com) by joshuawright11 | view | 0 comments

RAG Chunk: CLI tool to parse, chunk, and evaluate Markdown documents for RAG

(github.com) by handfuloflight | view | 0 comments

In the AI era, Wikipedia has never been more valuable

(wikimediafoundation.org) by speckx | view | 1 comments

AI valuation fears grip global investors as tech bubble concerns grow

(cnbc.com) by belter | view | 0 comments

Retracted: Safety Evaluation, Risk Assessment of Roundup/Glyphosate for Humans

(sciencedirect.com) by Someone | view | 0 comments

Ragas: Automated Evaluation of Retrieval Augmented Generation

(arxiv.org) by Anon84 | view | 0 comments

Libinput 1.30 adds support for Lua plugins

(lore.freedesktop.org) by qrobit | view | 0 comments

Evaluating the Effectiveness of LLM-Evaluators (a.k.a. LLM-as-Judge)

(eugeneyan.com) by jxmorris12 | view | 0 comments

Is AI Valuation Bubble About to Burst ?

(medium.com) by MindBreaker2605 | view | 0 comments

Lilly first drugmaker to hit $1T valuation on weight-loss demand

(reuters.com) by geox | view | 0 comments

Nvim-orgmode/orgmode: Orgmode clone written in Lua for Neovim

(github.com) by edward | view | 0 comments

Show HN: SelenAI – Terminal AI pair-programmer with sandboxed Lua tools

(github.com) by moridin | view | 0 comments

Michael Burry to close hedge fund as he warns on valuations

(ft.com) by amrrs | view | 2 comments

Chart: Even Next to Nvidia, Tesla's Valuation Looks Ludicrous

(statista.com) by ZeljkoS | view | 0 comments

Gamma raises $68M to challenge PowerPoint – profitable,52 people,$2.1B valuation

(nytimes.com) by haebom | view | 0 comments

Writing an LLM from scratch, part 26 – evaluating the fine-tuned model

(gilesthomas.com) by gpjt | view | 0 comments

The S&P 500 stands at the most extreme level of valuations in history

(hussmanfunds.com) by xqcgrek2 | view | 2 comments

Paul Krugman breaks down problems with SpaceX valuation [video]

(youtube.com) by jethronethro | view | 0 comments

Former GitHub CEO raises record $60M dev tool seed round at $300M valuation

(techcrunch.com) by AnhTho_FR | view | 0 comments