News
▲
416
Study identifies weaknesses in how AI systems are evaluated
(oii.ox.ac.uk)
by pseudolus |
view
|
192 comments
▲
86
Adk-go: code-first Go toolkit for building, evaluating, and deploying AI agents
(github.com)
by maxloh |
view
|
24 comments
▲
84
Fast Lua runtime written in Rust
(astra.arkforge.net)
by akagusu |
view
|
55 comments
▲
80
Agent-o-rama: build, trace, evaluate, and monitor LLM agents in Java or Clojure
(blog.redplanetlabs.com)
by yayitswei |
view
|
5 comments
▲
54
Client-side GPU load balancing with Redis and Lua
(galileo.ai)
by lneiman |
view
|
11 comments
▲
53
Lua 5.5.0 (rc1) has been released for testing
(lua.org)
by dottrap |
view
|
2 comments
▲
46
SpaceX in Talks for Share Sale That Would Boost Valuation to $800B
(wsj.com)
by bko |
view
|
127 comments
▲
41
3.5B Accounts: Complete WhatsApp Directory Retrieved and Evaluated
(heise.de)
by therealmarv |
view
|
1 comment
▲
34
Silverbullet: Personal productivity platform built with Markdown and Lua
(github.com)
by nateb2022 |
view
|
6 comments
▲
29
Evaluating Uniform Memory Access Mode on AMD's Turin
(chipsandcheese.com)
by zdw |
view
|
3 comments
▲
25
Cursor Raises Funds at $29.3B Valuation
(bloomberg.com)
by blahgeek |
view
|
7 comments
▲
24
Luarrow – True pipeline operators and elegant Haskell-style function composition
(github.com)
by todsacerdoti |
view
|
4 comments
▲
21
3.5B Accounts: Complete WhatsApp Directory Retrieved and Evaluated
(heise.de)
by doener |
view
|
0 comments
▲
11
Michael Burry slams Tesla valuation, warns of 'ridiculous' dilution
(electrek.co)
by breve |
view
|
4 comments
▲
11
Deep Dive into G-Eval: How LLMs Evaluate Themselves
(medium.com)
by zlatkov |
view
|
6 comments
▲
10
Retracted: Safety Evaluation and Risk Assessment of the Herbicide Roundup
(sciencedirect.com:5037)
by mindracer |
view
|
1 comment
▲
7
AgentLens: The Future of Evaluation Is Agentic
(contextual.ai)
by shikib |
view
|
2 comments
▲
7
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult
(simonwillison.net)
by janpio |
view
|
2 comments
▲
6
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult
(simonwillison.net)
by jonesn11 |
view
|
1 comment
▲
6
AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Language Models
(arxiv.org)
by declanjackson |
view
|
1 comment
▲
6
Port the Lua REPL to the RP2350 (Chinese)
(ruanx.net)
by uneven9434 |
view
|
0 comments
▲
6
The mind-boggling valuations of AI companies
(theguardian.com)
by devonnull |
view
|
0 comments
▲
5
Kalshi raises $1B at $11B valuation, doubling value in under two months
(techcrunch.com)
by ryan_j_naughton |
view
|
0 comments
▲
5
Kalshi Reaches $11B Valuation as App Takes over America
(businesswire.com)
by serial_dev |
view
|
7 comments
▲
5
Browserbench.ai is launched to evaluate browser runtimes for AI Agents
(browserbench.ai)
by idanraman |
view
|
3 comments
▲
5
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult
(simonw.substack.com)
by hackthegibson2 |
view
|
1 comment
▲
5
Public IPv4 addresses are now valuable loan collateral and can be worth millions
(tomshardware.com)
by llamasushi |
view
|
0 comments
▲
5
Ramp raises at $32B valuation
(ramp.com)
by joshuawright11 |
view
|
0 comments
▲
5
RAG Chunk: CLI tool to parse, chunk, and evaluate Markdown documents for RAG
(github.com)
by handfuloflight |
view
|
0 comments
▲
5
In the AI era, Wikipedia has never been more valuable
(wikimediafoundation.org)
by speckx |
view
|
1 comment
▲
5
AI valuation fears grip global investors as tech bubble concerns grow
(cnbc.com)
by belter |
view
|
0 comments
▲
4
Retracted: Safety Evaluation, Risk Assessment of Roundup/Glyphosate for Humans
(sciencedirect.com)
by Someone |
view
|
0 comments
▲
4
Ragas: Automated Evaluation of Retrieval Augmented Generation
(arxiv.org)
by Anon84 |
view
|
0 comments
▲
4
Libinput 1.30 adds support for Lua plugins
(lore.freedesktop.org)
by qrobit |
view
|
0 comments
▲
4
Evaluating the Effectiveness of LLM-Evaluators (a.k.a. LLM-as-Judge)
(eugeneyan.com)
by jxmorris12 |
view
|
0 comments
▲
4
Is AI Valuation Bubble About to Burst?
(medium.com)
by MindBreaker2605 |
view
|
0 comments
▲
4
Lilly first drugmaker to hit $1T valuation on weight-loss demand
(reuters.com)
by geox |
view
|
0 comments
▲
4
Nvim-orgmode/orgmode: Orgmode clone written in Lua for Neovim
(github.com)
by edward |
view
|
0 comments
▲
4
Show HN: SelenAI – Terminal AI pair-programmer with sandboxed Lua tools
(github.com)
by moridin |
view
|
0 comments
▲
4
Michael Burry to close hedge fund as he warns on valuations
(ft.com)
by amrrs |
view
|
2 comments
▲
4
Chart: Even Next to Nvidia, Tesla's Valuation Looks Ludicrous
(statista.com)
by ZeljkoS |
view
|
0 comments
▲
4
Gamma raises $68M to challenge PowerPoint – profitable,52 people,$2.1B valuation
(nytimes.com)
by haebom |
view
|
0 comments
▲
4
Writing an LLM from scratch, part 26 – evaluating the fine-tuned model
(gilesthomas.com)
by gpjt |
view
|
0 comments
▲
4
The S&P 500 stands at the most extreme level of valuations in history
(hussmanfunds.com)
by xqcgrek2 |
view
|
2 comments
▲
3
Creating C closures from Lua closures
(lowkpro.com)
by publicdebates |
view
|
0 comments
▲
3
The LLM Evaluation Guidebook
(huggingface.co)
by aratahikaru5 |
view
|
0 comments
▲
3
Evaluating Uniform Memory Access Mode on AMD's Turin Ft. Verda
(chipsandcheese.com)
by pella |
view
|
0 comments
▲
3
Revolut hits $75B valuation
(news.crunchbase.com)
by rudderdev |
view
|
3 comments
▲
3
Musk's xAI in advanced talks to raise $15B at $230B valuation
(reuters.com)
by iamtech |
view
|
0 comments
▲
3
AI Valuation Bubble Will Burst and Nobody Is Ready
(medium.com)
by MindBreaker2605 |
view
|
0 comments