Search | News by Netwrck

Benchmarking leading AI agents against Google reCAPTCHA v2

(research.roundtable.ai) by mdahardy | view | 97 comments

Drawing Text Isn't Simple: Benchmarking Console vs. Graphical Rendering

(cv.co.hu) by PaulHoule | view | 41 comments

How Good Are Chinese CPUs? Benchmarking the Loongson 3A6000

(lemire.me) by ashvardanian | view | 1 comments

Benchmarking the Most Reliable Document Parsing API

(tensorlake.ai) by calavera | view | 14 comments

Benchmarking NVENC video transcoding on the Pi

(jeffgeerling.com) by ingve | view | 0 comments

Benchmarking KDB-X vs. QuestDB, ClickHouse, TimescaleDB and InfluxDB

(kx.com) by rustc | view | 0 comments

Benchmarking my Redis clone in Zig (a web dev learning systems)

(charlesfonseca.substack.com) by barddoo | view | 1 comments

CodSpeed CLI: Deterministic benchmarking for any executable

(github.com) by art049 | view | 0 comments

Benchmarking GPT-5.1 vs. Gemini 3.0 vs. Opus 4.5 across 3 Coding Tasks

(blog.kilo.ai) by heymax054 | view | 0 comments

Benchmarking LLMs at the Frontier of Physics

(artificialanalysis.ai) by mustaphah | view | 0 comments

Benchmarking Language Implementations: Am I doing it right? Get Early Feedback

(stefan-marr.de) by speckx | view | 0 comments

Powering AI at Scale: Benchmarking 1B Vectors in YugabyteDB

(yugabyte.com) by ashvardanian | view | 0 comments

Benchmarking the Cost of Java's EnumSet – A Second Look

(kinnen.de) by birdculture | view | 0 comments

Benchmarking multilingual long-context language models

(arxiv.org) by sysoleg | view | 0 comments

FlowerBench: Benchmarking AI Agents on Real Enterprise Work

(flower.ai) by dimitrisflwr | view | 1 comments

Giving a domain a hill to climb: benchmarking as data activation

(sparsethought.com) by galsapir | view | 0 comments

Ask HN: Is there a recognized standard for swarm intelligence benchmarking?

by stephanieriggs | view | 0 comments

Reverse Benchmarking

(dominiknitsch.com) by wseqyrku | view | 0 comments

Benchmarking node collision algorithms for React/Svelte Flow

(xyflow.com) by moklick | view | 0 comments

Show HN: Benchmark-ips-Python – benchmarking tool for Python

(github.com) by Igor_Wiwi | view | 0 comments

Benchmarking Checksum Tools

(heitorpb.github.io) by furkansahin | view | 0 comments

Dell Pro Max with GB10 Arrives for Linux Performance Benchmarking Review

(phoronix.com) by rbanffy | view | 0 comments

Benchmarking the Thomson Reuters legal agent

(thomsonreuters.com) by gk1 | view | 0 comments

Benchmarking the AMD EPYC 9V64H: Azure HBv5's Custom AMD CPU with HBM3

(phoronix.com) by ashvardanian | view | 0 comments

Benchmarking Qwen 3.6 35B MoE (3B active) on an RTX 3090

(gilesthomas.com) by gpjt | view | 0 comments

Ask HN: Is anyone giving out tokens for benchmarking LLMs?

by emosenkis | view | 0 comments

Benchmarking Slop, Introducing Slop Index (A Fun Slashy Research Project)

(twitter.com) by hgaddipa001 | view | 0 comments

Agent Arena: Benchmarking AI Agent Devtool Onboarding

(2027.dev) by karlmush | view | 1 comments

Wolfram LLM Benchmarking Project

(wolfram.com) by rzk | view | 0 comments

Keynote: Benchmarking – It's About Time – Matt Godbolt – C++Now 2026 [video]

(youtube.com) by SleepyMyroslav | view | 0 comments

Benchmarking gRPC Load Balancing on K8s in 2026: Linkerd vs. Istio vs. Cilium

(buoyant.io) by darksoul | view | 0 comments

Benchmarking 15 "E-Waste" GPUs with Modern Workloads

(esologic.com) by eso_logic | view | 0 comments

Benchmarking Cloudflare Containers vs. AWS MicroVMs

(v2.alchemy.run) by mariuz | view | 0 comments

Benchmarking quantum advantage: Quantum Advantage Tracker

(quantum-advantage-tracker.github.io) by Alien1Being | view | 0 comments

Benchmarking Coding Agents on Databricks' Multi-Million Line Codebase

(databricks.com) by tanelpoder | view | 0 comments

Show HN: VetoBench – benchmarking AI memory beyond retrieval

(github.com) by mart1adelina | view | 0 comments

Show HN: Token-saviour – agent skill from benchmarking 9 token-saving tools

(github.com) by vagkaratzas | view | 0 comments

Not All Miles Are Equal: Benchmarking Autonomous Safety

(waymo.com) by xnx | view | 0 comments

From Words to Watts: Benchmarking the Energy Costs of LLM Inference (2023)

(arxiv.org) by teleforce | view | 0 comments

Same Query, Three Results: Benchmarking ParadeDB and Postgres FTS

(paradedb.com) by jamesgresql | view | 0 comments

SlopCodeBench: Benchmarking How Coding Agents Degrade over Long, Iterative Tasks

(arxiv.org) by softwaredoug | view | 0 comments

Benchmarking Hardwood 1.0 on a Threadripper 9980X

(jack-vanlightly.com) by rmoff | view | 0 comments

Shard your locks: benchmarking 6 Go cache designs – Beyond the Happy Path

(strebkov.dev) by atkrad | view | 0 comments

PCB-Bench: Benchmarking LLMs for PCB Placement and Routing (ICLR 2026)

(github.com) by teleforce | view | 0 comments

Benchmarking real-time voice translation

(startpinch.com) by christiansafka | view | 0 comments

Shard your locks: benchmarking 6 Golang cache designs

(strebkov.dev) by fanf2 | view | 0 comments

Tuning a Server for Benchmarking

(david.alvarezrosa.com) by dalvrosa | view | 0 comments

Benchmarking AI Gateways: GoModel vs. LiteLLM vs. Portkey vs. Bifrost

(enterpilot.io) by santiago-pl | view | 1 comments

GLM-5.2 (Max) API Provider Benchmarking and Analysis

(artificialanalysis.ai) by codycharris | view | 0 comments

Benchmarking four open-source geo-experiment tools against known ground truth

(research.getrecast.com) by mrsoli | view | 0 comments