News
Latest
Top
Search
Submit
Login
Search
▲
124
Benchmarking leading AI agents against Google reCAPTCHA v2
(research.roundtable.ai)
by mdahardy |
view
|
97 comments
▲
53
Drawing Text Isn't Simple: Benchmarking Console vs. Graphical Rendering
(cv.co.hu)
by PaulHoule |
view
|
41 comments
▲
31
How Good Are Chinese CPUs? Benchmarking the Loongson 3A6000
(lemire.me)
by ashvardanian |
view
|
1 comments
▲
28
Benchmarking the Most Reliable Document Parsing API
(tensorlake.ai)
by calavera |
view
|
14 comments
▲
10
Benchmarking NVENC video transcoding on the Pi
(jeffgeerling.com)
by ingve |
view
|
0 comments
▲
5
Benchmarking KDB-X vs. QuestDB, ClickHouse, TimescaleDB and InfluxDB
(kx.com)
by rustc |
view
|
0 comments
▲
4
Benchmarking my Redis clone in Zig (a web dev learning systems)
(charlesfonseca.substack.com)
by barddoo |
view
|
1 comments
▲
3
CodSpeed CLI: Deterministic benchmarking for any executable
(github.com)
by art049 |
view
|
0 comments
▲
3
Benchmarking GPT-5.1 vs. Gemini 3.0 vs. Opus 4.5 across 3 Coding Tasks
(blog.kilo.ai)
by heymax054 |
view
|
0 comments
▲
3
Benchmarking LLMs at the Frontier of Physics
(artificialanalysis.ai)
by mustaphah |
view
|
0 comments
▲
3
Benchmarking Language Implementations: Am I doing it right? Get Early Feedback
(stefan-marr.de)
by speckx |
view
|
0 comments
▲
3
Powering AI at Scale: Benchmarking 1B Vectors in YugabyteDB
(yugabyte.com)
by ashvardanian |
view
|
0 comments
▲
3
Benchmarking the Cost of Java's EnumSet – A Second Look
(kinnen.de)
by birdculture |
view
|
0 comments
▲
3
Benchmarking multilingual long-context language models
(arxiv.org)
by sysoleg |
view
|
0 comments
▲
2
Reverse Benchmarking
(dominiknitsch.com)
by wseqyrku |
view
|
0 comments
▲
2
Benchmarking node collision algorithms for React/Svelte Flow
(xyflow.com)
by moklick |
view
|
0 comments
▲
2
Show HN: Benchmark-ips-Python – benchmarking tool for Python
(github.com)
by Igor_Wiwi |
view
|
0 comments
▲
2
Benchmarking Checksum Tools
(heitorpb.github.io)
by furkansahin |
view
|
0 comments
▲
2
Dell Pro Max with GB10 Arrives for Linux Performance Benchmarking Review
(phoronix.com)
by rbanffy |
view
|
0 comments
▲
2
Benchmarking the Thomson Reuters legal agent
(thomsonreuters.com)
by gk1 |
view
|
0 comments
▲
2
Benchmarking the AMD EPYC 9V64H: Azure HBv5's Custom AMD CPU with HBM3
(phoronix.com)
by ashvardanian |
view
|
0 comments
▲
1
Show HN: LOAB – benchmarking AI process fidelity in lending
(github.com)
by shubh-chat |
view
|
0 comments
▲
1
Show HN: Benchmarking the Keep memory system with LoCoMo
(keepnotes.ai)
by inguz |
view
|
0 comments
▲
1
Seeing Is Not Believing: Benchmarking AI Image Detectors
(blog.succinct.xyz)
by ncb9094 |
view
|
0 comments
▲
1
Benchmarking the best base small model for fine-tuning
(distillabs.ai)
by maciejgryka |
view
|
0 comments
▲
1
In Pursuit of High-Fidelity GPU Kernel Benchmarking
(standardkernel.com)
by matt_d |
view
|
0 comments
▲
1
I spent $100 benchmarking LLM providers on a weekend CTF
by wwdmaxwell |
view
|
0 comments
▲
1
Benchmarking 5 concurrent map implementations in Go (incl. sync.Map)
(github.com)
by puzpuzpuz-hn |
view
|
1 comments
▲
1
Vibenchmarking different JSON schema validator CLI tools
(github.com)
by whacked_new |
view
|
0 comments
▲
1
Benchmarking STT for Voice Agents – 10 Services, 1k Samples, Semantic WER
(daily.co)
by edgarsDev |
view
|
1 comments
▲
1
Benchmarking Apple Silicon unified memory for GPU-accelerated SQL analytics
(github.com)
by sadopc |
view
|
1 comments
▲
1
Benchmarking CDC Tools: Supermetal vs. Debezium vs. Flink CDC
(streamingdata.tech)
by sap1enz |
view
|
0 comments
▲
1
Benchmarking Automatic Typesetting Systems
(news.speedata.de)
by patrickg |
view
|
1 comments
▲
1
Ask HN: Is "Low Velocity" Just "High Drag"? (Benchmarking Series B)
by berkanduzgun |
view
|
0 comments
▲
1
Benchmarking Claude C Compiler
(dineshgdk.substack.com)
by dinesh_gdk |
view
|
1 comments
▲
1
Portfolio/Investment Growth Benchmarking
(finbodhi.com)
by ciju |
view
|
0 comments
▲
1
Benchmarking On-Device LLMs on iPhone and iPad Using MLX
(rickytakkar.com)
by nullnotzero |
view
|
0 comments
▲
1
Benchmarking how well LLMs can play FizzBuzz
(huggingface.co)
by _venkatasg |
view
|
1 comments
▲
1
Jsbench – AI-written scriptable HTTP benchmarking tool
(github.com)
by zhidao9 |
view
|
0 comments
▲
1
BalatroBench – Benchmarking LLMs' Strategic Performance Through Games
(balatrobench.com)
by S1M0N38-hn |
view
|
0 comments
▲
1
Show HN: Guro – Python CLI system monitoring, benchmarking and telemetry tool
(github.com)
by akadhanu |
view
|
0 comments
▲
1
Bencher – Continuous Benchmarking
(github.com)
by sea-gold |
view
|
0 comments
▲
1
Benchmarking STT providers on real calls (Deepgram 15.9% vs. OpenAI 39.8% WER)
(twitter.com)
by pstrav |
view
|
1 comments
▲
1
Benchmarking LLMs for Voice Agent Use Cases
(daily.co)
by benlower |
view
|
0 comments
▲
1
Advancing AI Benchmarking with Game Arena
(blog.google)
by salkahfi |
view
|
0 comments
▲
1
CooperBench: Benchmarking AI Agents' Cooperation
(cooperbench.com)
by embedding-shape |
view
|
0 comments
▲
1
Benchmarking with Vulkan: the curse of variable GPU clock rates
(mropert.github.io)
by ingve |
view
|
0 comments
▲
1
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
(arxiv.org)
by darshandesh1504 |
view
|
1 comments
▲
1
CooperBench: Benchmarking AI Agents' Cooperation
(cooperbench.com)
by SomaticPirate |
view
|
0 comments
▲
1
Benchmarking the JDBC Bottleneck in Trino
(starburst.io)
by abadid |
view
|
1 comments