Benchmarking - DEV Community

Skip to content

DEV Community

👋 Sign in for the ability to sort posts by relevant, latest, or top.

Jul 22

SQLAlchemy ORM Security: The Raw Query Escape Hatch

#security #ai #benchmarking

5 min read

Jul 22

A Benchmark Smelled Funny

#go #performance #benchmarking #testing

9 min read

Robin

Jul 17

If 30% of Coding Tasks May Be Broken, Your Leaderboard Needs an Uncertainty Budget

#ai #testing #reliability #benchmarking

3 min read

Jul 14

Benchmarking Apple's SpeechAnalyzer API vs. Whisper: Performance, Accuracy, and Use Cases

#ai #benchmarking #apple #speechanalyzer

2 min read

Pneumetron

Jul 12

IdeaGene-Bench: A New Benchmark for Scientific Lineage Reasoning in AI

#aiml #benchmarking #scientificdiscovery #llms

4 min read

Jaydeep Shah (JD)

Jul 6

How I Benchmarked an LLM Running Entirely on a Phone (No Cloud, No API)

#edgeai #android #litertlm #benchmarking

16 min read

Jun 30

prima.cpp local llm benchmark: 15% Faster Than llama.cpp

#ai #llm #localllm #benchmarking

8 min read

Jul 8

My Code, My Test, and My Prompt All Agreed. All Three Were Wrong.

#ai #llm #machinelearning #benchmarking

10 min read

Jun 16

Building an Official Performance Baseline for Vix.cpp Core v2.6.3

#cpp #performance #benchmarking #opensource

3 min read

Jun 16

I measure how fast 42 LLMs actually answer. Here's the honest method.

#llm #ai #benchmarking #performance

2 min read

Pavel Kostromin

Jun 21

Comparing Node.js Postgres Client Libraries: brianc/node-postgres vs. porsager/postgres for Efficiency and Use Cases

#node #postgres #performance #benchmarking

10 min read

Isaiah Kim

Jul 10

I benchmarked my document-extraction API against Textract and Google DocAI — on public datasets, in public CI

#ai #ocr #benchmarking #opensource

3 min read

Jul 4

My AI memory benchmark said 98.3%. The number was true — and worthless.

#ai #mcp #opensource #benchmarking

4 min read

AI Explore

Jul 5

The Mean Is Lying to You: Benchmarks Hide the Variance That Breaks Prod

#ai #llm #evaluation #benchmarking

5 min read

Jul 2

Is AI-Generated Code Buggier? The 2025-26 Data

#benchmarking #ai #security

3 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.