往下拉回到首頁
花大錢買最新 AI 根本浪費?我們測試 18 個 LLM 做 OCR,結果便宜舊模型反而贏了,完整測試資料全部免費公開

花大錢買最新 AI 根本浪費?我們測試 18 個 LLM 做 OCR,結果便宜舊模型反而贏了,完整測試資料全部免費公開

We benchmarked 18 LLMs on OCR (7k+ calls) — cheaper/old models oftentimes win. Full dataset + framework open-sourced.

TLDR; We were overpaying for OCR, so we compared flagship models with cheaper and older models. New mini-bench + leaderboard. Free tool to test your own documents. Open Source. We’ve been looking at OCR / document extraction workflows and kept seeing the same pattern: Too many teams are either stuck in legacy OCR pipelines, or are overpaying badly for LLM calls by defaulting to the newest/ biggest model. We put together a curated set of 42 standard documents and ran every model 10 times under