opinionsreddit2026年4月20日下午09:01

我在 MacBook Air M5 上對 21 個本地大語言模型進行了程式碼品質和速度的基準測試

I benchmarked 21 local LLMs on a MacBook Air M5 for code quality AND speed

There are plenty of "bro trust me, this model is better for coding" discussions out there. I wanted to replace the vibes with actual data: which model writes correct code and how fast does it run on real hardware, tested under identical conditions so the results are directly comparable. No cherry-picked prompts, no subjective impressions, just pass@1 on 164 coding problems with an expanded test suite. Hardware: MacBook Air M5, 32 GB unified memory Quantization: Q4_K_M for all models via llam

閱讀原文 →

相關報導

OpenAI 直播活動

ChatGPT 圖像 2.0 來了，生成圖片的能力大升級

「再等六個月就會變好」的說法只撐過一輪就破功了

Mistral Medium 3.5 在 AMD Strix Halo 上跑起來超慢，準備好熬夜吧