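Running 21 AI models for coding on a MacBook Air M5, with surprising results: some are blazing fast but write poor code, while others write excellent code but are painfully slow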

I benchmarked 21 local LLMs on a MacBook Air M5 for code quality AND speed

There are plenty of "bro trust me, this model is better for coding" discussions out there. I wanted to replace the vibes with actual data: which models write correct code, and how fast they run on real hardware, all tested under identical conditions so the results are directly comparable. No cherry-picked prompts, no subjective impressions, just pass@1 on 164 coding problems with an expanded test suite.

Hardware: MacBook Air M5, 32 GB unified memory

Quantization: Q4_K_M for all models via llam