Ollama's new MLX engine doubles local LLM speed on MacBook Air M5
Ollama’s new MLX engine roughly doubles local LLM inference speed on the MacBook Air M5, reduces quality loss through NVFP4 quantization, and introduces a snapshot system that benefits coding assistants.