I ran local llms on intel's cheapest igpu, and the results were surprisingly decent
Running 4‑7B parameter LLMs on an Intel N100‑based LattePanda Mu using llama.cpp and Vulkan delivers ~2.9 tokens/s, offering a cheap edge alternative to dedicated GPUs.