Local LLM gets dumber over time on RTX 5090 due to context window and memory issues
Running Qwen 3.6 27B on an RTX 5090 reveals how context window management and memory pressure can make local LLMs appear to degrade over time, with simple restarts restoring performance.
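One way to picture the context-window side of this: a restart clears the accumulated conversation, and trimming old turns to a fixed token budget may give a similar effect without restarting. The sketch below is only an illustration under stated assumptions, not the setup described here. The names `estimate_tokens` and `trim_history` are hypothetical, the message format (dicts with `role` and `content`) is assumed, and the four-characters-per-token estimate is a crude stand-in for the model's real tokenizer.

```python
from typing import Dict, List

Message = Dict[str, str]


def estimate_tokens(text: str) -> int:
    """Rough token count. Assumption: about 4 characters per token.

    A real setup would use the model's own tokenizer instead.
    """
    return max(1, len(text) // 4)


def trim_history(messages: List[Message], budget: int) -> List[Message]:
    """Keep a leading system prompt (if present) plus the newest turns
    that fit within `budget` estimated tokens. Older turns are dropped."""
    if messages and messages[0]["role"] == "system":
        system, rest = [messages[0]], messages[1:]
    else:
        system, rest = [], list(messages)

    used = sum(estimate_tokens(m["content"]) for m in system)
    kept: List[Message] = []

    # Walk from newest to oldest, stopping at the first turn that does not fit.
    for message in reversed(rest):
        cost = estimate_tokens(message["content"])
        if used + cost > budget:
            break
        kept.append(message)
        used += cost

    # Restore chronological order before returning.
    return system + list(reversed(kept))
```

Calling `trim_history(history, budget)` before each request keeps the prompt bounded. The budget would need to sit comfortably below the context length the model is actually loaded with, which is itself limited by available GPU memory.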