Google's Gemma 4 isn't the smartest local LLM I've run, but it's the one I reach for most
Google's Gemma 4 models offer a rare balance of speed and quality for local AI, with a 26B MoE variant that activates just 3.8B parameters per token and edge models capable of offline voice and vision tasks.