Tag

Local Llm

Stories with this tag. Sections and all tags live in the Topics menu; for full-text use search.

Co-occur with these stories — for navigation and internal links.

Google's Gemma 4 Replaces Claude Pro in Homelab Setup, Ending $20/Month Subscription

Shekhar Vaidya replaces Claude Pro with Google's Gemma 4 and Tailscale, saving $240 annually through a self-hosted homelab setup.
May 29, 2026
self-hosted AI Gemma 4 Tailscale Open WebUI local LLM
Tailscale is the only home lab change I made this year that I actually noticed

A home lab enthusiast describes how Tailscale evolved from a remote-access tool into the central management layer for every device, container, and AI workload in their setup.
May 25, 2026
tailscale home lab networking overlay network local llm container management
AMD's Ryzen AI Halo brings 128 GB unified memory to compact AI workstations

AMD's Ryzen AI Halo mini PC runs local LLMs with 128 GB unified memory and a 650 TOPS NPU — priced at $3,999, with 400-series upgrades promising 192 GB RAM and 300B-model support.
May 21, 2026
AMD Ryzen AI Halo local LLM mini PC ROCm Nvidia DGX Spark
RTX 5090 vs Apple Silicon for local LLMs: the memory gap nobody expected

The RTX 5090's 32GB VRAM can't hold the biggest local LLMs, while Apple Silicon's unified memory lets a Mac Studio run DeepSeek R1 671B — at a fraction of the power draw.
May 14, 2026
local-llm apple-silicon rtx-5090 deepseek-r1 unified-memory mlx
I finally found an open-source local LLM that actually competes with cloud AI

Google DeepMind's Gemma 4 E4B open-weight model offers competitive performance for local AI tasks, with strong image and audio capabilities, challenging cloud AI dominance for privacy-focused users.
May 12, 2026
Gemma 4 local LLM Google DeepMind open-source AI
Qwen 2.5 is the local LLM that powers a smart home without cloud dependency

Author runs Qwen 2.5 locally on a NAS for smart home automation instead of relying on cloud models like Claude, citing privacy, cost, and hardware fit.
May 10, 2026
qwen 2.5 local llm smart home automation nas ai open-weight model home assistant
How I Built a Free Local LLM Pipeline on a 10-Year-Old GTX 1080 with llama.cpp

A ten-year-old GTX 1080 and a Vulkan-powered llama.cpp setup deliver 15 tokens per second on 26-billion-parameter models — proving that self-hosted local LLMs can be both free and capable.
May 10, 2026
local llm llama.cpp self-hosted AI GPU passthrough Gemma open source inference Mixture of Experts
local llms changed how i use home assistant and now my smart devices actually listen

A Home Assistant power user details how local LLMs, a smartphone voice satellite, and MCP servers replaced dedicated hardware and reshaped their smart home control workflow.
April 23, 2026
Home Assistant local LLM MCP voice assistant Ollama

Related tags