devops

Use Your Mac for AI Agents: Self-Host Gemma 4 12 B with Pulumi and Tailscale

Published: June 4, 2026 Source: pulumi.com 1 min read

You are reading a summary. The full content is hosted on pulumi.com.

Open-weight models now run well on consumer hardware, allowing data to stay local and inference to work offline. The Gemma 4 12 B model can be run on a modern Mac, providing high-quality language processing capabilities. This setup uses llama.cpp for host-native inference, k3d for a local Kubernetes cluster, Pulumi for infrastructure as code, and Tailscale for secure access.

Read the full article on the original website

External link to pulumi.com