Runs entirely in your browser via WebGPU. First run downloads the model (~3.3 GB) and caches it. No server, no data leaves your machine.