Models · Hacker News ·
Gemma 4 QAT models: Optimizing compression for mobile and laptop efficiency
Google introduced Gemma 4 quantization-aware training models designed to improve compression and efficiency on mobile and laptop devices. The post describes using QAT to reduce model size while preserving performance for local deployment.