Short · YouTube Shorts
Gemma 4 AI runs up to 3x faster with no reported loss in accuracy
Google says a new speculative decoding technique lets its Gemma 4 AI models run inference up to 3x faster, with no reported loss in accuracy.
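For readers unfamiliar with the term, here is a minimal sketch of the general idea behind speculative decoding, in its simplest greedy form. It is not Google's implementation, and the source does not say which variant Gemma 4 uses. The names `target_model`, `draft_model`, `greedy_decode`, and `speculative_decode` are hypothetical, and the two "models" are toy arithmetic functions standing in for a large model and a small, fast one. The sketch shows why accuracy can be preserved: the small model only proposes tokens, and every token that is kept is one the large model would have produced anyway.

```python
from typing import Callable, List

# Hypothetical stand-in type: a "model" maps a token sequence to its greedy next token.
Model = Callable[[List[int]], int]


def target_model(seq: List[int]) -> int:
    """Toy stand-in for the large, slow model (assumption, not a real LLM)."""
    return (seq[-1] * 3 + 1) % 7


def draft_model(seq: List[int]) -> int:
    """Toy stand-in for the small, fast model: agrees with the target most of the time."""
    return target_model(seq) if len(seq) % 3 else 0


def greedy_decode(target: Model, prompt: List[int], n_new: int) -> List[int]:
    """Baseline: one target-model call per generated token."""
    seq = list(prompt)
    for _ in range(n_new):
        seq.append(target(seq))
    return seq


def speculative_decode(target: Model, draft: Model, prompt: List[int],
                       n_new: int, k: int = 4) -> List[int]:
    """Greedy speculative decoding: draft proposes up to k tokens, target verifies them."""
    seq = list(prompt)
    goal = len(prompt) + n_new
    while len(seq) < goal:
        # Step 1: the draft model cheaply proposes a short run of tokens.
        proposal: List[int] = []
        for _ in range(min(k, goal - len(seq))):
            proposal.append(draft(seq + proposal))

        # Step 2: the target model checks each proposed position.
        # A real system would score all positions in one batched forward pass,
        # which is where the speedup comes from; here we loop for clarity.
        for i, token in enumerate(proposal):
            expected = target(seq + proposal[:i])
            if token != expected:
                # First mismatch: keep the agreed prefix plus the target's own token,
                # and discard the rest of the proposal.
                seq.extend(proposal[:i])
                seq.append(expected)
                break
        else:
            # Every proposed token matched the target's choice.
            seq.extend(proposal)
    return seq


if __name__ == "__main__":
    prompt = [2]
    baseline = greedy_decode(target_model, prompt, 12)
    fast = speculative_decode(target_model, draft_model, prompt, 12)
    print(baseline == fast)
```

In this greedy form the output is identical to ordinary decoding by construction, so any gain depends only on how often the draft model's guesses are accepted. Variants that sample tokens rather than taking the most likely one use a rejection-sampling acceptance rule instead of the exact-match check shown here.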