Info This post is auto-generated from RSS feed Hacker News. Source: Accelerating Gemma 4: faster inference with multi-token prediction drafters