Fuck yes! Serverless GPUs for everyone! Cloud Run just shipped serverless GPUs with no quota request required. 🤯 Deploy @GoogleDeepMind Gemma with a single command!
- Pay-per-second GPU billing.
- Scale to zero instances.
- Time to first token (TTFT) of 19 seconds for a Gemma 3 model from a cold start.
- No quota request for L4 GPUs (3 by default).
