Gemini API Gets a Batch Mode for High-Throughput Workloads

2025-07-11
Gemini API Gets a Batch Mode for High-Throughput Workloads

Google's Gemini API now offers a batch mode, an asynchronous endpoint ideal for high-throughput tasks where latency isn't critical. Submit large jobs, let the system handle processing, and retrieve results within 24 hours at a 50% discount compared to synchronous APIs. Perfect for pre-prepared data needing no immediate response, it offers cost savings, increased throughput, and simplified API calls. Reforged Labs uses it to process massive video ads, significantly improving efficiency and lowering costs. Get started easily with the Google GenAI Python SDK.