Google Gemini
Google's multimodal AI model APIs for text, image, audio, and video understanding.
APIs
Gemini API
Generate content using Google's Gemini models with text, image, audio, and video inputs.
Gemini Pro API
Advanced reasoning and complex task handling.
Gemini Pro Vision API
Multimodal understanding of text and images.
Gemini Ultra API
Most capable model for highly complex tasks.
Gemini Embedding API
Generate text embedding vectors for semantic search, classification, clustering, and retrieval tasks using the gemini-embedding-001 model.
Gemini Live API
Low-latency real-time voice and video interactions with Gemini using WebSockets for streaming multimodal input and output.
Gemini Context Caching API
Cache input tokens for repeated use across multiple requests to reduce costs and improve latency for large context workloads.
Gemini Fine-Tuning API
Customize Gemini model behavior for specific tasks using supervised fine-tuning with your own training data.
Gemini Interactions API
Unified interface for interacting with Gemini models and agents providing a consistent way to manage multi-turn conversations and tool use.
Vertex AI Gemini API
Enterprise-grade access to Gemini models through Google Cloud Vertex AI with advanced features including grounding, safety filters, and regional endpoints.
Vertex AI Imagen API
Generate and edit images using Google Imagen models on Vertex AI for high-quality image creation from text prompts.
Vertex AI Gemini Live API
Enterprise real-time multimodal streaming API on Vertex AI for building low-latency voice and video AI agents.
Vertex AI Text Embeddings API
Generate text embeddings for semantic search and classification tasks using Google embedding models on Vertex AI.
Firebase AI Logic API
Access Gemini API capabilities through Firebase SDKs for mobile and web applications with built-in security and authentication.