Posts by Shrestha Basu Mallick

14 results

Clear filters
  • MAY 23, 2025 / Gemini

    Gemini API I/O updates

    Announcing new features and models for the Gemini API, with the introduction of Gemini 2.5 Flash Preview with improved reasoning and efficiency, Gemini 2.5 Pro and Flash text-to-speech supporting multiple languages and speakers, and Gemini 2.5 Flash native audio dialog for conversational AI.

    Gemini_API_metadata
  • MAY 20, 2025 / Gemini

    Building agents with Google Gemini and open source frameworks

    Google Gemini models offer several advantages when building AI agents, such as advanced reasoning, function calling, multimodality, and large context window capabilities. Open-source frameworks like LangGraph, CrewAI, LlamaIndex, and Composio can be used with Gemini for agent development.

    Building agents with Google Gemini and open source frameworks
  • APRIL 23, 2025 / Gemini

    Achieve real-time interaction: Build with the Live API

    Explore real world applications for the Live API for Gemini models, now updated to include enhanced features for real-time audio, video, and text processing, improved session management, control over interactions, and richer output options.

    gemini-live-api-meta
  • APRIL 9, 2025 / Gemini

    Gemini 2.5 Flash and Pro, Live API, and Veo 2 in the Gemini API

    Updates to the Gemini API, including the production readiness of Veo 2 for video generation, the preview of the Live API for real-time interactions, and the upcoming Gemini 2.5 Flash model, alongside the existing Gemini 2.5 Pro aim to enhance developer capabilities in building AI applications with improved thinking models, dynamic interactions, and high-quality video generation.

    Gemini-2.5-meta
  • FEB. 25, 2025 / Gemini

    Start building with Gemini 2.0 Flash and Flash-Lite

    Gemini 2.0 Flash-Lite is now generally available in the Gemini API for production use in Google AI Studio and for enterprise customers on Vertex AI. 2.0 Flash-Lite offers improved performance over 1.5 Flash across reasoning, multimodal, math and factuality benchmarks. For projects that require long context windows, 2.0 Flash-Lite is an even more cost-effective solution, with simplified pricing for prompts more than 128K tokens.

    Flash Family
  • FEB. 5, 2025 / Gemini

    Gemini 2.0: Flash, Flash-Lite and Pro

    The Gemini 2.0 model family is seeing significant updates, including the release of Gemini 2.0 Flash, which is now production-ready and boasts higher rate limits, enhanced performance, and simplified pricing. Developers can also start testing an updated experimental version of Gemini 2.0 Pro today. Additionally, a new variant called Gemini 2.0 Flash-Lite, specifically designed for large-scale workloads, will be made available next week.

    Gemini 2.0 family expands for developers: Flash, Flash Lite and Pro
  • DEC. 23, 2024 / AI

    Gemini 2.0: Level Up Your Apps with Real-Time Multimodal Interactions

    The Multimodal Live API for Gemini 2.0 enables real-time multimodal interactions between humans and computers, and can be used to build real-time virtual assistants and adaptive educational tools.

    Stream-meta
  • DEC. 11, 2024 / Gemini

    The next chapter of the Gemini era for developers

    Gemini 2.0 Flash has enhanced capabilities like multimodal outputs and native tool use, and introduces new coding agents to improve developer productivity, now available for testing in Google AI Studio.

    Gemini 2.0
  • OCT. 31, 2024 / Gemini

    Gemini API and Google AI Studio now offer Grounding with Google Search

    Grounding with Google Search for the Gemini API and Google AI Studio enhances the accuracy and freshness of Gemini's responses by leveraging Google Search data.

    GeminiGrounding_Header
  • OCT. 3, 2024 / Gemini

    Gemini 1.5 Flash-8B is now production ready

    Google is rolling out (in GA) a new Gemini model, 1.5 Flash-8B, our smallest production model with SOTA performance for its size.

    Gemini 1.5 Flash -8B Social