Topic Hub

Gemini

36 linked pages across the LLM-Docs library.

doc

Gemini CLI Guide

How to use Gemini CLI for terminal coding, automation, and MCP-connected workflows.

news

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Our newest audio model introduces granular audio tags that give you precise control to direct AI speech for expressive audio generation.

news

Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning

Gemini Robotics-ER 1.6: Enhancing spatial reasoning and multi-view understanding for autonomous robotics.

news

Gemma 4: Byte for byte, the most capable open models

Gemma 4: Our most intelligent open models to date, purpose-built for advanced reasoning and agentic workflows.

news

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

Our latest voice model has improved precision and lower latency to make voice interactions more fluid, natural and precise.

news

Lyria 3 Pro: Create longer tracks in more

Introducing Lyria 3 Pro, which unlocks longer tracks with structural awareness. We’re also bringing Lyria to more Google products and surfaces.

news

Protecting people from harmful manipulation

Google DeepMind researches AI's harmful manipulation risks across areas like finance and health, leading to new safety measures.

news

Measuring progress toward AGI: A cognitive framework

We’re introducing a framework to measure progress toward AGI, and launching a Kaggle hackathon to build the relevant evaluations.

news

From games to biology and beyond: 10 years of AlphaGo’s impact

Ten years since AlphaGo, we explore how it is catalyzing scientific discovery and paving a path to AGI.

news

Gemini 3.1 Flash-Lite: Built for intelligence at scale

Gemini 3.1 Flash-Lite is our fastest and most cost-efficient Gemini 3 series model yet.

news

Nano Banana 2: Combining Pro capabilities with lightning-fast speed

Our latest image generation model offers advanced world knowledge, production-ready specs, subject consistency and more, all at Flash speed.

news

Gemini 3.1 Pro: A smarter model for your most complex tasks

3.1 Pro is designed for tasks where a simple answer isn’t enough.

news

A new way to express yourself: Gemini can now create music

The Gemini app now features our most advanced music generation model Lyria 3, empowering anyone to make 30-second tracks using text or images.

news

Accelerating discovery in India through AI-powered science and education

Google DeepMind brings its National Partnerships for AI initiative to India, scaling AI for science and education.

news

Gemini 3 Deep Think: Advancing science, research and engineering

Our most specialized reasoning mode is now updated to solve modern science, research and engineering challenges.

news

Accelerating Mathematical and Scientific Discovery with Gemini Deep Think

Research papers point to the growing impact of Deep Think across fields.

news

Project Genie: Experimenting with infinite, interactive worlds

Google AI Ultra subscribers in the U.S. can try out Project Genie, an experimental research prototype that lets you create and explore worlds.

news

D4RT: Teaching AI to see the world in four dimensions

D4RT: Unified, efficient 4D reconstruction and tracking up to 300x faster than prior methods.

news

Veo 3.1 Ingredients to Video: More consistency, creativity and control

Our latest Veo update generates lively, dynamic clips that feel natural and engaging, and supports vertical video generation.

news

Google's year in review: 8 areas with research breakthroughs in 2025

Google 2025 recap: Research breakthroughs of the year

news

Gemini 3 Flash: frontier intelligence built for speed

Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

agent

Gemini CLI

Google's open-source terminal agent for coding, shell work, web grounding, and MCP-based extensions.

model

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Our newest audio model introduces granular audio tags that give you precise control to direct AI speech for expressive audio generation.

model

Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning

Gemini Robotics-ER 1.6: Enhancing spatial reasoning and multi-view understanding for autonomous robotics.

model

Gemma 4: Byte for byte, the most capable open models

Gemma 4: Our most intelligent open models to date, purpose-built for advanced reasoning and agentic workflows.

model

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

Our latest voice model has improved precision and lower latency to make voice interactions more fluid, natural and precise.

model

Gemini 3.1 Flash-Lite: Built for intelligence at scale

Gemini 3.1 Flash-Lite is our fastest and most cost-efficient Gemini 3 series model yet.

model

Nano Banana 2: Combining Pro capabilities with lightning-fast speed

Our latest image generation model offers advanced world knowledge, production-ready specs, subject consistency and more, all at Flash speed.

model

Gemini 3.1 Pro: A smarter model for your most complex tasks

3.1 Pro is designed for tasks where a simple answer isn’t enough.

model

A new way to express yourself: Gemini can now create music

The Gemini app now features our most advanced music generation model Lyria 3, empowering anyone to make 30-second tracks using text or images.

model

Gemini 3 Deep Think: Advancing science, research and engineering

Our most specialized reasoning mode is now updated to solve modern science, research and engineering challenges.

model

Accelerating Mathematical and Scientific Discovery with Gemini Deep Think

Research papers point to the growing impact of Deep Think across fields.

model

Veo 3.1 Ingredients to Video: More consistency, creativity and control

Our latest Veo update generates lively, dynamic clips that feel natural and engaging, and supports vertical video generation.

model

Gemini 3 Flash: frontier intelligence built for speed

Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

model

Gemma Scope 2: helping the AI safety community deepen understanding of complex language model behavior

Open interpretability tools for language models are now available across the entire Gemma 3 family with the release of Gemma Scope 2.

model

Improved Gemini audio models for powerful voice experiences

Automatic model update from Google DeepMind.