How to use Gemma 4 with the Gemini API and Google AI Studio
Google's Gemma 4 family of open models is now available through the Gemini API and Google AI Studio. Built from the same research behind Gemini 3, these models bring advanced reasoning, native function calling, multimodal understanding, and 256K context windows to an open, Apache 2.0-licensed package you can run anywhere.
Two models are available through the Gemini API today:
- gemma-4-26b-a4b-it
- gemma-4-31b-it
What makes Gemma 4 different
Gemma 4 models handle function calling, structured JSON output, and system instructions at the model level rather than through prompt engineering. The 31B dense model currently ranks as the #3 open model on the Arena AI text leaderboard, with the 26B MoE model at #6, competing with models 20x their size.
Key capabilities:
- 256K context window on both models
- Native function calling and structured output
- Multimodal: text, image, and video inputs
- 140+ languages trained natively
- Apache 2.0 license: full commercial use, no restrictions
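Structured output deserves a quick illustration: instead of parsing free-form text, you can ask the model to emit JSON that matches a schema you define and pass through the generation config. Here is a sketch of such a schema as a plain dict; the field names are illustrative, not a fixed API contract:

```python
import json

# An illustrative JSON schema for a structured recipe response.
# You would supply a schema like this via the generation config
# when requesting JSON output; the fields here are made up.
recipe_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "ingredients": {"type": "array", "items": {"type": "string"}},
        "prep_minutes": {"type": "integer"},
    },
    "required": ["name", "ingredients"],
}

# A response that conforms to the schema can be parsed directly,
# with no regex or prompt tricks needed.
raw = '{"name": "Udon", "ingredients": ["noodles", "dashi"], "prep_minutes": 20}'
recipe = json.loads(raw)
print(recipe["name"])  # Udon
```

Because the shape is guaranteed up front, downstream code can index fields directly instead of scraping them out of prose.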
Getting started with AI Studio
The fastest way to try Gemma 4 is Google AI Studio. Select gemma-4-26b-a4b-it or gemma-4-31b-it from the model picker, type a prompt, and start chatting. You can test system instructions, adjust temperature, and experiment with multimodal inputs all through the browser. No API key or code required.
Or click Get Code to export Python, JavaScript, or cURL snippets from any conversation.
Using Gemma 4 with the Gemini API
Install the Python SDK:
```shell
pip install google-genai
```

Set your API key as an environment variable. You can create one at aistudio.google.com/apikey.
```shell
export GEMINI_API_KEY="your-api-key"
```

Text generation
Generate text with Gemma 4:
```python
from google import genai

client = genai.Client()
response = client.models.generate_content(
    model="gemma-4-26b-a4b-it",
    contents="Compare ramen and udon in 3 bullet points: broth, noodle texture, and best season to eat."
)
print(response.text)
```

Pass a system instruction to set the model's behavior:
```python
from google import genai
from google.genai import types

client = genai.Client()
response = client.models.generate_content(
    model="gemma-4-31b-it",
    config=types.GenerateContentConfig(
        system_instruction="You are a wise Kyoto tea master. Speak calmly and poetically, using nature metaphors. Keep answers under 3 sentences."
    ),
    contents="What is the purpose of the tea ceremony?"
)
print(response.text)
```

Multi-turn conversations
The SDK provides a chat interface that tracks conversation history automatically:
```python
from google import genai

client = genai.Client()
chat = client.chats.create(model="gemma-4-26b-a4b-it")

response = chat.send_message("What are the three most famous castles in Japan?")
print(response.text)

response = chat.send_message("Which one should I visit in spring for cherry blossoms?")
print(response.text)
```
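Under the hood, this kind of chat helper simply accumulates user and model turns and resends the full history on every call. A minimal stdlib-only sketch of that pattern, where `fake_model` is a stand-in for the real API call and `SimpleChat` is an illustrative class, not part of the SDK:

```python
def fake_model(history):
    # Stand-in for the API call; a real model would see the whole
    # history and generate the next reply from it.
    return f"(reply after {len(history)} turns)"

class SimpleChat:
    def __init__(self):
        self.history = []  # list of {"role": ..., "text": ...} turns

    def send_message(self, text):
        # Record the user turn, get a reply, record the model turn.
        self.history.append({"role": "user", "text": text})
        reply = fake_model(self.history)
        self.history.append({"role": "model", "text": reply})
        return reply

chat = SimpleChat()
chat.send_message("What are the three most famous castles in Japan?")
chat.send_message("Which one should I visit in spring?")
print(len(chat.history))  # 4: two user turns, two model turns
```

The SDK manages this bookkeeping for you, but the mental model is the same: each follow-up question is answered with the entire conversation as context.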
Image understanding

Pass an image alongside your text prompt:
```python
from google import genai
from google.genai import types

client = genai.Client()

with open("path/to/image.png", "rb") as f:
    image_bytes = f.read()

response = client.models.generate_content(
    model="gemma-4-26b-a4b-it",
    contents=[
        types.Part.from_bytes(data=image_bytes, mime_type="image/png"),
        "Describe this image in 2-3 sentences as if writing a caption for a Japanese travel magazine."
    ]
)
print(response.text)
```
Function calling

Define tools as function declarations. The model decides when to call them:
```python
from google import genai
from google.genai import types

# Define the function declaration
get_weather = {
    "name": "get_weather",
    "description": "Get current weather for a given location.",
    "parameters": {
        "type": "object",
        "properties": {
            "location": {
                "type": "string",
                "description": "City and state, e.g. 'San Francisco, CA'",
            },
        },
        "required": ["location"],
    },
}

client = genai.Client()
tools = types.Tool(function_declarations=[get_weather])
config = types.GenerateContentConfig(tools=[tools])

response = client.models.generate_content(
    model="gemma-4-26b-a4b-it",
    contents="Should I bring an umbrella to Kyoto today?",
    config=config,
)

# The model returns a function call instead of text
if response.candidates[0].content.parts[0].function_call:
    fc = response.candidates[0].content.parts[0].function_call
    print(f"Function: {fc.name}")
    print(f"Arguments: {fc.args}")
```
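Once you have the function name and arguments, your code is responsible for actually running the tool. A minimal dispatch sketch, using plain values shaped like `fc.name` and `fc.args`; `TOOL_REGISTRY` and `get_weather_impl` are illustrative names, not SDK APIs:

```python
def get_weather_impl(location):
    # Stand-in implementation for the declared get_weather tool;
    # a real version would call a weather service here.
    return {"location": location, "condition": "rain", "chance": 0.8}

# Map declared tool names to local implementations.
TOOL_REGISTRY = {"get_weather": get_weather_impl}

def dispatch(name, args):
    # Look up the tool the model chose and call it with the
    # model-supplied arguments.
    fn = TOOL_REGISTRY[name]
    return fn(**args)

# Simulated function call, shaped like fc.name / fc.args above.
result = dispatch("get_weather", {"location": "Kyoto, Japan"})
print(result["condition"])  # rain
```

In a full loop you would then wrap `result` in a function-response part and send it back to the model, so it can turn the raw data into a natural-language answer.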
Google Search

Ground Gemma 4 responses in real-time web data with Google Search:
```python
from google import genai
from google.genai import types

client = genai.Client()
response = client.models.generate_content(
    model="gemma-4-26b-a4b-it",
    contents="What are the dates for cherry blossom season in Tokyo this year?",
    config=types.GenerateContentConfig(
        tools=[{"google_search": {}}]
    ),
)
print(response.text)

# Access grounding metadata for citations
for chunk in response.candidates[0].grounding_metadata.grounding_chunks:
    print(f"Source: {chunk.web.title} — {chunk.web.uri}")
```

Where to go next
- Google AI Studio — Try Gemma 4 in the browser
- Gemini API documentation
- Gemma Documentation
- Gemma Cookbooks
Thanks for reading! If you have any questions or feedback, please let me know on Twitter or LinkedIn.