FastAPI tutorial
In this tutorial, you’ll build Bargain Chef, a standalone Genkit backend on FastAPI that exposes a recipe-generating flow over HTTP. It uses two AI patterns Genkit simplifies: streaming structured output and tool calling.
What you’ll build
For each request, your server prompts Gemini to draft a recipe, and the model calls a tool to look up mock grocery sale prices so it can prefer on-sale ingredients. The server streams the recipe back field by field as it’s generated, so clients see progress before the full recipe is ready.
You can find the finished code on GitHub.
Prerequisites
- Python 3.10 or later
- uv package manager
This tutorial assumes you’re already familiar with building FastAPI applications.
Set up the application
Create the FastAPI project
    mkdir bargain-chef
    cd bargain-chef
    uv init --no-readme --python 3.10
Install packages
Install the Genkit CLI:
    curl -sL cli.genkit.dev | bash
Then install the packages you need in your project:
    uv add fastapi uvicorn genkit genkit-plugin-google-genai genkit-plugin-fastapi
These packages include:
- fastapi: The async Python web framework that serves your endpoint.
- uvicorn: The ASGI server that runs your FastAPI app.
- genkit: Core Genkit SDK.
- genkit-plugin-google-genai: Plugin that connects Genkit to Google’s Gemini models.
- genkit-plugin-fastapi: Serves Genkit flows as FastAPI endpoints, including streaming.
Configure a model API key
This tutorial uses the Gemini API. Get a key from Google AI Studio, then set the GEMINI_API_KEY environment variable to your key:
    export GEMINI_API_KEY=<your API key>
Optional: install the Genkit agent skills
If you’re coding with an AI assistant, install the Genkit Agent Skills so it has structured guidance on Genkit APIs, patterns, and common errors:
    npx skills add genkit-ai/skills
See Develop with AI for tool-specific installation instructions.
Create the backend
The backend handles requests from clients. For each request, it prompts Gemini to draft a recipe, lets the model call a tool to look up today’s grocery sale prices, and streams the partial recipe back to the caller as it’s generated.
The whole pipeline is a single Genkit flow. A flow is a special Genkit function with built-in observability, type safety, and tooling integration.
You’ll build the backend in four parts:
- Initialize Genkit and register Gemini as the model provider.
- Define a tool the model can call to fetch sale prices.
- Describe the recipe shape with Pydantic so Genkit can validate the final output and stream partial recipe chunks.
- Define the flow that ties everything together.
Replace the contents of main.py with the following:
    from datetime import datetime
    from typing import Literal

    from fastapi import FastAPI
    from fastapi.middleware.cors import CORSMiddleware
    from pydantic import BaseModel, Field

    from genkit import ActionRunContext, Genkit
    from genkit.plugins.fastapi import genkit_fastapi_handler
    from genkit.plugins.google_genai import GoogleAI

    ai = Genkit(
        plugins=[GoogleAI()],
        model='googleai/gemini-flash-latest',
    )


    class SaleItem(BaseModel):
        name: str
        price: str


    class GetIngredientsInput(BaseModel):
        day_type: Literal['weekday', 'weekend'] = Field(
            description='Whether to fetch weekday or weekend sale prices.',
        )


    @ai.tool(
        name='get_ingredients_on_sale',
        description=(
            'Returns the ingredients on sale at the local grocery store, with prices. '
            'The sale set differs between weekdays and weekends.'
        ),
    )
    async def get_ingredients_on_sale(input: GetIngredientsInput) -> list[SaleItem]:
        # Mock data: in a real app, query a pricing database.
        if input.day_type == 'weekend':
            return [
                SaleItem(name='chicken breast', price='$2.99/lb'),
                SaleItem(name='pasta', price='$0.79'),
                SaleItem(name='canned tomatoes', price='$0.99'),
                SaleItem(name='garlic', price='$0.50 / head'),
                SaleItem(name='olive oil', price='$6.99'),
            ]
        return [
            SaleItem(name='eggs', price='$3.49 / dozen'),
            SaleItem(name='spinach', price='$1.99'),
            SaleItem(name='parmesan', price='$4.99'),
            SaleItem(name='lemons', price='$0.50 each'),
            SaleItem(name='rice', price='$2.49'),
            SaleItem(name='butter', price='$3.99'),
        ]


    class RecipeIngredient(BaseModel):
        name: str
        quantity: str
        on_sale: bool


    class Recipe(BaseModel):
        title: str
        description: str
        servings: int
        ingredients: list[RecipeIngredient]
        steps: list[str]


    class BargainChefInput(BaseModel):
        craving: str = Field(description='What the user feels like eating right now.')


    @ai.flow(name='bargainChefFlow', chunk_type=Recipe)
    async def bargain_chef_flow(input: BargainChefInput, ctx: ActionRunContext) -> Recipe:
        today = datetime.now().strftime('%A')

        stream_response = ai.generate_stream(
            prompt=(
                f'Today is {today}. The user is craving: {input.craving}.\n\n'
                'Call the get_ingredients_on_sale tool with the day_type that matches today. '
                'Saturday and Sunday are weekends; all other days are weekdays. '
                'Then propose ONE recipe that takes advantage of those deals. For each '
                "ingredient, set on_sale=true if it appears in the tool's response, "
                'false otherwise.'
            ),
            tools=[get_ingredients_on_sale],
            output_schema=Recipe,
            config={'temperature': 0.7, 'thinkingConfig': {'thinkingLevel': 'MINIMAL'}},
        )

        async for chunk in stream_response.stream:
            if chunk.output:
                ctx.send_chunk(chunk.output)

        response = await stream_response.response
        if not response.output:
            raise ValueError('Failed to generate recipe')
        return response.output


    app = FastAPI()

    # allow_origins=['*'] lets any browser frontend call the endpoint during
    # development. Before deploying, restrict it to the origins you actually serve.
    app.add_middleware(
        CORSMiddleware,
        allow_origins=['*'],
        allow_methods=['*'],
        allow_headers=['*'],
    )


    @app.post('/bargainChefFlow', response_model=None)
    @genkit_fastapi_handler(ai)
    async def bargain_chef_endpoint():
        return bargain_chef_flow
A few details are worth noting before you run the backend:
- Final output and streamed chunks: Recipe is the complete recipe the flow returns at the end, passed as output_schema so Genkit validates the model’s output against it. Declaring chunk_type=Recipe on the flow tells Genkit that streamed chunks share the same shape, with fields filling in progressively as the model generates them.
- Shared Python types: the Pydantic models (Recipe, RecipeIngredient, BargainChefInput) define the request and response shapes once, so the flow, the HTTP handler, and any client share a single source of truth.
- The get_ingredients_on_sale tool: the model decides when to call it based on the prompt, and the typed GetIngredientsInput forces the model to pass day_type='weekday' or 'weekend'. In a real app, the tool would query a pricing database, inventory system, or third-party API.
- ctx.send_chunk: each call pushes the latest partial recipe to the client, giving it a typed view of the generated JSON as it grows. After the stream completes, the flow awaits stream_response.response so the HTTP request still resolves with a validated recipe.
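The prompt in this tutorial asks the model to work out whether today is a weekday or a weekend. If you would rather not leave that step to the model, one option is to compute the value in Python and include it in the prompt. The following is a minimal sketch of that idea; day_type_for is a hypothetical helper and is not part of Genkit or the tutorial code.

```python
from datetime import date
from typing import Literal

DayType = Literal['weekday', 'weekend']


def day_type_for(day: date) -> DayType:
    """Map a calendar date to the day_type value the tool expects.

    Hypothetical helper for illustration; not part of the tutorial code.
    """
    # date.weekday() returns 0 for Monday through 6 for Sunday,
    # so 5 and 6 are Saturday and Sunday.
    return 'weekend' if day.weekday() >= 5 else 'weekday'


if __name__ == '__main__':
    print(day_type_for(date.today()))
```

Inside the flow, you could then write the result of day_type_for(date.today()) into the prompt text so the model only has to copy it into the tool call.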
Expose the flow over HTTP
The @genkit_fastapi_handler(ai) decorator wraps the flow as a FastAPI route. The handler inspects the incoming request: when the client sends Accept: text/event-stream, it streams partial chunks as Server-Sent Events; otherwise it returns the final recipe as JSON.
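To make the Accept header check concrete, here is a rough sketch of how such a decision could be written. This is not the plugin's code, and the plugin's actual check may differ; wants_event_stream is a hypothetical name used only for illustration.

```python
from typing import Optional


def wants_event_stream(accept_header: Optional[str]) -> bool:
    """Return True if the Accept header lists text/event-stream.

    Illustrative only; the FastAPI plugin may decide differently.
    """
    if not accept_header:
        return False
    # An Accept header is a comma-separated list of media types, each
    # optionally followed by parameters such as ";q=0.9".
    media_types = (
        part.split(';')[0].strip().lower()
        for part in accept_header.split(',')
    )
    return 'text/event-stream' in media_types
```

A request that omits the header, or that only lists application/json, falls through to the single JSON response.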
Check the project layout
Verify that your project layout matches the structure below:
- pyproject.toml
- main.py
Run the app
Start the FastAPI server:
    uv run uvicorn main:app --reload
This launches the FastAPI app at http://localhost:8000. With the server running, send a streaming request from another terminal:
    curl -N -X POST http://localhost:8000/bargainChefFlow \
      -H "Content-Type: application/json" \
      -H "Accept: text/event-stream" \
      -d '{"data":{"craving":"something warm with chicken"}}'
The { "data": ... } wrapper is required: Genkit’s HTTP handler reads the flow input from the request body’s data field.
The response arrives as a series of data: events. Each event contains the partial recipe accumulated so far: title first, then description, then ingredients (with on_sale flags on the ones the model picked from the tool), then steps.
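If you want to consume the stream from Python instead of curl, the data: events can be decoded with a small parser. The sketch below uses only the standard library and follows the general Server-Sent Events framing (data: lines, with a blank line ending each event). It makes no assumption about the envelope Genkit puts around each chunk, and it is stricter than the SSE format in one respect: it assumes every event body is a single JSON document. iter_sse_data is a hypothetical helper name, and the sample lines are made-up stand-ins for real server output.

```python
import json
from typing import Any, Iterable, Iterator


def iter_sse_data(lines: Iterable[str]) -> Iterator[Any]:
    """Yield the decoded JSON payload of each event in a stream of SSE lines."""
    data_lines = []
    for raw in lines:
        line = raw.rstrip('\r\n')
        if line.startswith('data:'):
            # The SSE format allows one optional space after the colon.
            value = line[len('data:'):]
            data_lines.append(value[1:] if value.startswith(' ') else value)
        elif line == '' and data_lines:
            # A blank line ends the event; multi-line data joins with newlines.
            yield json.loads('\n'.join(data_lines))
            data_lines = []
    if data_lines:
        # Tolerate a final event that is not followed by a blank line.
        yield json.loads('\n'.join(data_lines))


# Made-up sample input, standing in for lines read from the HTTP response.
sample_lines = [
    'data: {"title": "Lemon Chicken"}\n',
    '\n',
    'data: {"title": "Lemon Chicken", "servings": 4}\n',
    '\n',
]
events = list(iter_sse_data(sample_lines))
```

Each item in events is one snapshot of the response; because every event carries the recipe accumulated so far, a client can simply render the most recent one.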
Test and inspect the app
You can test the endpoint directly with curl, and you can use the Developer UI to inspect both manual runs and live requests.
Send a request with curl
For a non-streaming response, drop the Accept: text/event-stream header:
    curl -X POST http://localhost:8000/bargainChefFlow \
      -H "Content-Type: application/json" \
      -d '{"data":{"craving":"something warm with chicken"}}'
You’ll receive the final structured recipe as a single JSON response.
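The same request can be built from Python with the standard library. This is a minimal sketch that mirrors the curl commands in this tutorial; build_flow_request is a hypothetical helper, and the default URL assumes the server is running locally on port 8000 as described earlier.

```python
import json
import urllib.request


def build_flow_request(
    craving: str,
    stream: bool = False,
    url: str = 'http://localhost:8000/bargainChefFlow',
) -> urllib.request.Request:
    """Build a POST request whose body wraps the flow input in a data field."""
    body = json.dumps({'data': {'craving': craving}}).encode('utf-8')
    headers = {'Content-Type': 'application/json'}
    if stream:
        # Ask for Server-Sent Events instead of a single JSON response.
        headers['Accept'] = 'text/event-stream'
    return urllib.request.Request(url, data=body, headers=headers, method='POST')
```

With the server running, pass the returned object to urllib.request.urlopen and decode the response body with json.loads to get the final recipe.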
Use the Developer UI
The Developer UI is Genkit’s local console for testing flows and inspecting execution traces. It runs alongside your backend code, gives you a visual runner for any flow in your project, and records every tool call and model invocation so you can iterate on prompts and debug tool behavior.
- Start the Developer UI from your project root:
    genkit start -- uv run uvicorn main:app --reload
  This launches the Developer UI at http://localhost:4000 by default.
- Select bargainChefFlow from the list of flows.
- Enter sample input:
    { "craving": "something warm with chicken" }
- Click Run.
You’ll see the generated recipe, with a trace that builds in real time so you can follow the flow’s progress through each tool call and model invocation.
What you built
You now have a standalone Genkit backend on FastAPI that streams structured output from Gemini over HTTP, calls a tool during generation to ground the model’s response in mock sale-price data, validates input and output against schemas, and surfaces every step in a local trace UI.