UdaPlay - AI Game Research Agent Project

Project Scenario

You’ve been hired as an AI Engineer at a gaming analytics company developing an assistant called UdaPlay. Executives, analysts, and gamers want to ask natural language questions like:

“Who developed FIFA 21?”
“When was God of War Ragnarok released?”
“What platform was Pokémon Red launched on?”
“What is Rockstar Games working on right now?”

Your agent should:

Attempt to answer the question from internal knowledge (about a pre-loaded list of companies and games)
If the information is not found or confidence is low, search the web
Parse and persist the information in long-term memory
Generate a clean, structured answer/report

Project Specifications

In this project, you will build an AI Research Agent called UdaPlay designed to answer questions about video games. The agent will be capable of:

Answering user questions about games, including:
- Game titles and their details
- Release dates and platforms
- Game descriptions and genres
- Publisher information
Using a two-tier information retrieval system:
- Primary: RAG (Retrieval Augmented Generation) over a local dataset of games
- Secondary: Web search using the Tavily API when internal knowledge is insufficient
Implementing a robust evaluation system:
- Assessing the quality of retrieved information
- Determining when to fall back to web search
- Providing confidence levels in answers
Generating clear, well-structured responses that:
- Cite information sources
- Combine information from multiple sources when needed
- Present information in a natural, readable format

Project Overview

UdaPlay is an AI-powered research agent for the video game industry. This project is divided into two main parts that will help you build a sophisticated AI agent capable of answering questions about video games using both local knowledge and web searches. A short demonstration video (udaplay.mp4) is included in this repository and can be played directly below.

Project Structure

Part 1: Offline RAG (Retrieval-Augmented Generation)

In this part, you'll build a Vector Database using ChromaDB to store and retrieve video game information efficiently.

Set up ChromaDB as a persistent client
Create a collection using embedding_functions.SentenceTransformerEmbeddingFunction for embeddings
Process and index game data from JSON files
Each game document contains:
- Name
- Platform
- Genre
- Publisher
- Description
- Year of Release

Part 2: AI Agent Development

Build an intelligent agent that combines local knowledge with web search capabilities.

The agent will have the following capabilities:

Answer questions using internal knowledge (RAG)
Search the web when needed
Maintain conversation state
Return structured outputs
Store useful information for future use

Required Tools to Implement:

retrieve_game: Search the vector database for game information
evaluate_retrieval: Assess the quality of retrieved results
game_web_search: Perform web searches for additional information

Requirements

Environment Setup

Create a .env file with the following API keys:

OPENAI_API_KEY="YOUR_KEY"
TAVILY_API_KEY="YOUR_KEY"
CHROMA_URL="http://localhost:8000"

Project Dependencies

ChromaDB now uses local SentenceTransformer embeddings. OPENAI_API_KEY is still required for the agent's LLM.

Python 3.11+
ChromaDB
OpenAI
Tavily
dotenv

ChromaDB Setup with Docker Compose

A docker-compose.yml file is provided to run ChromaDB. Start it with:

docker compose up -d chromadb

The service listens on http://localhost:8000 and stores data in the chromadb directory. Set CHROMA_URL=http://localhost:8000 before running the project.

Directory Structure

project/
├── starter/
│   ├── games/           # JSON files with game data
│   ├── lib/             # Custom library implementations
│   │   ├── llm.py       # LLM abstractions
│   │   ├── messages.py  # Message handling
│   │   ├── ...
│   │   └── tooling.py   # Tool implementations
│   ├── Udaplay_01_starter_project.ipynb  # Part 1 implementation
│   └── Udaplay_02_starter_project.ipynb  # Part 2 implementation

Getting Started

Run the setup script to create a virtual environment and install all dependencies:
- On Linux/macOS: ./setup_linux.sh
- On Windows: setup_windows.bat
Activate the environment whenever needed:
- On Linux/macOS: source .venv/bin/activate
- On Windows: activate_venv.bat
Set up your .env file with necessary API keys
Follow the notebooks in order:
- Complete Part 1 to set up your vector database
- Complete Part 2 to implement the AI agent

Testing Your Implementation

After completing both parts, test your agent with questions like:

"When was Pokémon Gold and Silver released?"
"Which one was the first 3D platformer Mario game?"
"Was Mortal Kombat X released for PlayStation 5?"

Advanced Features

After completing the basic implementation, you can enhance your agent with:

Long-term memory capabilities
Additional tools and capabilities

Library Overview

The lib/ directory now contains a complete reference implementation used by the notebooks and the Streamlit demo. The main modules are:

llm.py – wrapper around the OpenAI client with tool support
messages.py – typed message classes (system, user, assistant and tool)
tooling.py – @tool decorator to expose Python functions as tools
state_machine.py – generic state machine driving agent workflows
documents.py and loaders.py – helpers for handling documents and PDFs/JSON
vector_db.py – ChromaDB convenience layer for storing and querying vectors
rag.py – Retrieval-Augmented Generation pipeline built on the state machine
game_tools.py – tools for querying games, evaluating results and web search
game_agent.py – example agent orchestrating retrieval, evaluation and search
memory.py – short-term session memory and vector-based long-term memory
evaluation.py – framework for scoring agent runs and responses
agents.py – generic agent class with conversation history and tool calls
streamlit_app.py – simple UI for asking questions and inspecting the run

These modules demonstrate how to build a production-ready research agent and can serve as templates for your own projects.

Notes

Web search functionality is still under development
Make sure to implement proper error handling
Follow best practices for API key management
Document your code thoroughly
Test your implementation with various types of queries

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UdaPlay - AI Game Research Agent Project

Project Scenario

Project Specifications

Project Overview

Project Structure

Part 1: Offline RAG (Retrieval-Augmented Generation)

Part 2: AI Agent Development

Requirements

Environment Setup

Project Dependencies

ChromaDB Setup with Docker Compose

Directory Structure

Getting Started

Testing Your Implementation

Advanced Features

Library Overview

Notes

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

UdaPlay - AI Game Research Agent Project

Project Scenario

Project Specifications

Project Overview

Project Structure

Part 1: Offline RAG (Retrieval-Augmented Generation)

Part 2: AI Agent Development

Requirements

Environment Setup

Project Dependencies

ChromaDB Setup with Docker Compose

Directory Structure

Getting Started

Testing Your Implementation

Advanced Features

Library Overview

Notes