Skip to content

chanzuckerberg/dataset-catalog

Scientific Dataset Catalog

Managing scientific datasets, their relationships, and metadata across research workflows can be complex and error-prone. The Scientific Dataset Catalog provides a centralized system for tracking datasets, their lineage, collections, and rich metadata throughout the research lifecycle.

This system helps you organize datasets into collections, track how datasets relate to each other (lineage relationships), store rich metadata, and provides a Python client for programmatic access to all these capabilities.

Getting Started

🐍 Using the Python Client

Want to integrate dataset catalog functionality into your Python workflows? → Quick Start | Full Documentation

📋 Understanding the Data Schema

Want to understand dataset metadata structure and relationships? → Schema Documentation

🤖 Claude Code Plugin

Prefer to work in Claude Code? Install the catalog plugin. → Installation

🔧 Contributing

Want to contribute to the codebase? → Development Guide

Claude Code Plugin

This repo ships a Claude Code plugin, catalog, distributed through the dataset-catalog marketplace defined in .claude-plugin/marketplace.json. Install it from inside a Claude Code session:

/plugin marketplace add chanzuckerberg/dataset-catalog
/plugin install catalog@dataset-catalog

Quick Start

Ready to start using the Python client? The fastest way to get up and running:

Installation & Quick Start Guide

This will walk you through installation, getting an API token, and your first few API calls.

Documentation & Resources

📚 Complete Documentation

🔗 Related Projects

  • Dataset Catalog API - The backend service this client connects to
  • Schema Documentation - Detailed data models and relationships

🤝 Contributing

Code of Conduct

This project adheres to the Contributor Covenant code of conduct. By participating, you are expected to uphold this code. Please report unacceptable behavior to opensource@chanzuckerberg.com.

About

Tooling and utilities to work with the dataset-catatlog

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors