AlignGPT

AlignGPT is a research-grade alignment platform for building, evaluating, and operating language-model systems with preference learning, retrieval, safety gates, benchmark automation, and web-facing product surfaces.

The repository started as a compact RLHF prototype. It now keeps that useful training logic while organizing the project like a maintainable AI platform: research work, backend services, frontend surfaces, deployment assets, observability, security, evaluation, benchmarks, SDKs, and developer tooling all have explicit ownership.

System Goals

Provide a clear RLHF, DPO, reward-modeling, and evaluation path for alignment research.
Expose alignment workflows through API, CLI, SDK, and web dashboard entry points.
Make benchmark runs reproducible with tracked configs, manifests, metrics, and reports.
Keep production concerns visible from day one: auth boundaries, rate limiting, secret hygiene, observability, and deployment topology.
Preserve optional heavy ML dependencies behind lazy imports so the core package remains import-safe in CI and lightweight development environments.

Architecture At A Glance

flowchart LR
    UI["Frontend Dashboard"] --> API["Backend API Gateway"]
    SDK["Python/TS SDKs"] --> API
    API --> Safety["Safety + Policy Gates"]
    API --> Router["Inference Router"]
    Router --> LLM["Policy / Reward / Embedding Models"]
    Router --> RAG["Retrieval + Vector Store"]
    API --> Eval["Evaluation + Benchmarking"]
    Eval --> Registry["Experiment Registry"]
    Registry --> Observability["Metrics, Logs, Traces"]

See ARCHITECTURE.md for service boundaries, request lifecycle, storage layers, and deployment topology. Detailed diagrams live in docs/diagrams, including service topology, request lifecycle, RAG flow, inference routing, safety pipeline, and agent orchestration.

Repository Map

research/: alignment methods, reward modeling, ablations, benchmark design, and papers.
datasets/: ingestion, preprocessing, vectorization, synthetic data, quality control, and registry schemas.
models/: model families, adapters, reward models, embeddings, quantization, distillation, and model evaluation plans.
agent_system/: planners, memory, tool use, orchestration, multi-agent workflows, autonomous reasoning, and safety constraints.
backend/: API gateway, auth, rate limiting, retrieval, caching, inference routing, queues, feature store, and database contracts.
frontend/: web dashboard, admin panel, experiment visualizer, benchmark viewer, analytics, and mobile placeholder.
deployment/: Docker, Kubernetes, Helm, Terraform, nginx, Cloudflare, GPU cluster, and autoscaling assets.
infrastructure/: observability, monitoring, distributed compute, GPU optimization, vLLM, Ray, Kafka, and Redis.
benchmarking/: hallucination, latency, throughput, robustness, bias, adversarial, and reproducibility suites.
security/: prompt-injection, jailbreak, PII redaction, access-control, and threat-modeling assets.
evaluation/: human feedback, reward scoring, ranking, scientific metrics, and automated evaluation.
pipelines/: training, inference, RAG, indexing, retraining, deployment, and scheduled jobs.
sdk/: Python client, TypeScript client, CLI adapter, and examples.
cli/: operator-oriented entry points for training, deployment, and evaluation.
src/aligngpt: lightweight core package used by tests, backend scaffolds, SDKs, and future services.

Developer Quickstart

python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
python -m pip install -U pip
python -m pip install -e ".[dev]"
pytest

Optional ML extras are isolated:

python -m pip install -e ".[ml]"

The core package intentionally avoids importing torch, transformers, or hosted model clients during normal import. Training and generation code load those dependencies only inside task-specific modules.

API Smoke Run

The backend scaffold exposes a FastAPI app when optional API dependencies are installed:

python -m pip install -e ".[api]"
uvicorn backend.api_gateway.app:app --reload --port 8000

OpenAPI contract notes live in docs/api/openapi.yaml.

Key endpoints:

GET /health, GET /ready, GET /metrics
POST /v1/align
POST /v1/evaluate
GET /v1/status
GET /v1/events

Web App Scaffold

The frontend surface is organized under frontend/web/nextjs_app. It includes a public landing page and internal dashboard routes for experiment status, benchmark health, safety findings, deployment readiness, model routing, and evaluation traces.

Operational Workflows

Alignment evaluation: docs/tutorials/operational_workflows.md
Benchmark bundle: benchmarking/results/2026-05-20
Production deployment path: docs/deployment/production_runbook.md

Hard Systems Feature

AlignGPT includes a GPU-aware inference router in src/aligngpt/router.py. It selects backends by capability, context window, estimated GPU memory, batch size, quantization, health, latency budget, cost, and fallback policy. The router is integrated into the API service and the alignment evaluation pipeline.

Existing RLHF Logic

The original prototype logic remains available under src/data, src/models, src/training, src/eval, and src/utils. These modules include prompt building, preference-pair generation, reward modeling, SFT, PPO, DPO, checkpointing, and metric utilities. The new platform skeleton wraps that work in a larger product and research structure instead of discarding it.

Testing

Fast tests live under tests/unit and tests/integration. They validate configuration loading, safety policy behavior, service contracts, and deterministic benchmark metrics without network downloads. Heavier ML tests should be added behind explicit markers once model artifacts and CI runners are available.

Screenshots

Product screenshots will be captured from the dashboard once the Next.js app reaches an interactive milestone. Until then, the intended screen inventory is documented in frontend/web/dashboard/README.md and docs/diagrams/system_context.mmd.

License

Released under the MIT License. See LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AlignGPT

System Goals

Architecture At A Glance

Repository Map

Developer Quickstart

API Smoke Run

Web App Scaffold

Operational Workflows

Hard Systems Feature

Existing RLHF Logic

Testing

Screenshots

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 190 Commits
.github		.github
agent_system		agent_system
backend		backend
benchmarking		benchmarking
cli		cli
configs		configs
data		data
datasets		datasets
deployment		deployment
docs		docs
evaluation		evaluation
frontend		frontend
infrastructure		infrastructure
models		models
notebooks		notebooks
pipelines		pipelines
research		research
script		script
sdk		sdk
security		security
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MODEL_CARD.md		MODEL_CARD.md
Makefile		Makefile
README.md		README.md
RESEARCH.md		RESEARCH.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml
railway.toml		railway.toml
render.yaml		render.yaml
requirements.txt		requirements.txt
vercel.json		vercel.json

Folders and files

Latest commit

History

Repository files navigation

AlignGPT

System Goals

Architecture At A Glance

Repository Map

Developer Quickstart

API Smoke Run

Web App Scaffold

Operational Workflows

Hard Systems Feature

Existing RLHF Logic

Testing

Screenshots

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages