Skip to content

Latest commit

 

History

History
38 lines (26 loc) · 945 Bytes

File metadata and controls

38 lines (26 loc) · 945 Bytes

Changelog

v0.1.2 - 2026-06-05

Outreach and reference-run update.

Added

  • Scored Codex reference audit run.
  • JSON scorecard example.
  • Public outreach log updates for benchmark-directory PRs.

v0.1.1 - 2026-06-05

Promotion and discovery update.

Added

  • GitHub Pages landing page.
  • Social card artwork.
  • Promotion kit with launch copy.
  • llms.txt, robots.txt, sitemap, and structured metadata.
  • Security policy and citation metadata.

v0.1.0 - 2026-06-05

Initial public benchmark release.

Added

  • ShopPay business rules in SPEC.md.
  • Intentionally flawed service implementation covering orders, refunds, users, pricing, wallet, and webhooks.
  • Baseline tests documenting seeded business-logic defects.
  • Benchmark guide and maintainer answer key.
  • Scoring rubric for comparing AI audit reports.
  • Example audit report format.
  • GitHub Actions test workflow.
  • Maintainer roadmap and contribution guide.