Analyst Co-Pilot: Revolutionizing A/B Test Execution (powered by Human Intelligence & Antigravity AI)
This repository showcases an elite-level analytical project completed at an unprecedented speed through a seamless symbiosis of human product intuition and AI execution. It explores an onboarding optimization feature in a chat application.
New users receive 20 bonus credits. Once their balance drops to 0, they encounter a hard paywall. We tested adding a "pinned chat reminder" when a user's balance drops to
Final Business Impact:
- Conversion Rate Lift: +5.65% increase in CR to the first payment.
-
Statistical Significance:
$p = 0.0123$ (statistically significant at$\alpha = 0.05$ ). - Recommendation: 100% Rollout to the target audience.
- Candidate Profile: Artem Liakh
- Original Task Requirements: Notion Task Definition
├── .github/
│ └── assets/
│ ├── collab_banner.png
│ └── collab_banner_visual_plot.png
├── data/
│ ├── ab_test_data.csv
│ └── ab_test_historical_data.csv
├── notebooks/
│ ├── ab_test_analysis.ipynb
│ └── presentation_v2.html
├── AI_COLLABORATION.md
├── ANALYST_CO_PILOT_REPORT.md
├── LICENSE
└── README.md
This project represents a paradigm shift in data science workflows. By delegating boilerplate code, data wrangling, and statistical computation to the Antigravity AI Agent, the human Lead Product Scientist was freed up to focus entirely on high-level strategy:
- Defining the precise business hypotheses.
- Selecting the rigorous Intent-to-Treat (ITT) methodology.
- Translating pure statistical outputs into actionable, executive-level business recommendations.
This collaborative approach resulted in a 70% reduction in delivery time with zero compromise on analytical rigor. Read the deep-dive on our methodology and collaboration dynamics in AI_COLLABORATION.md.
-
Target Audience (Intent-to-Treat): Filtered historical and test data to only include users who genuinely reached a balance of
$\le 5$ credits. - Power Analysis: Analyzed pre-pilot historical data to establish a baseline CR (~14.38%). Determined the required sample size (approx. 9,727 users per group) to detect a Minimum Detectable Effect (MDE) of 5% with 80% power.
- Primary Metric Analysis: Applied a Z-test for Proportions to compare the CR between the Control and Test groups, calculating 95% Confidence Intervals (Wilson score).
- Behavioral Analysis: Conducted a Mann-Whitney U test to analyze the Time-to-Payment distribution (see collab_banner_visual_plot.png), demonstrating that the pinned reminder successfully converted a highly hesitant segment of users.
- Language: Python 3.x
- Data Manipulation: Pandas, NumPy
- Statistics: SciPy, Statsmodels
- Data Visualization: Seaborn, Matplotlib
