💳 Credit Risk Scorecard for Default Prediction

A FastAPI-based web application for credit default risk prediction, adapted from my Kaggle notebook 👉 Credit Risk Scorecard for Default Prediction.

The notebook covers the entire modeling pipeline, while this repository turns that work into a production-ready, Dockerized API + Web UI with CI/CD automation and Azure Cloud deployment.

🚀 Live Deployment

🌐 Production App (Azure):
🔗 https://creditscorecardwoeiv-bgc3edb0fzgzdzfg.eastasia-01.azurewebsites.net/

⚙️ Startup may take 30–60 seconds on first access (free tier cold start).

🚀 Features

Credit Score Prediction using a Logistic Regression model with Weight of Evidence (WOE) transformation
Industry-Standard Credit Ratings: Assigns S&P/Moody's/Fitch-style ratings (AAA, AA, A, BBB, BB, B, CCC, CC, C, D) based on credit score
Default Risk Assessment: Predicts default probability + standardized credit rating
Preprocessing Pipeline (feature binning, scaling, encoding)
RESTful API built with FastAPI
Interactive Swagger Docs at /docs
Responsive Web UI for non-technical users
Dockerized Deployment via Azure App Service or Docker Compose
Full CI/CD Workflow using GitHub Actions (lint → test → build → push → deploy)

📁 Project Structure

Credit Scorecard - Default Prediction/
├── app/                      # FastAPI application
│   ├── main.py               # App entrypoint
│   ├── api/                  # API layer: routes & schemas
│   │   ├── routes.py
│   │   └── schemas.py
│   ├── utils/                # Helper utilities
│   │   └── helpers.py
│   └── static/               # Web UI (HTML, CSS, JS, assets)
├── tests/                    # Unit & integration tests
├── models/                   # Exported pipeline, model, scorecard, metadata
├── Dockerfile
├── docker-compose.yml
├── requirements.txt
├── requirements-dev.txt
└── .github/
    └── workflows/
        └── deploy.yml            # CI/CD pipeline

📚 Background

This project is based on the Kaggle notebook: 👉 Credit Risk Scorecard for Default Prediction

The data preparation, WOE binning, and logistic regression model training are all explained in the notebook.

The exported preprocessing pipeline, trained model, and scorecard are taken from that notebook.

This repository focuses on wrapping the trained pipeline + model into a FastAPI application, with a frontend and Dockerized CI/CD pipeline for real-world usage.
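For readers who have not opened the notebook, the WOE step mentioned above can be sketched roughly as follows. This is a minimal illustration, not the notebook's code: the function name `woe_iv` is hypothetical, and the sign convention ln(%good / %bad) is an assumption (some implementations use the inverse).

```python
import math
from collections import Counter


def woe_iv(bins, defaults):
    """Return ({bin_label: woe}, information_value) for one binned feature.

    bins:     bin label for each borrower
    defaults: 1 if the borrower defaulted (bad), 0 otherwise (good)
    Assumed convention: WOE = ln(share of goods / share of bads) per bin.
    """
    good, bad = Counter(), Counter()
    for label, is_default in zip(bins, defaults):
        if is_default:
            bad[label] += 1
        else:
            good[label] += 1
    total_good, total_bad = sum(good.values()), sum(bad.values())

    woe, iv = {}, 0.0
    for label in sorted(set(good) | set(bad)):
        # A bin with no goods or no bads would give log(0) or division by zero.
        if good[label] == 0 or bad[label] == 0:
            raise ValueError(f"bin {label!r} needs at least one good and one bad record")
        dist_good = good[label] / total_good
        dist_bad = bad[label] / total_bad
        woe[label] = math.log(dist_good / dist_bad)
        iv += (dist_good - dist_bad) * woe[label]
    return woe, iv
```

Under this convention a positive WOE marks a bin with proportionally more good borrowers than the population, and the Information Value (IV) sums how strongly the bins separate goods from bads.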

🛠️ Installation

Local Development

git clone https://github.com/thaitri2005/Credit-Scorecard-for-Default-Prediction.git
cd Credit-Scorecard-for-Default-Prediction
python -m venv venv
source venv/bin/activate   # On Windows: venv\Scripts\activate
pip install -r requirements-dev.txt
uvicorn app.main:app --reload

Docker Deployment

docker build -t credit-risk-api .
docker run -p 8000:80 credit-risk-api

or with Compose:

docker-compose up --build

or pull the prebuilt image (published to Docker Hub by CI/CD):

docker pull thaitri2005/credit_scorecard_for_default_prediction:latest
docker run -p 8000:80 thaitri2005/credit_scorecard_for_default_prediction:latest

📚 API Documentation

Endpoints

GET / - Web UI
GET /api/v1/health - Health check
POST /api/v1/predict - Single prediction
POST /api/v1/predict/batch - Batch predictions
GET /api/v1/model/info - Model information
GET /api/v1/model/features/importance - Feature importance
GET /docs - Interactive API documentation

Example Request

curl -X POST "http://localhost:8000/api/v1/predict" \
     -H "Content-Type: application/json" \
     -d '{
       "annual_inc": 75000,
       "int_rate": 12.5,
       "credit_history_length": 5.5,
       "purpose": "debt_consolidation",
       "verification_status": "verified"
     }'
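The same request can be sent from Python. The sketch below uses only the standard library and assumes the API is reachable at http://localhost:8000, as in the curl example; the helper names `build_predict_request` and `predict` are illustrative and not part of this repository.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # assumption: local port from the curl example

EXAMPLE_PAYLOAD = {
    "annual_inc": 75000,
    "int_rate": 12.5,
    "credit_history_length": 5.5,
    "purpose": "debt_consolidation",
    "verification_status": "verified",
}


def build_predict_request(payload, base_url=BASE_URL):
    """Build (but do not send) a POST request for the single-prediction endpoint."""
    return urllib.request.Request(
        url=f"{base_url}/api/v1/predict",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def predict(payload, base_url=BASE_URL, timeout=10.0):
    """Send the request and return the decoded JSON response body."""
    request = build_predict_request(payload, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `predict(EXAMPLE_PAYLOAD)` should return a dictionary shaped like the example response.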

Example Response

{
  "credit_score": 650.25,
  "default_probability": 0.1234,
  "risk_level": "A",
  "log_odds": -1.9876,
  "message": "Prediction completed successfully"
}

Note: The risk_level field uses industry-standard credit ratings (AAA, AA, A, BBB, BB, B, CCC, CC, C, D) based on the calculated credit score, following S&P/Moody's/Fitch rating conventions.

📈 Scoring Methodology

This application follows industry-standard credit scoring practices:

Credit Score Calculation

The credit score is calculated using the standard scorecard transformation formula:

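Score = Offset - Factor × ln(odds)

Where:

Factor = PDO / ln(2), where PDO is the Points to Double Odds (typically 20)
Offset = BaseScore - Factor × ln(BaseOdds)
ln(odds) = ln(p / (1 - p)), where p is the predicted default probability

Because p is the default probability, the odds in this formula are Bad:Good, the inverse of the Good:Bad BaseOdds. That is why the log-odds term is subtracted: a borrower at exactly the base odds receives the BaseScore, and lower default risk yields a higher score.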

Default Parameters:

PDO (Points to Double Odds): 20
Base Score: 600
Base Odds: 50:1 (Good:Bad ratio)
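The formula and default parameters above can be checked with a few lines of Python. This is a minimal sketch of the transformation as documented here, not the code in this repository; the function name `credit_score` is illustrative.

```python
import math

PDO = 20          # points needed to double the Good:Bad odds
BASE_SCORE = 600  # score assigned at the base odds
BASE_ODDS = 50    # Good:Bad odds at the base score (50:1)

FACTOR = PDO / math.log(2)
OFFSET = BASE_SCORE - FACTOR * math.log(BASE_ODDS)


def credit_score(default_probability):
    """Map a default probability in (0, 1) to a scorecard score."""
    if not 0.0 < default_probability < 1.0:
        raise ValueError("default_probability must be strictly between 0 and 1")
    # Log-odds of default (Bad:Good), hence the minus sign below.
    log_odds = math.log(default_probability / (1.0 - default_probability))
    return OFFSET - FACTOR * log_odds
```

With these defaults, Factor ≈ 28.85 and Offset ≈ 487.12. A default probability of 1/51 (50:1 Good:Bad odds) maps to 600, and 1/101 (100:1 odds) maps to 620, which is one PDO higher.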

Risk Level Assignment

Risk levels are assigned using industry-standard credit ratings based on the calculated credit score:

Credit Score	Rating	Description
≥ 750	AAA	Highest quality, minimal default risk
700-749	AA	Very high quality, very low default risk
650-699	A	High quality, low default risk
600-649	BBB	Good quality, moderate default risk
550-599	BB	Speculative, elevated default risk
500-549	B	Highly speculative, material default risk
450-499	CCC	Substantial credit risk
400-449	CC	Very high credit risk
350-399	C	Near default
< 350	D	Default

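This rating system borrows the letter scale used by S&P and Fitch (Moody's uses an equivalent scale with different notation, such as Aaa and Baa), providing familiar and interpretable risk assessments for financial professionals. The score cut-offs themselves are specific to this scorecard.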
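The rating table can be expressed as a simple threshold lookup. The sketch below is an illustration of the table, not the repository's implementation; `assign_rating` is a hypothetical name, and treating each band as half-open (so a fractional score such as 749.5 falls in AA) is an assumption, since the table lists only integer ranges.

```python
# (minimum score, rating), ordered from the highest band to the lowest.
RATING_FLOORS = [
    (750, "AAA"),
    (700, "AA"),
    (650, "A"),
    (600, "BBB"),
    (550, "BB"),
    (500, "B"),
    (450, "CCC"),
    (400, "CC"),
    (350, "C"),
]


def assign_rating(score):
    """Return the rating for a credit score; anything below 350 is 'D'."""
    for floor, rating in RATING_FLOORS:
        if score >= floor:
            return rating
    return "D"
```

For instance, the score 650.25 from the example response falls in the 650-699 band and maps to "A".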

Calculation Flow

The prediction pipeline follows this sequence:

Model Prediction → Default probability (0-1)
Score Transformation → Credit score (300-900 range)
Rating Assignment → Standard credit rating (AAA-D)

🔄 CI/CD Workflow

The CI/CD pipeline performs the following steps:

Lint & Type Check
Run Unit Tests
Security Scan (Trivy)
Build Docker Image
Push to Azure Container Registry
Deploy to Azure Web App (App Service)

For more details, see the GitHub Actions workflow in .github/workflows/deploy.yml.

📊 Model Features

annual_inc: Annual income
int_rate: Interest rate
purpose: Loan purpose
verification_status: Income verification status
total_rev_hi_lim: Total revolving high limit
tot_cur_bal: Total current balance
loan_burden: Loan burden ratio
revol_util: Revolving utilization
credit_history_length: Credit history length
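When building a request payload, it can help to check it against the feature list above. The helper below is a hypothetical client-side sketch; this README does not state whether the API requires all nine fields (the example request sends only five), so the check only reports differences and does not reject anything.

```python
MODEL_FEATURES = (
    "annual_inc",
    "int_rate",
    "purpose",
    "verification_status",
    "total_rev_hi_lim",
    "tot_cur_bal",
    "loan_burden",
    "revol_util",
    "credit_history_length",
)


def compare_payload(payload):
    """Return (missing, unexpected) feature names for a request payload."""
    # Missing names keep the order of the feature list above.
    missing = [name for name in MODEL_FEATURES if name not in payload]
    # Keys the model does not list, sorted for a stable result.
    unexpected = sorted(set(payload) - set(MODEL_FEATURES))
    return missing, unexpected
```

Applied to the example request, this reports `total_rev_hi_lim`, `tot_cur_bal`, `loan_burden`, and `revol_util` as not supplied.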

🤝 Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests for new functionality
Run the test suite
Submit a pull request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🆘 Support

For support and questions:

Create an issue on GitHub
Check the API documentation at /docs
Review the test cases for usage examples
