Risk Intelligence System  ยท  MGMT 571 ยท Purdue University๐Ÿ† 3rd of 37 Teams

Corporate Bankruptcy
Early Warning System

An AutoGluon ensemble ML pipeline scoring 8,000 firms on bankruptcy probability โ€” trained on 10,000 companies across 64 financial ratios. Built to answer the question every credit team and account manager needs: which firms on your list are heading toward financial distress?

0.00
ROC‑AUC Score
LightGBM ยท XGBoost ยท CatBoost
Random Forest ยท Neural Net ยท FastAI
โ†’ Weighted Stacking Ensemble
0
Firms Scored
Full test portfolio
0
Critical Risk
Probability >70%
0
Elevated Risk
Probability 50โ€“70%
0
Watch List
Probability 20โ€“50%
0.00
Model AUC
Top 10% of competition
Risk Distribution
Bankruptcy Risk Across 8,000 Firms

Each bar = a 2% probability band. The spike near zero is not a flaw โ€” it reflects reality. Bankruptcy is rare. When the signal appears it is concentrated and unmistakable.

Safe 0โ€“20% Watch 20โ€“50% Elevated 50โ€“70% Critical >70%
93.7% of firms fall below 20% risk. The long tail to the right is where the signal lives โ€” and where intervention before it is too late has the highest return on every analyst-hour spent.
Portfolio Tiers
Tier Breakdown

Four action tiers from the model output. Each requires a different response.

Interactive Threshold Simulator
Risk Cutoff Simulator โ€” How Many Firms Need Review?

Drag the slider to change the probability threshold. Numbers update live โ€” connecting model output directly to the resource allocation decision every risk team has to make. Where you set the bar determines how many firms you flag.

Flag firms scoring at or above:
5%25%50%75%95%
50%
Current threshold
Portfolio Risk Heatmap
Every Firm, Visualised โ€” 8,000 Squares, Each One a Company

Each square = one firm, colored by risk tier, ordered highest to lowest risk left-to-right, top-to-bottom. Hover any square to see that firm's exact probability. Click to open its full risk profile. The rarity of red is the point.

Critical >70% Elevated 50โ€“70% Watch 20โ€“50% Safe <20%
The visual confirms the signal: 37 critical firms (red) stand out immediately against 7,492 safe firms (green). This is what a 4.73% base rate looks like at portfolio scale โ€” and why a model with AUC 0.9231 is needed to find them.
Live Risk Screener
Portfolio Screener โ€” All 8,000 Firms

Filter by risk tier, search by Firm ID, sort by probability. Click any row for the full risk profile. Export any filtered view directly to CSV.

8,000 firms
Firm ID Risk Probability Tier Action Signal
CSV includes Firm ID, probability, tier, rank and action signal
Showing top 150 per filter ยท Click any row to open full risk profile
Model Architecture
AutoGluon Stacking Pipeline

Seven model families, two stack levels, one weighted ensemble. No single algorithm wins โ€” the combination does.

Why stacking? Bankruptcy is non-linear. LightGBM catches ratio patterns. Neural nets learn interactions. Random Forest handles outliers. The weighted ensemble captures what none can alone โ€” that is why the AUC hits 0.9231.
Three Key Findings
What This Competition Proved

Lessons from building a top-3 bankruptcy prediction model in a live class competition against 37 teams.

๐Ÿšซ Accuracy is a lie at 4.73% base rate

A model that always predicts "safe" is 95.3% accurate and completely useless. AUC is the only honest metric for rare-event classification. This is not an academic point โ€” it is the difference between a tool that works and one that misleads.

๐Ÿ“ˆ arcsinh beats log for financial ratios

Financial ratios have extreme outliers and zeros. Log transform fails on zeros. arcsinh handles the full real line โ€” compressing extremes while preserving sign. One preprocessing choice moved the needle on AUC meaningfully.

๐Ÿ† Ensembling beats human intuition

Letting AutoGluon run 7 model families for 4 hours and stack the best produced AUC 0.9231 โ€” better than any manually selected algorithm. The best analysts know when to step back and let the machine explore the space.

Where This Lives in Industry
Real World Applications of This Framework

The same predict-risk โ†’ quantify-value โ†’ act-selectively logic runs inside every major financial institution today.