Corporate Bankruptcy Early Warning System

Risk Distribution

Bankruptcy Risk Across 8,000 Firms

Each bar = a 2% probability band. The spike near zero is not a flaw — it reflects reality. Bankruptcy is rare. When the signal appears it is concentrated and unmistakable.

Safe 0–20% Watch 20–50% Elevated 50–70% Critical >70%

93.7% of firms fall below 20% risk. The long tail to the right is where the signal lives — and where intervention before it is too late has the highest return on every analyst-hour spent.

Portfolio Tiers

Tier Breakdown

Four action tiers from the model output. Each requires a different response.

Interactive Threshold Simulator

Risk Cutoff Simulator — How Many Firms Need Review?

Drag the slider to change the probability threshold. Numbers update live — connecting model output directly to the resource allocation decision every risk team has to make. Where you set the bar determines how many firms you flag.

Flag firms scoring at or above:

5%25%50%75%95%

50%

Current threshold

Portfolio Risk Heatmap

Every Firm, Visualised — 8,000 Squares, Each One a Company

Each square = one firm, colored by risk tier, ordered highest to lowest risk left-to-right, top-to-bottom. Hover any square to see that firm's exact probability. Click to open its full risk profile. The rarity of red is the point.

Critical >70% Elevated 50–70% Watch 20–50% Safe <20%

The visual confirms the signal: 37 critical firms (red) stand out immediately against 7,492 safe firms (green). This is what a 4.73% base rate looks like at portfolio scale — and why a model with AUC 0.9231 is needed to find them.

Live Risk Screener

Portfolio Screener — All 8,000 Firms

Filter by risk tier, search by Firm ID, sort by probability. Click any row for the full risk profile. Export any filtered view directly to CSV.

8,000 firms

Firm ID	Risk Probability	Tier	Action Signal

CSV includes Firm ID, probability, tier, rank and action signal

Showing top 150 per filter · Click any row to open full risk profile

Model Architecture

AutoGluon Stacking Pipeline

Seven model families, two stack levels, one weighted ensemble. No single algorithm wins — the combination does.

Why stacking? Bankruptcy is non-linear. LightGBM catches ratio patterns. Neural nets learn interactions. Random Forest handles outliers. The weighted ensemble captures what none can alone — that is why the AUC hits 0.9231.

Three Key Findings

What This Competition Proved

Lessons from building a top-3 bankruptcy prediction model in a live class competition against 37 teams.

🚫 Accuracy is a lie at 4.73% base rate

A model that always predicts "safe" is 95.3% accurate and completely useless. AUC is the only honest metric for rare-event classification. This is not an academic point — it is the difference between a tool that works and one that misleads.

📈 arcsinh beats log for financial ratios

Financial ratios have extreme outliers and zeros. Log transform fails on zeros. arcsinh handles the full real line — compressing extremes while preserving sign. One preprocessing choice moved the needle on AUC meaningfully.

🏆 Ensembling beats human intuition

Letting AutoGluon run 7 model families for 4 hours and stack the best produced AUC 0.9231 — better than any manually selected algorithm. The best analysts know when to step back and let the machine explore the space.

Where This Lives in Industry

Real World Applications of This Framework

The same predict-risk → quantify-value → act-selectively logic runs inside every major financial institution today.