Credit Risk Modelling is the process of estimating the probability someone will pay back a loan - super exciting mathematical problem!
There are 5 fundamental ML systems for credit scoring:
Formally, credit risk modelling is the use of data, past repayment behaviour, demographics, income, device usage, transaction patterns, and more to estimate the probability that a borrower will default. These probabilities become the foundation of lending decisions: who gets approved, at what limit, and at what price.
Lending money isn’t just about goodwill it’s an investment. When lenders charge interest, they’re being compensated for three things:
If you lend $10 at a 20% annual rate, you expect $12 in return. But that expectation assumes full repayment. If some borrowers don’t repay, your realised return drops sharply.
Let’s say you lend to 10,000 people and estimate that 15% will default. You’ll lose the $10 you lent to those 1,500 people, a total of $15,000. To break even, the interest from the remaining 8,500 borrowers must cover that loss roughly an additional $1.76 each, or 17.6 percentage points more in interest.
<aside> 🧮
Let’s formalise that calculation
You lose L X P X N = 10 x 0.15 x 10,000 = $15,000.
That loss must be recovered from the non-defaulters, who are N(1-p) = 8,500 borrowers.
Each of those must contribute an extra charge x to cover the total loss:
x = (L x P) / (1 - P)
</aside>
So, if you initially charged 20%, you’d need to raise it to around 38% just to offset the expected losses. That’s unattractive for borrowers and risky for you. This is why interest calibration is such a delicate balancing act.