Modern Australian
The Times

Robodebt not only broke the laws of the land – it also broke laws of mathematics

  • Written by Noel Cressie, Distinguished Professor of Statistics, University of Wollongong
Friday marked the end of the public hearings for the Royal Commission into the Robodebt Scheme. They painted a picture of a catastrophic program that was legally and ethically indefensible – an example of how technological overreach, coupled with dereliction of duty, can amount to immense suffering for ordinary people.

The artificial intelligence (AI) algorithm behind Robodebt has been called “flawed”. But it was worse than that; it broke laws of mathematics. A mathematical law called Jensen’s inequality shows the Robodebt algorithm should have generated not only debts, but also credits.

What was Robodebt?

The Australian government’s Robodebt program was designed to catch people exploiting the Centrelink welfare system.

The system compared welfare recipients’ Centrelink-reported fortnightly income with their ATO-reported yearly income, the latter of which was averaged to provide fortnightly figures that could be lined up with Centrelink’s system.

If the difference showed an overpayment by Centrelink, a red flag was raised. The AI system then issued a debt notice and put the onus on the recipient to prove they weren’t exploiting the welfare system.

A Robodebt victim

To understand the extent of the failure, let’s consider a hypothetical case study. Will Gossett was a university student from 2017-2019. He was single, older than 18, and living at home with his parents.

Will received Centrelink payments according to his fortnightly income from a couple of casual jobs with highly variable work hours. In his first year at university his jobs didn’t pay much, so he received more Centrelink payments in the 2018 financial year than the year following.

The Robodebt algorithm took Will’s ATO yearly income records for both the 2018 and 2019 financial years and, for each year, averaged them into a series of fortnightly “robo” incomes.

Inside Robodebt’s AI world, his fortnightly incomes were then the same throughout the 2018 financial year, and the same throughout the 2019 financial year.

Will was honest with his claims, but was stunned to receive a debt notice for Centrelink overpayments made in the 2019 financial year – the year in which he actually received lower welfare payments.

The income-averaging algorithm gave Will an average fortnightly income for 2019 that was above the threshold that made him eligible for Centrelink payments. As far as the Robodebt system was concerned, Will shouldn’t have received any welfare payments that year.
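
The averaging step described above can be sketched in a few lines. This is a minimal illustration, not the scheme's actual code: the function name, the 26-fortnight year and Will's earnings pattern are all assumptions made for the example.

```python
FORTNIGHTS_PER_YEAR = 26  # assumption: a financial year is treated as 26 equal fortnights


def robo_incomes(annual_income, n_fortnights=FORTNIGHTS_PER_YEAR):
    """Spread a yearly income evenly over every fortnight (hypothetical helper)."""
    return [annual_income / n_fortnights] * n_fortnights


# Hypothetical casual earnings: nothing one fortnight, $1,000 the next.
actual_incomes = [0.0, 1000.0] * 13
annual_income = sum(actual_incomes)      # what a yearly ATO record would show
averaged = robo_incomes(annual_income)   # what the averaged view would contain

# Every averaged fortnight is identical, although the real ones were not.
print(sorted(set(actual_incomes)), sorted(set(averaged)))
```

The yearly total is preserved, but all information about which fortnights Will actually worked is lost.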

Jensen’s inequality

The laws of mathematics tell us when two things are equal, but they can also tell us when one thing is bigger than another. This type of law is called an “inequality”.

To understand why and when Robodebt failed for Will, we need to understand a concept called Jensen’s inequality, credited to Danish mathematician Johan Jensen (1859-1925).

Jensen’s inequality explains how making a decision based on the averaging of numbers leads to either a negative bias or a positive bias under a “convexity condition”, which I’ll explain soon.

You’ll recall Will is a single university student, above 18, and living with his parents. Based on these factors, Centrelink has a fortnightly payment table for him, illustrated with the curve in the figure below.

The figure shows the more income Will earns from his jobs, the less welfare payment he receives, until a specific income, after which he receives none.

[Figure: Will's fortnightly welfare payment as a function of his fortnightly income, created from tables provided by Centrelink.]

The parts of the curve where Jensen’s inequality is relevant are highlighted by two red squares. In the square on the left, the curve bends downwards (concave), and in the square on the right it bends upwards (convex).

Because Will’s income was higher in 2019 and spread across the part where the payment curve is convex, Jensen’s inequality guarantees he would receive a Robodebt notice, even though there was no debt.

In 2018, however, Will’s income distribution was spread around smaller amounts where the curve is concave. So if Jensen’s inequality was adhered to, the AI algorithm should have issued him a “Robocredit” – but it didn’t.
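
In symbols, the argument can be written as follows. The notation is introduced here for illustration only: $f$ is the payment curve, $x_1, \dots, x_n$ are Will's actual fortnightly incomes in a financial year, and $\bar{x}$ is their average.

```latex
\[
\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i ,
\qquad
\frac{1}{n}\sum_{i=1}^{n} f(x_i) \;\ge\; f(\bar{x}) \quad \text{where } f \text{ is convex},
\qquad
\frac{1}{n}\sum_{i=1}^{n} f(x_i) \;\le\; f(\bar{x}) \quad \text{where } f \text{ is concave}.
\]
```

The left-hand side is the average payment Will correctly received on his true fortnightly incomes; the right-hand side is the payment implied by his averaged income. Where the curve is convex, the averaged view makes it look as though he was paid too much (an apparent debt). Where it is concave, it makes it look as though he was paid too little (an apparent credit).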

It could be the algorithm contained a line of code that nullified Jensen’s inequality by instructing any credits be ignored.
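
A small numerical sketch shows both effects, along with what a one-sided rule of that kind would do. Everything here is hypothetical: the payment curve (a $400 maximum, reduced by 50 cents for each dollar earned above $200) and the incomes are invented for illustration and are not Centrelink's tables or the scheme's code.

```python
def payment(income, max_payment=400.0, free_area=200.0, taper=0.5):
    """Hypothetical payment curve: flat, then tapering, then zero."""
    reduced = max_payment - taper * max(0.0, income - free_area)
    return max(0.0, reduced)


def mean(values):
    return sum(values) / len(values)


def discrepancy(fortnightly_incomes):
    """Average payment correctly received minus payment at the averaged income.

    Positive: averaging suggests a debt. Negative: averaging suggests a credit.
    """
    paid = mean([payment(x) for x in fortnightly_incomes])
    robo_entitlement = payment(mean(fortnightly_incomes))
    return paid - robo_entitlement


def one_sided_debt(fortnightly_incomes):
    """Assumed rule that keeps apparent debts and silently drops apparent credits."""
    return max(0.0, discrepancy(fortnightly_incomes))


incomes_2018 = [0.0, 400.0] * 13     # low incomes, around the concave bend
incomes_2019 = [600.0, 1600.0] * 13  # higher incomes, around the convex bend

print(discrepancy(incomes_2018))     # -50.0: an apparent credit per fortnight
print(discrepancy(incomes_2019))     # 100.0: an apparent debt per fortnight
print(one_sided_debt(incomes_2018))  # 0.0: the credit disappears
```

In both years the payments were exactly right for the incomes actually earned, so the true discrepancy is zero. Only the averaging creates the apparent debt and credit.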

Big data and a bad algorithm

The people responsible for the Robodebt system should have had a strong interest in keeping error rates low. Data scientists have a big red “stop” button when error rates of automated systems go beyond a few percent.

It’s straightforward to estimate error rates for an AI scheme. Experts do this by running simulations inside a virtual model called a “digital twin”. These can be used to carry out statistical evaluations, and expose conscious and unconscious biases in bad algorithms.

In Robodebt’s case, a digital twin could have been used to figure out error rates. This would have required running the Robodebt algorithm through representative incomes simulated under two different scenarios.

Under the first scenario, incomes are simulated assuming no debt is owed by anyone. Every time a result is returned saying a debt is owed, a Type 1 (or false-positive) error is recorded. Under the second scenario, incomes are simulated assuming everyone owes a debt (to varying degrees). If a no-debt result is returned, a Type 2 (false-negative) error is recorded.

Then an error rate is estimated by dividing the number of errors by the number of simulations, within each scenario.
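
The first scenario can be sketched as a small simulation. This is a toy under stated assumptions, not a real digital twin: the payment curve, the income distribution (a casual worker who works in roughly half of all fortnights) and all function names are hypothetical. Because every simulated person is paid exactly what their true fortnightly income entitles them to, no one owes a debt, and every debt flag is a Type 1 error.

```python
import random

N_FORTNIGHTS = 26  # assumption: 26 fortnights per financial year


def payment(income, max_payment=400.0, free_area=200.0, taper=0.5):
    """Hypothetical payment curve: flat, then tapering, then zero."""
    return max(0.0, max_payment - taper * max(0.0, income - free_area))


def robodebt_flags_debt(fortnightly_incomes, tolerance=1e-9):
    """True if the averaged view says more was paid than was due."""
    paid = sum(payment(x) for x in fortnightly_incomes)
    average_income = sum(fortnightly_incomes) / len(fortnightly_incomes)
    robo_entitlement = payment(average_income) * len(fortnightly_incomes)
    return paid - robo_entitlement > tolerance


def simulate_honest_year(rng):
    """Hypothetical casual worker: no income in about half of all fortnights."""
    return [
        rng.uniform(0.0, 2000.0) if rng.random() < 0.5 else 0.0
        for _ in range(N_FORTNIGHTS)
    ]


def type1_error_rate(n_simulations=10_000, seed=0):
    """Share of honest simulated recipients who are flagged with a debt."""
    rng = random.Random(seed)
    errors = sum(
        robodebt_flags_debt(simulate_honest_year(rng))
        for _ in range(n_simulations)
    )
    return errors / n_simulations


print(f"Estimated Type 1 error rate: {type1_error_rate():.1%}")
```

The estimated rate depends entirely on the assumed curve and income distribution, so a real evaluation would need representative incomes. The second scenario would follow the same pattern, with simulated people who really were overpaid.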

Eye-watering inaccuracies

Although no consistently reliable error rates have been published for Robodebt, a figure of at least 27% was quoted in Parliament Question Time on February 7.

The reality was probably much worse. During the scheme, on the order of one million income reviews were performed, of which 81% led to a debt being raised.

Of these, about 70% (roughly 567,000 debts) were raised through the use of income averaging in the Robodebt algorithm.

In 2020, the government conceded about 470,000 debts had been falsely raised, out of a total of about 567,000.

Back-of-the-envelope calculations give a Type 1 (false-positive) error rate on the order of 80% (470,000/567,000). Compared to the usual target of a few percent, this is an eye-wateringly large error rate.
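
Written out step by step, using only the rounded figures quoted above, the arithmetic is:

```latex
\begin{align*}
\text{debts raised} &\approx 1{,}000{,}000 \times 0.81 = 810{,}000,\\
\text{debts raised through income averaging} &\approx 810{,}000 \times 0.70 = 567{,}000,\\
\text{share conceded as falsely raised} &\approx \frac{470{,}000}{567{,}000} \approx 0.83 .
\end{align*}
```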

If simulations had been run, or human intelligence used to check real cases, the “stop” button would have been hit almost immediately.

Jensen’s inequality establishes why and when income averaging will fail, yet income matching hasn’t gone away. It can be found in AI software used for official statistics, welfare programs, bank loans and so forth.

Deeper statistical theory for this “change of support” problem — for example, going from data on yearly support to fortnightly support — will be needed as AI becomes increasingly pervasive in essential parts of society.

Read more https://theconversation.com/robodebt-not-only-broke-the-laws-of-the-land-it-also-broke-laws-of-mathematics-201299
