Modern Australian

Philosophers have studied 'counterfactuals' for decades. Will they help us unlock the mysteries of AI?

  • Written by Sam Baron, Associate Professor, Philosophy of Science, Australian Catholic University
Artificial intelligence is increasingly being rolled out all around the world to help make decisions in our lives, whether it’s loan decisions by banks, medical diagnoses, or US law enforcement predicting a criminal’s likelihood of re-offending.

Yet many AI systems are black boxes: no one understands how they work. This has led to a demand for “explainable AI”, so we can understand why an AI model yielded a specific output, and what biases may have played a role.

Explainable AI is a growing branch of AI research. But what’s perhaps less well known is the role philosophy plays in its development.

Specifically, one idea called “counterfactual explanation” is often put forth as a solution to the black box problem. But once you understand the philosophy behind it, you can start to understand why it falls short.

Why explanations matter

When AI is used to make life-changing decisions, the people impacted deserve an explanation of how that decision was reached. This was recently recognised through the European Union’s General Data Protection Regulation, which supports an individual’s right to explanation.

The need for explanation was also highlighted in the Robodebt case in Australia, where an algorithm was used to predict debt levels for individuals receiving social security. The system made many mistakes, placing people in debt who should never have been.

It was only once the algorithm was fully explained that the mistake was identified – but by then the damage had been done. The outcome was so damaging it led to a royal commission being established in August 2022.

In the Robodebt case, the algorithm in question was fairly straightforward and could be explained. We should not expect this to always be the case going forward. Current AI models that use machine learning to process data are much more sophisticated.

The big, glaring black box

Suppose a person named Sara applies for a loan. The bank asks her to provide information including her marital status, debt level, income, savings, home address and age.

The bank then feeds this information into an AI system, which returns a credit score. The score is low and is used to disqualify Sara for the loan, but neither Sara nor the bank employees know why the system scored Sara so low.

Unlike with Robodebt, the algorithm being used here may be extremely complicated and not easily explained. There is therefore no straightforward way to know whether it has made a mistake, and Sara has no way to get the information she needs to argue against the decision.

This scenario isn’t entirely hypothetical: loan decisions are likely to be outsourced to algorithms in the US, and there’s a real risk they will encode bias. To mitigate risk, we must try to explain how they work.

The counterfactual approach

Broadly speaking, there are two types of approaches to explainable AI. One involves cracking open a system and studying its internal components to discern how it works. But this usually isn’t possible due to the sheer complexity of many AI systems.

The other approach is to leave the system unopened, and instead study its inputs and outputs, looking for patterns. The “counterfactual” method falls under this approach.

Counterfactuals are claims about what would happen if things had played out differently. In an AI context, this means considering how the output from an AI system might be different if it receives different inputs. We can then supposedly use this to explain why the system produced the result it did.

One example of a counterfactual would be to ask what the world might be like had the internet never been developed.

Suppose the bank feeds its AI system different (manipulated) information about Sara. From this, the bank works out that the smallest change Sara would need to make to get a positive outcome would be to increase her income.

The bank can then apparently use this as an explanation: Sara’s loan was denied because her income was too low. Had her income been higher, she would have been granted a loan.
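To make the procedure concrete, here is a minimal sketch in Python of the kind of search described above. Everything in it is an assumption made for illustration: the `Applicant` fields, the `black_box_score` formula, the approval threshold and Sara's figures are hypothetical and are not taken from any real bank or credit model. The search never looks inside the scoring function. It only resubmits altered copies of Sara's application and watches the output.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class Applicant:
    income: int   # annual income in dollars (hypothetical field)
    debt: int     # outstanding debt in dollars (hypothetical field)
    savings: int  # savings in dollars (hypothetical field)


APPROVAL_THRESHOLD = 600  # assumed cut-off, for illustration only


def black_box_score(a: Applicant) -> int:
    """Hypothetical stand-in for an opaque credit model.

    The search below treats this function as unopened: it calls it
    but never inspects the formula.
    """
    return 300 + a.income // 250 - a.debt // 500 + a.savings // 1000


def approved(a: Applicant) -> bool:
    return black_box_score(a) >= APPROVAL_THRESHOLD


def smallest_income_increase(
    a: Applicant, step: int = 1_000, max_increase: int = 200_000
) -> Optional[int]:
    """Return the smallest tested income increase that flips the
    decision to 'approved', or None if none is found in range."""
    for extra in range(0, max_increase + 1, step):
        if approved(replace(a, income=a.income + extra)):
            return extra
    return None


sara = Applicant(income=50_000, debt=20_000, savings=5_000)
print(approved(sara))                  # False
print(smallest_income_increase(sara))  # 34000
```

In this toy setup the search reports that an extra $34,000 of income would have changed the outcome. That is a statement about how this particular function responds to altered inputs, which is all the probe can observe. The sketch also varies only one input, whereas a fuller search would consider changes across several inputs at once.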

Such counterfactual explanations are being seriously considered as a way of satisfying the demand for explainable AI, including in cases of loan applications and using AI to make scientific discoveries.

However, as researchers have argued, the counterfactual approach is inadequate.

Correlation and explanation

When we consider changes to the inputs of an AI system and how they translate into outputs, we manage to gather information about correlations. But, as the old adage goes, correlation is not causation.

That’s a problem because work in philosophy suggests causation is tightly connected to explanation. To explain why an event occurred, we need to know what caused it.

On this basis, it may be a mistake for the bank to tell Sara her loan was denied because her income was too low. All it can really say with confidence is that income and credit score are correlated – and Sara is still left without an explanation for her poor result.

What’s needed is a way to turn information about counterfactuals and correlations into explanatory information.

The future of explainable AI

With time we can expect AI to be used more for hiring decisions, visa applications, promotions and state and federal funding decisions, among other things.

A lack of explanation for these decisions threatens to substantially increase the injustice people will experience. After all, without explanations we can’t correct mistakes made when using AI. Fortunately, philosophy can help.

Explanation has been a central topic of philosophical study over the last century. Philosophers have designed a range of methods for extracting explanatory information from a sea of correlations, and have developed sophisticated theories about how explanation works.

A great deal of this work has focused on the relationship between counterfactuals and explanation. I’ve developed work on this myself. By drawing on philosophical insights, we may be able to develop better approaches to explainable AI.

At present, however, there’s not enough overlap between philosophy and computer science on this topic. If we want to tackle injustice head-on, we’ll need a more integrated approach that combines work in these fields.

Authors: Sam Baron, Associate Professor, Philosophy of Science, Australian Catholic University

Read more https://theconversation.com/philosophers-have-studied-counterfactuals-for-decades-will-they-help-us-unlock-the-mysteries-of-ai-196392
