Modern Australian
The Times

AI systems are built on English – but not the kind most of the world speaks

  • Written by Celeste Rodriguez Louro, Associate professor, Chair of Linguistics and Director of Language Lab, The University of Western Australia
AI systems are built on English – but not the kind most of the world speaks

An estimated 90% of the training data for current generative AI systems stems from English. However, English is an international lingua franca with about 1.5 billion speakers worldwide, and countless varieties.

So whose English is today’s technology based on? The answer is primarily the English of mainstream America.

This is no accident. Mainstream American English is entrenched in the digital infrastructure of the internet, in Silicon Valley’s corporate priorities, and in the data sets that fuel everything from autocorrect to AI-generated synthetic text.

The consequence? AI models produce a monolithic version of English that erases variation, excludes minoritised and regional voices, and reinforces unequal power dynamics.

The hegemony of mainstream American English

The proliferation of American English online is a result of historical, economic and technological factors. The United States has been a dominant force in the development of the internet, content creation, and the rise of tech giants such as Google, Meta, Microsoft and OpenAI.

Unsurprisingly, the linguistic norms embedded in products by these companies are overwhelmingly mainstream American.

A recent study found that speakers of non-mainstream English were frustrated with the “homogeneity of AI accents” in voice-cloning and speech-generation technologies. One participant noted the predominant mainstream American accents in the voices available, stating the technologies had been built “with some other people in mind”.

Mainstream varieties of English have long reigned as the “standard” against which other varieties are weighed.

To take a single example from the US, linguistics research by John Baugh found that using different accents can determine people’s access to goods and services. When Baugh called different landlords about housing advertised in the local newspaper, using a mainstream accent procured him several housing inspections while using African-American and Latino accents did not.

The prestige of mainstream English also underpins algorithmic decisions. The models behind tools such as autocorrect, voice-to-text, or even AI writing assistants are most often trained on mainstream American-centric data. This is often scraped from the web, where US-based media, forums and platforms dominate.

This means variations in grammar, syntax and vocabulary from other varieties of English are systematically ignored, misinterpreted or outright “corrected”.

Whose English is perceived as adding value?

The stakes of this linguistic bias in favour of mainstream English become even higher when AI systems are deployed around the world.

If an AI tutor fails to understand a Nigerian English construction, who bears the cost? If a job application written in Indian English is marked down by an AI-powered resume scanner, what are the consequences? If an Australian First Nations elder’s oral history is transcribed by voice recognition software and the system fails to capture culturally significant terms, what knowledge is lost or misrepresented?

These questions are unfolding in real time as governments, educational institutions and corporations adopt AI technologies at scale.

Englishes, not English

The idea that there is one “good” or “correct” English is a myth. English is spoken in diverse forms across regions, shaped by local societies, cultures, histories and identities.

As Noongar writer and educator Glenys Collard and I have written, Aboriginal English has “its own structure, rules and the same potential as any other linguistic variety” and the same is true of other forms of English.

Indian English, for example, has lexical innovations such as “prepone” (the opposite of postpone). Singapore English (Singlish) integrates particles and syntactic features from Malay, Hokkien and Tamil.

These are not “broken” forms of English. Each community where English was imposed has gone on to make English its own.

English, and language more generally, is never static. It adapts to meet the needs of an ever-changing society and its speakers.

Yet in AI development, this linguistic diversity is often treated as noise rather than signal. Non-standardised varieties are underrepresented in training datasets, excluded from annotation schemes, and rarely feature in evaluation benchmarks.

This results in an AI ecosystem that is multilingual in theory, but monolingual in practice.

Toward linguistic justice in AI

So, what would it look like to build AI systems that recognise and respect a range of different forms of English?

A shift in mindset is required, from prescribing “correct” language to including many varieties of language. What we need are systems that accommodate linguistic variation.

This may involve supporting community-led efforts to document and digitise linguistic varieties on their own terms, bearing in mind not all linguistic varieties should be digitised or documented.

Collaboration across disciplines is also important. It requires linguists, technologists, educators and community leaders working together to ensure AI development is grounded in principles of linguistic justice.

The goal is not to “fix” language but to create technology that produces just outcomes. The focus should be on changing the technology, not the speaker.

Embracing Englishes

English has been a powerful vehicle of empire, but it has also been a tool of resistance, creativity and solidarity. Around the world, speakers have taken the language and made it their own. AI-enabled systems should be built to be as inclusive of this variability as possible.

So next time your phone tells you to “correct” your spelling, or an AI chatbot misunderstands your phrasing, ask yourself: whose English is it trying to model? And whose English is being left out?

Authors: Celeste Rodriguez Louro, Associate professor, Chair of Linguistics and Director of Language Lab, The University of Western Australia

Read more https://theconversation.com/ai-systems-are-built-on-english-but-not-the-kind-most-of-the-world-speaks-249710

Chatswood Tutoring And Its Role In Academic Achievement

Academic success often requires more than classroom attendance alone. Students face increasing expectations as they progress through school, particu...

Why Laser Hair Removal Treatments Continue Growing In Popularity

Managing unwanted hair can become time-consuming and frustrating for many people, especially when shaving, waxing, and other temporary methods requi...

Choosing the Right Devices for a Flexible Workplace

For IT leaders managing large fleets, the device layer is where workforce productivity and security policy meet. The shift towards flexible and hybrid...

How Business Advisory Services Help Companies Achieve Sustainable Growth

Every business owner aims to build a profitable and sustainable organisation. While dedication, innovation, and hard work are important, achieving l...

Why Body Contouring Has Become A Popular Cosmetic Treatment

Many people maintain healthy lifestyles through regular exercise and balanced eating habits but still struggle with stubborn areas of fat that are dif...

How to Choose the Right POS Hardware for Your Business in Australia

A lot of Australian business owners spend weeks researching POS software but buy hardware almost as an afterthought. That's a mistake. The wrong har...

Why Material Handling Hose Is Critical for Industrial Efficiency

A high-performance material handling hose is an essential component in industries that transport abrasive, dry, or bulk materials on a daily basis...

How to Choose the Right Lawyer in Melbourne for Your Situation

Choosing legal support can feel difficult, especially when the stakes are personal or business-related. The right lawyer in Melbourne should underst...

Hoteliers Look to Clever Value Adds to Increase Revenue

The Australian hospitality industry is still in recovery mode after a notoriously rough patch in recent years. While there has been a post-COVID tra...

Moving to Queensland? Here’s How to Prep Your Car for the Big Move North

There’s no sign of the northern migration slowing down, with thousands of southerners fleeing from chaotic lifestyles and cooler climates for a brig...

Diesel Shortage to Impact Trades and Contractors

Strait of Hormuz blockage affecting all major parts of trades and construction Trades and construction across residential, commercial and industria...

Why Holiday Home Owners Turn to Rental Management Agents

The Allure — and the Reality — of Renting Out Your Property Owning a holiday home is a dream for many Australians. Whether it's a beachside sha...

Why Finding Reliable Doctors In Bundoora Is Important For Long-Term Health

Access to quality healthcare plays an important role in maintaining overall wellbeing and managing health concerns early. Trusted Doctors in Bundoor...

Understanding the Different Types of Car Services: Minor vs Major

When it comes to car maintenance, one of the most important things every vehicle owner should understand is the difference between a minor and a maj...

How Superannuation and TPD Insurance Work Together

Superannuation is an essential part of financial planning in Australia. It is designed to provide individuals with income during retirement, helping...

Tiny Towns funding granted for Mt Hotham and Mt Buller upgrades

Alpine Resorts Victoria (ARV) has welcomed funding support from the Victorian Government’s  Tiny Towns Fund, with both Mt Hotham and Mt Buller se...

Locksmith Services: Why Professional Security Solutions Matter More Than Ever

Security is a critical concern for homeowners, businesses, and vehicle owners alike. Whether it involves protecting a property, replacing damaged lo...

Why Tooth Fillings Are Important For Protecting Damaged Teeth

Cavities and minor tooth damage are common dental problems that can worsen if left untreated. Professional tooth fillings help restore damaged teeth, ...