Modern Australian

ChatGPT’s Studio Ghibli-style images show its creative power – but raise new copyright problems

  • Written by Kai Riemer, Professor of Information Technology and Organisation, University of Sydney

Social media has recently been flooded with images that look like they belong in a Studio Ghibli film. Selfies, family photos and even memes have been re-imagined with the soft pastel palette characteristic of the Japanese animation company co-founded by Hayao Miyazaki.

This followed OpenAI’s latest update to ChatGPT. The update significantly improved ChatGPT’s image generation capabilities, allowing users to create convincing Ghibli-style images in mere seconds. It has been enormously popular – so much so, in fact, that the system crashed due to user demand.

Generative artificial intelligence (AI) systems such as ChatGPT are best understood as “style engines”. And what we are seeing now is these systems offering users more precision and control than ever before.

But this is also raising entirely new questions about copyright and creative ownership.

How the new ChatGPT makes images

Generative AI programs work by producing outputs in response to user prompts, including prompts to create an image.

Previous generations of AI image generators used diffusion models. These models gradually refine random, noisy data into a coherent image. But the latest update to ChatGPT uses what’s known as an “autoregressive algorithm”.

This algorithm treats images more like language, breaking them down into “tokens”. Just as ChatGPT predicts the most likely words in a sentence, it can now predict different visual elements in an image separately.
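The contrast between the two approaches can be sketched in a few lines of Python. This is a toy illustration under loud assumptions, not OpenAI's implementation: the function names, the tiny four-value "image" and the `NEXT_TOKEN_PROBS` table are all hypothetical. A real diffusion model uses a trained neural network to decide what noise to remove, and a real autoregressive model conditions on every earlier token rather than only the last one.

```python
import random


def diffusion_style(target, steps=50, seed=0):
    """Refine random noise into `target` over many small steps.

    Assumption: the "denoiser" here already knows the target. In a real
    diffusion model a trained network estimates what to remove instead.
    """
    rng = random.Random(seed)
    image = [rng.gauss(0.0, 1.0) for _ in target]  # start from pure noise
    for step in range(steps):
        remaining = steps - step
        # Close an equal share of the remaining gap at every step.
        image = [x + (t - x) / remaining for x, t in zip(image, target)]
    return image


# Hypothetical table: how likely each visual token is to follow another.
NEXT_TOKEN_PROBS = {
    "<start>": {"sky": 0.7, "grass": 0.3},
    "sky": {"cloud": 0.6, "tree": 0.3, "sky": 0.1},
    "cloud": {"tree": 0.8, "sky": 0.2},
    "tree": {"grass": 0.9, "tree": 0.1},
    "grass": {"<end>": 1.0},
}


def autoregressive_style(max_tokens=10):
    """Build an image one token at a time, always picking the likeliest next token."""
    tokens = ["<start>"]
    while len(tokens) < max_tokens + 1:
        probs = NEXT_TOKEN_PROBS[tokens[-1]]
        next_token = max(probs, key=probs.get)  # greedy choice
        if next_token == "<end>":
            break
        tokens.append(next_token)
    return tokens[1:]  # drop the "<start>" marker
```

Calling `autoregressive_style()` returns `['sky', 'cloud', 'tree', 'grass']`, built left to right, whereas `diffusion_style([0.0, 0.5, 1.0, 0.5])` adjusts every value of the image a little at each step until the noise has gone.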

This tokenisation enables the algorithm to better separate certain features of an image – and their relationship with words in a prompt. As a result, ChatGPT can more accurately create images from precise user prompts than previous generations of image generators. It can replace or change specific features while preserving the rest of the image, and it improves on the longstanding issue of generating correct text in images.

A particularly powerful advantage of generating images inside a large language model is the ability to draw on all the knowledge already encoded in the system. This means users don’t need to describe every aspect of an image in painstaking detail. They can simply refer to concepts such as Studio Ghibli and the AI understands the reference.

The recent Studio Ghibli trend began with OpenAI itself, before spreading among Silicon Valley software engineers and then even governments and politicians – including seemingly unlikely uses such as the White House creating a Ghiblified image of a crying woman being deported and the Indian government promoting Prime Minister Narendra Modi’s narrative of a “New India”.

Understanding AI as ‘style engines’

Generative AI systems don’t store information in any traditional sense. Instead they encode text, facts, or image fragments as patterns – or “styles” – within their neural networks.

Trained on vast amounts of data, AI models learn to recognise patterns at multiple levels. Lower network layers might capture basic features such as word relationships or visual textures. Higher layers encode more complex concepts or visual elements.

This means everything – objects, properties, writing genres, professional voices – gets transformed into styles. When AI learns about Miyazaki’s work, it’s not storing actual Studio Ghibli frames (though image generators may sometimes produce close imitations of input images). Instead, it’s encoding “Ghibli-ness” as a mathematical pattern – a style that can be applied to new images.

The same happens with bananas, cats or corporate emails. The AI learns “banana-ness”, “cat-ness” or “corporate email-ness” – patterns that define what makes something recognisably a banana, a cat or a professional communication.

The encoding and transfer of styles has long been an explicit goal in visual AI. Now we have an image generator that achieves this with unprecedented scale and control.
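One classic way visual AI research has turned a style into numbers is the Gram matrix used in early neural style transfer: a table recording how strongly each pair of feature channels fires together, averaged over the whole image. The sketch below is a minimal pure-Python version with made-up feature values. It illustrates that older technique only and makes no claim about how ChatGPT represents styles internally.

```python
def gram_matrix(features):
    """Summarise which feature channels fire together across an image.

    `features` is a list of channels; each channel lists one activation
    per image position. The values used with it here are hypothetical.
    """
    n_positions = len(features[0])
    return [
        [sum(a * b for a, b in zip(ch_i, ch_j)) / n_positions for ch_j in features]
        for ch_i in features
    ]


# Two toy channels (say, "soft edges" and "pastel tones") over four positions.
original = [[1, 0, 2, 0], [0, 1, 0, 2]]
# The same activations with the positions reversed: a different layout.
rearranged = [[0, 2, 0, 1], [2, 0, 1, 0]]

style_a = gram_matrix(original)
style_b = gram_matrix(rearranged)
```

Here `style_a` and `style_b` are identical even though the two layouts differ. The summary keeps track of which features occur together, not where they sit in the picture, which is roughly what it means to separate a style from the content it is applied to.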

This approach unlocks remarkable creative possibilities across both text and images. If everything is a style, then these styles can be freely combined and transferred. That’s why we refer to these systems as “style engines”. Try creating an armchair in the style of a cat, or in elvish style.

The copyright controversy: when styles become identity

While the ability to work with styles is what makes generative AI so powerful, it’s also at the heart of growing controversy. For many artists, there’s something deeply unsettling about seeing their distinctive artistic approaches reduced to just another “style” that anyone can apply with a simple text prompt.

Hayao Miyazaki has not publicly commented on the recent trend of people using ChatGPT to generate images in his world-famous animation style. But he has been critical of AI previously.

This unease brings the questions about copyright and creative ownership into sharp focus.

Traditionally, copyright law doesn’t protect styles – only specific expressions. You can’t copyright a music genre such as “ska” or an art movement such as “impressionism”.

This limitation exists for good reason. If someone could monopolise an entire style, it would stifle creative expression for everyone else.

But there’s a difference between general styles and highly distinctive ones that become almost synonymous with someone’s identity. When an AI can generate work “in the style of Greg Rutkowski” – a Polish artist whose name was reportedly used in more than 93,000 prompts in the AI image generator Stable Diffusion – it potentially threatens both his livelihood and artistic legacy.

Some creators have already taken legal action.

In a case filed in late 2022, three artists formed a class to sue multiple AI companies, arguing that the companies’ image generators were trained on the artists’ original works without permission, and now allow users to generate derivative works mimicking their distinctive styles.

As technology evolves faster than the law, work is under way on new legislation to try to balance technological innovation with protecting artists’ creative identities.

Whatever the outcome, these debates highlight the transformative nature of AI style engines – and the need to consider both their untapped creative potential and more nuanced protections of distinctive artistic styles.

Read more https://theconversation.com/chatgpts-studio-ghibli-style-images-show-its-creative-power-but-raise-new-copyright-problems-253438
