Modern Australian

I got generative AI to attempt an undergraduate law exam. It struggled with complex questions

  • Written by Armin Alimardani, Lecturer, School of Law, University of Wollongong

It’s been nearly two years since generative artificial intelligence was made widely available to the public. Some models showed great promise by passing academic and professional exams.

For instance, GPT-4 scored higher than 90% of the United States bar exam test takers. These successes led to concerns AI systems might also breeze through university-level assessments. However, my recent study paints a different picture, showing it isn’t quite the academic powerhouse some might think it is.

My study

To explore generative AI’s academic abilities, I looked at how it performed on an undergraduate criminal law final exam at the University of Wollongong – one of the core subjects students need to pass in their degrees. There were 225 students doing the exam.

The exam ran for three hours and had two sections. The first asked students to evaluate a case study about criminal offences and the likelihood of a successful prosecution. The second included a short essay and a set of short-answer questions.

The test questions evaluated a mix of skills, including legal knowledge, critical thinking and the ability to construct persuasive arguments.

Students were not allowed to use AI for their responses and did the assessment in a supervised environment.

I used different AI models to create ten distinct answers to the exam questions.

Five papers were generated by just pasting the exam question into the AI tool without any prompts. For the other five, I gave detailed prompts and relevant legal content to see if that would improve the outcome.

I handwrote the AI-generated answers in official exam booklets and used fake student names and numbers. These AI-generated answers were mixed with actual student exam answers and anonymously given to five tutors for grading.

Importantly, when marking, the tutors did not know AI had generated ten of the exam answers.

Image caption: We handwrote the AI answers so markers would think they were done by students. (Kate Aedon/Shutterstock)

How did the AI papers perform?

When the tutors were interviewed after marking, none of them suspected any answers were AI-generated.

This shows the potential for AI to mimic student responses and educators’ inability to spot such papers.

But on the whole, the AI papers were not impressive.

While the AI did well in the essay-style question, it struggled with complex questions that required in-depth legal analysis.

This means even though AI can mimic human writing style, it lacks the nuanced understanding needed for complex legal reasoning.

The students’ exam average was 66%.

The AI papers that had no prompting, on average, only beat 4.3% of students. Two barely passed (the pass mark is 50%) and three failed.

The papers where prompts were used beat, on average, 39.9% of students. Three of these papers weren’t impressive and received 50%, 51.7% and 60%, but two did quite well. One scored 73.3% and the other scored 78%.
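To put those five marks next to the class average, here is a rough sketch of the arithmetic. The five scores and the 66% average come from the figures above. The `share_beaten` helper and the cohort marks are hypothetical, invented only to illustrate what "beat X% of students" could mean. The study's actual ranking method and student marks are not given here, and the reported 39.9% is an average of per-paper rankings, not the ranking of the average mark.

```python
from statistics import mean

# Scores of the five prompted AI papers, as reported above (percent).
prompted_scores = [50.0, 51.7, 60.0, 73.3, 78.0]
student_average = 66.0  # class average reported in the article

# Mean mark of the prompted papers and its gap to the class average.
prompted_mean = round(mean(prompted_scores), 1)
gap = round(student_average - prompted_mean, 1)


def share_beaten(score, cohort):
    """Percentage of cohort marks strictly below `score` (assumed definition)."""
    return 100 * sum(m < score for m in cohort) / len(cohort)


# Hypothetical cohort marks, invented purely for illustration.
hypothetical_cohort = [45, 52, 58, 61, 66, 70, 74, 79, 83, 88]

print(prompted_mean)
print(gap)
print(share_beaten(73.3, hypothetical_cohort))
```

On these numbers, the prompted papers average 62.6%, which is 3.4 percentage points below the students' 66%.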

Image caption: Generative AI has gained a reputation for passing difficult exams. (Tada Images/Shutterstock)

What does this mean?

These findings have important implications for both education and professional standards.

Despite the hype, generative AI isn’t close to replacing humans in intellectually demanding tasks such as this law exam.

My study suggests AI should be viewed more like a tool, and when used properly, it can enhance human capabilities.

So schools and universities should concentrate on developing students’ skills to collaborate with AI and analyse its outputs critically, rather than relying on the tools’ ability to simply spit out answers.

Further, to make collaboration between AI and students possible, we may have to rethink some of the traditional notions we have about education and assessment.

For example, we might consider that when a student prompts, verifies and edits an AI-generated work, that is their original contribution and should still be viewed as a valuable part of learning.

Authors: Armin Alimardani, Lecturer, School of Law, University of Wollongong

Read more https://theconversation.com/i-got-generative-ai-to-attempt-an-undergraduate-law-exam-it-struggled-with-complex-questions-240021
