How to identify AI generated text?

by | Thursday, June 01, 2023

I think I solved the biggest educational challenge of our time, namely:

How do we recognize AI generated text from human-created ones?

Just to provide some context, the advent of large language models and generative AI have made it essential that we, as educators, find ways to distinguish between AI generated and human generated text. This is the only way we can catch those students who are blatantly cheating the system by using these tools to complete their assignments.

And this is the issue of our time, and the consequences of not addressing it is well captured in this essay in The Atlantic, whose title says it all: The College Essay is Dead, And this is just one the many newspaper articles, editorials and blog posts that hit the panic button once ChatGPT and other generative AI language models entered our world. And of course, we have a range of responses to this perceived crisis. from out-right bans to finding strategies to mitigate the risks of plagiarism. And let us not forget all the AI based tools that have emerged to that detect and flag AI generated prose (though their accuracy is somewhat unclear).

Matters are complicated further by the fact that humans often misunderstand these technologies, often in hilarious ways. For instance there was the recent story of a lawyer who had GPT write his brief, only to discover that the AI had made up case precedents. These things hallucinate.

One such story that came my way, this morning, was that of a professor who flunked a bunch of students because he thought they had used ChatGPT to write their papers. And again, the title of the story says it all: Professor Flunks All His Students After ChatGPT Falsely Claims It Wrote Their Papers. Well… as it turns out, the professor didn’t really understand the technology and fell for its hallucinations. In an interesting twist, someone entered the professor’s own writing into ChatGPT3 and was told it was AI generated. As the Rolling Stone story reports it:

In an amusing wrinkle, Mumm’s claims appear to be undercut by a simple experiment using ChatGPT. On Tuesday, redditor Delicious_Village112 found an abstract of Mumm’s doctoral dissertation on pig farming and submitted a section of that paper to the bot, asking if it might have written the paragraph. “Yes, the passage you shared could indeed have been generated by a language model like ChatGPT, given the right prompt,” the program answered.

All this is funny enough, but again these, I think, are growing pains as we struggle to understand this fast moving piece of technology. What I DID want to point to were the reasons that ChatGPT gave for claiming the prose was authored by AI. This is important information, because once we understand what makes an AI generated text, we can flip the criteria around to find out what characterizes human writing.

Below is a screenshot of ChatGPT’s response to why the prose fed to it was most probably generated by AI:

What this tells us is that AI generated text is coherent, having a clear structure, cites sources and data accurately, and uses technical terminology correctly

And conversely, we now have the defining characteristics of human generated text. Human generated text is incoherent, gets citations (of sources and data) wrong, and uses terminology inaccurately.

There we go. I am so glad we figured THAT out.

And yes, you are welcome!

A few randomly selected blog posts…

India Week @ Erickson Hall

The Indian community in the greater Lansing area celebrates India Week every year (more or less) around March. [More details here and here.] As a part of this event I (and other members of the College of Education) have been organizing an Indian themed breakfast and...

Ambigrams & Mathematics at HYSA

Ambigrams & Mathematics at HYSA

The Gary K. Herberger Young Scholars Academy (HYSA) is a school designed for highly gifted students in grades 7-12 affiliated with the Mary Lou Fulton Teachers College and Arizona State University. Last Friday I had the pleasure and honor of working with all the...

Rate of change of technology

I just stumbled upon this image from a 1950 issue of Popular Mechanics. The tag line below the image says: Because everything in her home is waterproof, the housewife of 2000 can do her daily cleaning with a hose. Though it is easy to make fun of this image it can be...

An IQ test for color

If there is an IQ test for everything, why not one for color. This is Howard Gardner multiple intelligences run rampant. Check out the Color IQ test. BTW, my score was 27 (where 0 is a perfect score and 99 is as bad as you can get!). Irrespective of what you think of...

Defining design (one view)

I am on the Design Research Listerv and every once in a while a discussion rages online about the defining design. Gunnar Swanson (of the Gunnar Swanson Design Office and faculty at at East Carolina University) has created a flash movie that (as he says) "lays out...

Technology Integration 2.0 — was TPACK 😉

The recently concluded NECC conference had quite a bit of TPACK related presentations. Sadly neither Matt nor I could make it to NECC... maybe next year! One I discovered just today (h/t @mhines on twitter) was one titled School 2.0 & Understanding by Design....

Multiple representations of the periodic table and learning

Mishra & Yadav (2006) was a paper based around my dissertation research. It took a while to get published and I am including it here for the record. My dissertation (Mishra, 1998) was maybe the first place where I made a specific mention of the triad of...

Creativity & Courage

Creativity & Courage

Here is the next article in our series Rethinking Technology & Creativity in the 21st Century for the journal TechTrends. This article features an interview with Dr. Yong Zhao, Foundation Distinguished Professor in the School of Education at...

Post-lunch session: Geetha Narayanan

Geetha Narayanan, Director Mallya Aditi International School and Srishti School of Art Design and Technology, is someone I have wanted to meet for a long time. One of the pleasures of of this conference is getting an opportunity to hear her speak ... and I was not...

3 Comments

  1. Marckie Zeender

    As a large language model developed by OpenAI, I find this method of identifying AI generated text to be accurate.

    Reply
  2. Mehedy

    You are a masterpiece.This article is a breathtaking masterpiece that seamlessly weaves together profound insights and eloquent prose. It captures the essence of its subject with a grace that is both captivating and inspiring. A true gem that leaves the reader spellbound and hungry for more. Bravo!

    Reply
    • Punya Mishra

      This comment is clearly spam. But I approved it none-the-less, but only after deleting the URL and the email address that the spammer had included. So this comment will remain here … except it will not provide the link to the person’s website (which, was the sole purpose of this comment).

      Reply

Submit a Comment

Your email address will not be published. Required fields are marked *