How to identify AI generated text?

by | Thursday, June 01, 2023

I think I solved the biggest educational challenge of our time, namely:

How do we recognize AI generated text from human-created ones?

Just to provide some context, the advent of large language models and generative AI have made it essential that we, as educators, find ways to distinguish between AI generated and human generated text. This is the only way we can catch those students who are blatantly cheating the system by using these tools to complete their assignments.

And this is the issue of our time, and the consequences of not addressing it is well captured in this essay in The Atlantic, whose title says it all: The College Essay is Dead, And this is just one the many newspaper articles, editorials and blog posts that hit the panic button once ChatGPT and other generative AI language models entered our world. And of course, we have a range of responses to this perceived crisis. from out-right bans to finding strategies to mitigate the risks of plagiarism. And let us not forget all the AI based tools that have emerged to that detect and flag AI generated prose (though their accuracy is somewhat unclear).

Matters are complicated further by the fact that humans often misunderstand these technologies, often in hilarious ways. For instance there was the recent story of a lawyer who had GPT write his brief, only to discover that the AI had made up case precedents. These things hallucinate.

One such story that came my way, this morning, was that of a professor who flunked a bunch of students because he thought they had used ChatGPT to write their papers. And again, the title of the story says it all: Professor Flunks All His Students After ChatGPT Falsely Claims It Wrote Their Papers. Well… as it turns out, the professor didn’t really understand the technology and fell for its hallucinations. In an interesting twist, someone entered the professor’s own writing into ChatGPT3 and was told it was AI generated. As the Rolling Stone story reports it:

In an amusing wrinkle, Mumm’s claims appear to be undercut by a simple experiment using ChatGPT. On Tuesday, redditor Delicious_Village112 found an abstract of Mumm’s doctoral dissertation on pig farming and submitted a section of that paper to the bot, asking if it might have written the paragraph. “Yes, the passage you shared could indeed have been generated by a language model like ChatGPT, given the right prompt,” the program answered.

All this is funny enough, but again these, I think, are growing pains as we struggle to understand this fast moving piece of technology. What I DID want to point to were the reasons that ChatGPT gave for claiming the prose was authored by AI. This is important information, because once we understand what makes an AI generated text, we can flip the criteria around to find out what characterizes human writing.

Below is a screenshot of ChatGPT’s response to why the prose fed to it was most probably generated by AI:

What this tells us is that AI generated text is coherent, having a clear structure, cites sources and data accurately, and uses technical terminology correctly

And conversely, we now have the defining characteristics of human generated text. Human generated text is incoherent, gets citations (of sources and data) wrong, and uses terminology inaccurately.

There we go. I am so glad we figured THAT out.

And yes, you are welcome!

A few randomly selected blog posts…

Jere Brophy, 1940 – 2009

There is a nice article in the State News about Jere Brophy including quotes from his daughter Cheri Spier, my department chair Dick Prawat, and my former advisee (now faculty member at Drexel) Aroutis Foster. Read MSU professor dies, honored by colleagues as field...

Contruction (sic)

Check out this page of examples of bad design. Some of these look too crazy to be true - but who knows... stranger things have happened. Interestingly enough the title of the page is "Award winning contructions!" I wonder if that is deliberate. Site worth sharing with...

Breaking free of academic publishers

It appears that the arts and sciences faculty at Harvard are considering publishing all their scholarship freely online. Here is a NYTimes story titled At Harvard, a Proposal to Publish Free on Web. This is truly wonderful news and long overdue. I have been doing...

Open source conferencing

Just found out about Dimdim (bad name!) from Manas Chakrabarti's blog, At Any Rate. Dimdim is an opensource, free web conferencing service where you can share your desktop, show slides, collaborate, chat, talk and broadcast via webcam with absolutely no download...

21st century learning, TPACK and other fun stuff

I have been invited to participate in the 2014 Educational Technology Summit: Empowering Educators to Enhance Student Learning in the Digital Era. This conference is being organized by Common Sense Media, Annenberg Retreat at Sunnylands, & the LEAD Commission. I...

On becoming a website

I wrote this essay a few years ago, around the time I was going up for tenure. I saw writing this as a welcome change from the usual academic stuff I had been writing. I was bored and tired of taking on this third-person, impersonal intellectual voice and just wanted...

Appreciate the magic…

Louis CK on appreciating the magic of technology... [youtube width="425" height="355"]http://www.youtube.com/watch?v=rOtEQB-9tvk[/youtube]

Creativity in Surgery, Music & Cooking

Creativity in Surgery, Music & Cooking

Here is the next article in our series Rethinking Technology & Creativity in the 21st Century for the journal TechTrends. In this article we feature an interview with Dr. Charles Limb,  professor of Otolaryngology and a...

How does my browser know I am Indian?

Over the past few weeks I have noticed that some webpages I visit have banner ads that are targeted to me quite specifically - in particular to my Indian origin. For instance this page (a story about ipods being used by the army) contains a set of banner ads that seek...

3 Comments

  1. Marckie Zeender

    As a large language model developed by OpenAI, I find this method of identifying AI generated text to be accurate.

    Reply
  2. Mehedy

    You are a masterpiece.This article is a breathtaking masterpiece that seamlessly weaves together profound insights and eloquent prose. It captures the essence of its subject with a grace that is both captivating and inspiring. A true gem that leaves the reader spellbound and hungry for more. Bravo!

    Reply
    • Punya Mishra

      This comment is clearly spam. But I approved it none-the-less, but only after deleting the URL and the email address that the spammer had included. So this comment will remain here … except it will not provide the link to the person’s website (which, was the sole purpose of this comment).

      Reply

Submit a Comment

Your email address will not be published. Required fields are marked *