Using AI to digitally clone myself (AKA creating a Puny-Punya)

by | Sunday, March 19, 2023

Note: The photo-manipulated image of me holding my own head was created almost 20 years ago by Paul Kurf, a student in my learning by design, class! Image design & layout, Punya


Ethan Mollick is a professor at Wharton and he has been doing some of the most interesting work in playing with ChatGPT3 and other generative language models, particularly exploring their role in teaching. If you haven’t been following him, and are even peripherally interested in these issues, do check out his substack: One useful thing.

One of his experiments involved creating video of himself that was entirely AI generated (see A quick and easy guide to cloning yourself).

Of course I had to try this for myself to create my own mini-Me (or should it be Puny Punya).

I decided to create a short video of myself introducing an episode of Just an Hour—a series of informal conversations that Betty Gee and I co-host every Friday for faculty and doctoral students. This felt appropriate for two reasons. First, that particular session was devoted to generative AI in higher education, and it seemed right to provide an example of what this could look like. Second, I knew was going to miss that particular episode, since I was flying back from the SITE conference, so it made sense to share a short video introducing the session and sending my regrets for missing the meeting.

What that in mind, I asked ChatGPT3 to write a short script on my behalf, in first person. In parallel I had trained Eleven Labs to generate my voice by reading it 2 – 3 minutes or prose from my website. Eleven Labs took that recording and created a virtual voice for me, which could then read out any prose it was given, in that voice. I gave it the prose created by ChatGPT3 and it read it back to me in my “digitally generated” voice. Finally, I uploaded a photo of myself and the MP3 to the D-id website. (I had previously used this website to create the video for my learning styles blog post),

Within minutes I had a video I could download.

So just to be clear, the video was created by text, audio and video all generated by AI (with minimal input from my side.

So does this video look and sound like me? I would ask you to be the judge of that but before we get to the videos, a few things to note.

  1. This entire process took me around 30 minutes (and that includes time it took to creating the accounts, training Eleven Labs on my voice, taking a picture, uploading it, and so on.
  2. Also, all this was done by spending just one dollar! Eleven Labs needed me to sign up and pay for the service but since they were having a sale I got to use it for that amazing price!
  3. The final product is sort of weird, and does not look extremely realistic. But just the fact that it even exists is amazing. I am sure if I had spent more time and money I could have had a much better product.
  4. I also experimented with creating a low-res version of the video and strangely enough it actually seems more realistic and believable than the hi-res one. It is almost as if the graininess of the video makes us ignore the other glitches.
  5. Last but surely not the least, Eleven Labs totally messed with my voice and accent. if you have heard me speak, or recognize my voice, you will immediately realize that my voice has been changed quite dramatically, removing almost all traces of my my Indian accent. Again, given my previous experience with these technologies, I am not surprised at all at this!

With that, here are the two videos, first the hi-res version followed by the low-res version. What do you think?

The Hi-Res version of the video.

The Low-Res version of the video.

The key question of course that we should all be asking ourselves: What does it mean when the cost of creating content like this drops to zero.


Addendum

For the record, below is the image that Paul Kurf created for me almost 20 years ago!

Despite my complaints about how Eleven Labs messed up my accent, it clearly does a better job with other voices, as evidenced by this story I read today about how an AI generated voice allowed someone to break into a bank account.

A few randomly selected blog posts…

Using eclipses to see

Let me start with two questions: First, what is the shape of the Earth? And two, what shapes does the sun cast on the ground when filtered through the leaves of a tree? Of course we know the answer to the first question. The pictures from space show clearly this...

Tools “R” Us: When objects become you

Tools “R” Us: When objects become you

Danah Henriksen shared an article with me recently “When objects become extensions of you.” It is an interesting piece arguing that “Whether they are tools, toys, or mirror reflections, external objects temporarily become part of who we are all the time.” Essentially,...

Cosmetic changes

I have made some cosmetic changes to the way the blog looks. The sidebars are now light blue, to differentiate them from the middle (content heavy) column. Once I did this I realized that I did not need that boxy border around the middle column, and pouf, it was gone....

The Innocent

I first read Ian McEwan many years ago (in the 80's I think) when he wrote grim and macabre novels and short stories, full of strange dark humor. I found him somewhat interesting but not enough to seek out his books. And then, years later, this past fall I read...

No excuses! Veja du (or don’t you)

Excusado by Edward Weston I have written earlier about the idea of veja du (which ended up becoming an assignment in my creativity class). To recap: ... if déjà vu is the process by which something strange becomes, abruptly and surprisingly familiar, véjà du is the...

Generative AI in Education: Keynote at UofM-Flint

Generative AI in Education: Keynote at UofM-Flint

A couple of weeks ago I was invited to give a keynote at the Frances Willson Thompson Critical Issues Conference on Generative AI in Education. It was great to go back to Michigan even if for a super short trip. One of the pleasures of the visit was catching up with...

Shulman on learning

Shulman on learning

One of my favorite quotes about learning. From this article, Taking Learning Seriously the entirety of which is worth reading. But for now here is the quote, and a visual (just because): Learning is least useful when it is private and hidden; it is most powerful when...

TPACK newsletter #4, Aug – Sept 09


Welcome to the fourth edition of the TPACK Newsletter, now with 494 subscribers (representing a 36% increase during the last four months!), and appearing bimonthly between August and April. If you are not sure what TPACK is, please surf over to www.tpack.org...

Does the Internet mean that knowledge is obsolete?

I was recently interviewed by Wired magazine for a story about Sugata Mitra's (of Hole in the Wall fame) experiments with minimally invasive learning, or more recently what are called SOLE (Self Organized Learning Environment) classrooms / schools. I have been...

0 Comments

Submit a Comment

Your email address will not be published. Required fields are marked *