Tech

This AI Gives Emojis Real Faces and the Results Are Horrifying

If you train a machine learning tool to find human features in emojis, the results are not pretty.
Collage of screenshots from “I Forced a Bot.”​
Image: Collage of screenshots from “I Forced a Bot.”

Withered eyes on child,
Red lips as pepperoni,
Ghost with gaping jaw.

This haiku was inspired by Jonathan Fly's horrifying post on his blog “I Forced a Bot.” Fly took a machine learning model—designed to reconstruct human faces using 16 x 16 pixel images of human faces—and used it to reconstruct human faces using 16 x 16 emojis, Twitch emotes, and video game sprites. I don’t like how it looks!

Image: Screenshot from “I Forced a Bot.”

Image: Screenshot from “I Forced a Bot.”

Human-inspired emojis, like the woman-raising-hand emoji or the sweating-while-smiling emoji, look disturbingly photo-realistic. But arguably the worse results come from object emojis, like the pizza or fountain emojis, that spurt eyes, hair, and lips when they shouldn’t.

Advertisement

It has the same effect of looking at that “realistic SpongeBob” art. Imposing realistic features on cutesy images will always be a bit jarring. Fly told Motherboard via Twitter DM that he made these images mainly because he was interested in what would happen.

“It's not so much inspiration as it is curiosity,” Fly said. “I guess it's just my default mode to try things the wrong way or that shouldn't work.”

Screenshot from “I Forced a Bot.”

Image: Screenshot from “I Forced a Bot.”

“My favorites are the fountain because it worked so unexpectedly well, and the pizza with human lips, because it's horrifying,” Fly said.

According to his blog, Fly trained his images based on an arxiv.org paper, which was published August 22 by researchers Korea Advanced Institute of Science and Technology. The paper, called "Progressive Face Super-Resolution via Attention to Facial Landmark," uses a method called face super-resolution, which makes artificial human faces using an input of real human faces. This isn’t a new technique, but this paper uses 16×16 pixel images as the input. Conveniently, Fly said, that’s just about the proportional size of many emojis and video game sprites. (Fly had to adjust the Twitch emote image sizes to apply the model to those images, and also scale down the resolution of the emojis he used.)

“These samples are cherry picked and many outputs were not very interesting,” Fly notes in his blog. “I also went to some lengths to encourage the model to make aggressive guesses about facial features.”

Fly posted the code used he used to make these emojis on Github. So if you have the patience, knowledge, and resources to replicate the machine learning tool used here, I guess you can try to attempt to duplicate these results! Do as you please, godspeed.

Screenshot from “I Forced a Bot.”

Image: Screenshot from “I Forced a Bot.”