Pixels, Prompts & Pseudo-Text: AI's Word Problem
Season 1 · Episode 46

AI paints stunning images, but can't spell "cat." Why do advanced models struggle with simple text? Dive into AI's weird word problem!

My Weird Prompts · Daniel Rosehill

December 10, 2025 · 23m 53s

Show Notes

Why can advanced AI models generate breathtaking photorealistic landscapes and fantastical creatures with astonishing detail, yet consistently stumble over spelling a simple word like 'cat' on a t-shirt? This week on My Weird Prompts, co-hosts Corn and Herman dive into producer Daniel Rosehill's intriguing prompt: the pervasive and often comical challenge of 'pseudo-text' in AI image generation. They unpack the fundamental distinction between how AI processes visual information at a pixel level versus its understanding of symbolic language, revealing why generating coherent text within images is a far more complex multi-modal problem than it appears. Explore the cutting-edge "pipelined" solutions that integrate language models to improve accuracy, and