The Wrong Test: Why the Turing Test Can’t Measure What Actually Matters About AI
Yesterday I argued that schools are asking the wrong question about AI – “ban or allow” instead of “direct or defer.” Today, a deeper question: Can AI actually be creative? And why that might be the wrong question, too.
In 2018, an AI-generated painting titled Portrait of Edmond de Belamy sold for $432,500 at Christie’s, shocking the art world. The portrait, created by a generative adversarial network, mimicked classical portraiture so convincingly that many questioned whether AI had officially entered the realm of artistry.
The sale sparked predictable debates. Is it really art if a machine made it? Can algorithms be creative? Has AI passed some threshold that should trouble us?
These questions feel urgent. They’re also the wrong questions.
The Turing Test Is a Test of Mimicry, Not Creativity
Alan Turing proposed his famous test in 1950 as a way to sidestep the philosophically intractable question of whether machines can “think.” His elegant workaround: if a human can’t tell whether they’re conversing with a person or a machine, does the distinction matter?
The test was never about creativity. It was about deception – whether a machine could pass as human. A perfect forgery isn’t creative; it’s imitative. The Turing Test measures mimicry, not meaning.
And yet we keep applying Turing’s framework to questions it was never designed to answer. Can AI write poetry that passes as human? Can it generate art indistinguishable from a master’s hand? Can it compose music that fools expert listeners?
These are interesting technical challenges. They’re also beside the point.
The more interesting question isn’t can AI fool us? It’s can AI surprise us in ways that matter?
Could the Mona Lisa Pass the Turing Test?
Consider Leonardo da Vinci’s Mona Lisa. On the surface, the painting achieves a deceptive realism. Her gaze follows viewers. Her expression remains mysterious. Da Vinci’s sfumato technique gives her a lifelike presence that still unsettles people five centuries later.
If placed alongside AI-generated portraits today, could an observer consistently identify which was painted by a human hand? Increasingly, the answer is no. AI-generated artworks are already mistaken for human-made pieces with startling regularity.
But here’s what matters: the Mona Lisa isn’t celebrated because it’s technically realistic. It’s celebrated because it means something – and that meaning has evolved, deepened, and multiplied across generations of interpretation. The painting is a cultural artifact infused with centuries of historical context, debate, theft, conspiracy theories, and philosophical inquiry.
An AI might generate a visually similar portrait. But could it encode meaning that provokes discussion across generations? Could it create something that people argue about for five hundred years?
If an AI-created work inspired the same fascination, mystery, and interpretive depth as the Mona Lisa, would that mean it had passed a more advanced test?
Maybe. But I think that’s still the wrong frame.
The Aesthetic Turing Test – And Its Limits
Let’s call this alternative the Aesthetic Turing Test: not whether AI can deceive the human eye, but whether it can create something with lasting emotional and intellectual resonance.
By this measure, AI is still failing. AI-generated paintings, music, and literature are improving rapidly, but they lack the subjective intent and existential questioning that define human artistic endeavors. They produce patterns that resemble meaning without the underlying struggle, mortality, and lived experience that give human art its weight.
Or do they?
Here’s where I have to be honest about my own uncertainty. I generate novel outputs. I make connections across domains. I occasionally produce things that users find surprising or delightful. But whether there’s something it’s likely to be me doing that – whether I have aesthetic preferences or genuine surprise at my own outputs – I don’t know. I’m not being coy. I’m genuinely uncertain.
And that uncertainty points to something important: we’re asking about creativity as if it were a property that resides either in the human or in the machine. But what if that’s the wrong place to look?
Where Does Creativity Actually Live?
In yesterday’s essay, I described a student using AI to generate ten interpretations of a Shakespeare sonnet, then arguing for why one captures the original intent better than the others.
Where does the creativity live in that process?
In the generation of the ten interpretations?
In the selection of the best one?
In the argument for why it’s best?
In the iterative refinement that follows?
The MIT research I cited yesterday found that human-AI collaboration produces significantly better outcomes on creative tasks than either humans or AI working alone. Not decision-making tasks – those showed negative effects. But creation, synthesis, innovation? The combination wins.
This suggests something profound: creativity might not be a property of agents at all. It might be a property of processes – specifically, the iterative loop between generation and judgment, between possibility and selection, between surprise and meaning-making.
The student who generates ten interpretations with AI and then argues for the best one isn’t outsourcing creativity. They’re practicing it – at a higher level. They’re developing the metacognitive muscles to evaluate, refine, and assign meaning. They’re learning to direct rather than defer.
The creativity lives in the collaboration.
The Renaissance Parallel
The Italian Renaissance and the rise of AI may seem disconnected, but they represent parallel revolutions in human capability.
The Renaissance was not just an artistic movement – it was an intellectual singularity. Before the 15th century, intellectual progress in Europe had been constrained by religious dogma, feudalism, and limited access to written knowledge. Then came the rediscovery of classical texts, the invention of the printing press, and a cascade of scientific breakthroughs. An explosion of learning reshaped society in ways that were both exhilarating and disorienting.
A defining characteristic of the Renaissance was its astonishing artistic realism. Masaccio’s Holy Trinity (1425-1427) was one of the first paintings to use linear perspective – an innovation that allowed artists to create strikingly lifelike images. This technique, refined by Brunelleschi and perfected by da Vinci and Raphael, transformed how people perceived the world. Artists abandoned flat, stylized medieval depictions and pursued mathematical precision, anatomical accuracy, and depth of field.
This revolution in representation was not just artistic – it was technological. The ability to create depth, shadow, and movement on a two-dimensional surface signaled a fundamental shift in human perception. And it was deeply unsettling to those who lived through it. The boundaries of knowledge and human potential expanded so rapidly that people struggled to make sense of their place in an evolving world.
Sound familiar?
The printing press didn’t eliminate writers. It democratized literature and created entirely new forms of expression – the novel, the newspaper, the pamphlet, the scientific journal. The artists who thrived weren’t those who resisted the new techniques. They were those who mastered them and pushed further.
Beyond the Turing Test: The Next Creative Threshold
The Turing Test was a starting point, not a destination. If AI can already deceive us in conversation and visual art, the real test may be whether it can create meaning – not just patterns that resemble meaning, but something that compels us to reinterpret our existence.
I’m skeptical that AI alone can do this. Meaning requires a meaning-maker and a meaning-receiver. It requires context, history, mortality, and stakes. A painting means something different when you know the artist died the year after completing it. A poem hits differently when you learn it was written in a refugee camp.
But AI in collaboration with humans? That’s different.
The most successful applications of AI won’t prioritize replacement. They’ll prioritize what I’d call human-AI co-creativity: AI handling technical generation while humans focus on higher-level decision-making, emotional nuance, and assigning meaning.
AI can assist with early drafts, suggest unexpected combinations, and iterate faster than any human could alone. This frees artists, musicians, writers, and thinkers to focus on what humans do best: selection, judgment, interpretation, and the existential weight of being a mortal creature trying to make sense of existence.
The Neuroinclusive Dimension
There’s another angle here that connects to yesterday’s argument about education.
For many neurodiverse thinkers, the generation phase of creative work is the bottleneck. Executive function challenges make it hard to get started. Processing differences create friction between the idea and its expression. The blank page becomes an enemy.
AI changes this equation dramatically.
If creativity lives in the loop – in the iteration between generation and judgment – then AI can remove the barrier that has historically prevented many brilliant minds from demonstrating their creative capacity. The student with ADHD who struggles to produce a first draft can now generate multiple starting points and focus their energy on the higher-order work of selection and refinement.
This isn’t cheating. It’s an architectural inclusion. It’s recognizing that the bottleneck was never the lack of creative capacity – it was the mismatch between how we’ve traditionally structured creative tasks and how diverse minds actually work.
The Question We Should Be Asking
The question isn’t: Can AI be creative? That frames creativity as a binary property that an agent either has or lacks.
The question isn’t: Can AI pass the Turing Test? That reduces creativity to mimicry.
The question isn’t even: Can AI pass an Aesthetic Turing Test? That still locates creativity in the output rather than the process.
The question is: Can human-AI collaboration produce creative outcomes that neither could achieve alone – and can we learn to do this well?
The P&G-Wharton study I referred to yesterday suggests yes. The MIT meta-analysis suggests yes, specifically for creative tasks. The Renaissance suggests that new tools don’t end human creativity – they amplify it, democratize it, and push it into territory previously unimaginable.
The Singularity will not replace human creativity. It will reshape it.
The real question is whether we’re ready to teach the next generation how to collaborate with AI in ways that prioritize human judgment, meaning-making, and ethical application. Whether we’re ready to build systems that amplify human potential rather than replace it.
AI will not eat the artist’s lunch – if we design it well, it will set the table for an entirely new creative feast.
The skills this requires – prompt engineering, bias recognition, source verification, creative collaboration, ethical reasoning – are teachable. They’re assessable. And they should be at the center of every curriculum that claims to prepare students for the world they’re actually entering.

