Five Ways
Subscribe to my newsletter and get a free story!
Share this:

ai art

It's All About the Algorithms: My Take on AI Art

Four images generated on Canva's AI art tool, using the phrase "a pop art style cat rambo sitting at a desk writing"A day that I’ve been saying would arrive for about twenty years now is starting to loom on the timeline, and it’s taking a lot of smart people by surprise when it shouldn’t have.

I’m talking about AI (artificial intelligence) creations – art appearing in visual, auditory, and textual forms. Such creations are in the news lately because we’ve hit a point where what they’re creating is pretty sophisticated. Not sophisticated enough though (yet) – Clarkesworld Magazine just stopped taking submissions because of a sudden upsurge in AI-generated stories, none of them actually publishable. But the quality of that prose will improve and already people are talking about how to create systems to distinguish between a submission written by a human writer versus a machine-generated one.

Speaking as a former Microsoft employee and long-time technologist, I’m utterly unsurprised. In 2005, I wrote “Zeppelin Follies,” a story about a future “writer.” (You can find the story in my collection Near + Far if you want to read it in its entirety.) Here’s a section:

I forced a smile and patted Fitz’s shoulder. “Be ye of good cheer,” I said. “I think I’ve got that dialogue problem I was having licked.”

Fitz, as I well knew, hated getting drawn into the technicalities, so when I started to explain how reducing the adverbial modifier minimum downwards had tautened the syntactical delivery, he backed out pretty fast. I spent a few hours testing it out, and was pleased with the results. 90% of writing is putting together the formulas, so once I had this one, and a slight problem with the scenery equivalence parameters solved, I’d be sitting pretty, ready to generate a manuscript to hand over to Mikka the editor. Around three, I took a break and went out to sit in the Plaza.

In “Zeppelin Follies,” the writers don’t write. Instead they create the algorithms used to generate their fiction. Will there actually be a point where AIs can generate prose sufficiently adept to construct something that’s an entertaining read? Absolutely, and I would suspect that point is much closer than current writers would like to admit.

But I think the question that most people are deluding themselves about is this: will AI art reach the point where it touches the human soul, the way a Georgia O’Keefe painting can make you stand and stare or the way an Ursula K. Le Guin can make you stop and think, and perhaps even copy it into your notebook to ponder over later? I believe it will, because the consuming human soul remains a constant in that equation, and it doesn’t require another, second soul to be involved in creating the thing we’re appreciating: we can pause for a sunset, for a scrap of birdsong, or to admire the Fibonacci curve inside a conch shell. The experience of the aesthetic depends on the viewer perhaps more than the origin of the viewed.

We would like to think that there is something inside ourselves that recognizes “authenticity,” a word that is a little nebulous. What makes the words coming out of a biological entity’s mouth “authentic” in a way something created mechanically is not? Is it the intent behind the creation? Or something else? We would like to believe that we are more than biological machines, whose actions are on some level as predictable as those of the mechanical ones. We move in a cloud of delusion, in fact, thinking ourselves unique in this universe.

As far as the consumption of what is produced by machines versus what is produced by human hands goes, there are things we buy to use, and there are things we buy to enjoy. We usually don’t worry about the “authenticity” of the dishes we eat out of, but at a certain economic level, we may worry about it as a status symbol, a way to display affluence by using handmade rather than mass manufactured goods. And I don’t know that most people worry too much about the authenticity of what they enjoy, unless they are a connoisseur of it.

I used romance writing as my example in “Zeppelin Follies,” because romances are notoriously formulaic. But the truth is that every genre has its tropes, and that’s something that an AI can use.

Some artists have stopped putting work up online in order to keep it from being fed to artificial intelligences to use. I don’t know that will work all that well, but it’s worth thinking about. But art is also meant to be seen, music to be listened to, text to be read, and we cannot make it so humans are the only ones seeing, listening, and reading.

I think that one way writers will be able to survive a while is by holding onto the overarching ideas of their properties, and the things that make them distinctive and enjoyable. This is one reason why I plan to keep writing books about bioship You Sexy Thing and its crew, because I hold the rights to its world and character. But will AIs create new properties, new worlds? Beyond question, although they will be made of the fragments of other properties, recombined and reworked. Which is, I would argue, on some level what literature is about, replying to the stories that have come before.

Which makes me ask – will an AI be able to look, for example, at Chaucer’s Canterbury Tales and come up with something that is not a reworking, but an original thing that speaks to the Tales? That I’m not sure of. But I’m definitely looking forward to seeing what happens when one tries.

...

Creative AI vs. Creative Humanity: Guest Post by Laurence Raphael Brothers

AI is coming for your jobs, creatives! Or… it will be, eventually. Not this year. But coming soon.

Introduction
The last two years have seen surprising and indeed almost shocking advances in AI creative work. Image-generators like Dall-E, Midjourney, and Stable Diffusion that work from text prompts using diffusion models are capable of some amazingly high quality work at times, though admittedly their understanding(1) of fingers still leaves something to be desired. The even more astonishing ChatGPT can sometimes not just hold conversations that pass the Turing Test, but can also create working computer programs, compose poetry, and also write coherent short stories, apparently from scratch.

Some may reasonably argue the work of such AI systems isn’t creative at all, but is a mere regurgitation or combination of human efforts compiled from its training sets. But on the other hand, much of human art is based on pastiche and emulation, not to mention plagiarism, and both writers and artists have to spend years of training learning the techniques passed on by their instructors.

The State of the Art

Caption: Protagonist Bunny Häschen from my novel in progress. Generated by Stable Diffusion 2.1 using the text prompt “intersex bunny detective”. A generally rather appealing caricature, but note the odd left hand, the characteristic nonsense characters in the sign, the content oddity of them holding a sign at all, and the mysterious flap rising out of their cravat.
Source: Stable Diffusion

Recent developments are truly astonishing given previous struggles to get AI systems to create art, prose, and other works. It’s now possible to type a few words and get some art back in a minute that might have taken a human hours or even days to produce from scratch. The quality of the AI-generated art is quite variable; from one request to another you might get back commercially usable material or you might get worthless trash. But even considering that you may have to make quite a number of requests of the system to obtain a usable image, the results are still much faster and cheaper than using a human artist for the equivalent level of work.

It’s arguable that if created by a human, these AI-generated images would sometimes be direct plagiarism. Even when not clearly a direct copy of an existing work, it’s often the case that the AI system builders failed to obtain artist permission to incorporate their work into the system’s training data. The legal consequences of this failure will work themselves out in the courts, but for now there’s nothing stopping people from making use of these tools.

At present, complete high-quality AI art is generally unobtainable no matter how much time is taken playing with prompts, but on the one hand, commercial-grade art is often quite acceptable, and on the other hand, by using the AI’s art as a basis, a skilled digital artist can create a high-quality composite of human and AI art much more quickly than working from scratch. This, for example, is what Tor did with Christopher Paolini’s recent book cover.

In the world of prose, with suitable guidance ChatGPT can generate a coherent and consistent story, but not one that’s very high quality. Without substantial revision, the current quality of such text is typically mediocre, but at present it’s not inconceivable for some exceptional AI-generated story to make it past a slush reader’s quality threshold and at least be held for review. Quite the achievement considering the high level of competition many magazines impose on contributors. This is already motivating many AI-generated submissions, though it’s unclear if they are meant for prestige and profit for their pseudonymous submitters or to count coup. Clarkesworld, for example, has reported a recent spike in AI submissions despite their guidelines to the contrary.

The Future of the Art

Because these recent developments in AI art generation have seemed to come almost from nowhere, it’s hard to say how rapidly they will improve to match or even exceed human capabilities. We should keep in mind that autonomous cars were expected to be widely available by now on the basis of work done in the 2010s, and yet after a promising beginning, only slow progress has been made and such vehicles are still unsafe on real-world roads.

On the other hand, given the eagerness of many extremely well-funded tech groups to work in this area, including not just OpenAI but Microsoft and Google among others, it’s reasonable to expect further progress up to a point. What’s the limit? For current system architectures, I expect that limit is based on these systems’ lack of explicit real-world knowledge. Just as ChatGPT is capable of absolutely authoritative but totally incorrect and easily refutable statements, these systems will be hamstrung by the inability to reason until such capabilities are combined with their generative models.

Let’s establish three bars:

  1. Acceptable low-grade commercial art. The kind of graphic art that creative freelancers currently make to order for ad campaigns and other commercial applications without much funding, but which will still pay their rent. Equivalent prose would be acceptable for publication in some token and semipro magazines and for routine copywriting assignments.
  2. Acceptable high-grade commercial art. Commercial art produced for highly funded campaigns by major agencies and design firms, or prose and content that can be published in the better magazines and journals.
  3. Superior-to-human fine art. Masterpieces that would displace human efforts from galleries and museums and bookstore racks and win awards if AI and humans competed on a level playing field.

We’re currently just entering stage 1 for certain applications. This means that soon some semipro freelancers may experience serious competition from AI systems, and after a while some agencies and design firms may reduce their staff because their routine low-end work can be done by machine. There will still be plenty of human work involved in revising, incorporating, or compositing AI work into larger and higher quality artistic achievements, but the amount of human touch in the overall process will diminish.

At stage 2, entire industries will be turned upside down and the effects will shake up whole economies. For example, design firms may no longer require creatives, or else the few creatives they retain will largely be employed managing requests to AI systems. There will be no more need for human touch in photoshopping or otherwise compositing AI art with human-created elements as in stage 1; the AI will do all the work on demand. (Note that while some very high quality AI art has been created already, it typically requires considerable manual effort from a human to achieve such a level).

The good news is that my average-case guess for stage 2 is at least ten years, and it may well be generations before this level is achieved. The bad news is that I could be wrong and it could be next year. I was shocked by ChatGPT’s capabilities in 2022 after years of crappy chat bots that wouldn’t fool Turing for a minute, and so I may easily be wrong again. Still, I do think that without explicit real-world knowledge and reasoning abilities these tools will be unable to really excel for quite some time. At present no one has any idea how to give a generative AI system that kind of understanding.
At stage 3, humans start wondering why they should even bother creating art, especially if such superior work can be routinely requested of AI systems by anyone without incurring much cost.

Similar considerations apply for stage 3, which to my mind requires Artificial General Intelligence to achieve. Such a system is not absolutely out of the question. We just don’t know how to build one today, nor do we even have a good idea on a direction to follow to achieve such a goal.

Still, even stage 1 is problematic enough for human creatives. I’d guess full stage 1 will be achieved within five years, and whatever the courts decide about existing systems and their theft of artist work without permission, one way or another AI-generated art will become ubiquitous for inexpensive applications.

(1) Indeed, the reason that such systems don’t know how to draw fingers properly is they don’t even understand the concept of fingers. These systems don’t have old-fashioned knowledge bases that contain explicit facts or relations. The appearance of a hand is emergent from mysterious mathematical features derived automatically from the digitized training set. Discrete counts of things that characteristically appear in fixed numbers in nature but are often hidden from view in individual images can be problematic for such systems. Human images very frequently show two clearly identifiable arms and legs and so AI images get these right more often than not (not always however!) but hands in photographs less frequently show all five fingers clearly, and so generated images often don’t do so either.


BIO: Laurence Raphael Brothers is a writer and a technologist with five patents and a background in AI and Internet R&D. He has published over 50 short stories in such magazines as Nature, PodCastle, and Galaxy’s Edge. His noir urban fantasy novellas The Demons of Wall Street, The Demons of the Square Mile, and The Demons of Chiyoda are available from Mirror World Publishing, while his new standalone novel The World’s Shattered Shell has just been published by Water Dragon.
Pronouns: he/him.
Twitter: @lbrothers.
Mastodon: @laurence@petrous.vislae.town.
Website: https://laurencebrothers.com.

...

Guest Post: Liz Danforth on AI Art

Note: This originally appeared on Liz’s Patreon, which you can find here, and which I highly recommend.

Scrying the Future

The world will continue to argue the benefits, ethics, problems, and controversies about AI art and writing long past my lifetime, even if I live to be 100 years old. So I write this as a scene in media res, one drop in an ocean of ink and pixels already washing over us.

Please take it as a given that my opinions and understandings written here will also change over time, as I keep learning. And fair warning: this is a longform essay with no easy TLDR. I hope you’ll stick with me.

(more…)

...

Get Fiction in Your Mailbox Each Month

Want access to a lively community of writers and readers, free writing classes, co-working sessions, special speakers, weekly writing games, random pictures and MORE for as little as $2? Check out Cat’s Patreon campaign.

Want to get some new fiction? Support my Patreon campaign.
Want to get some new fiction? Support my Patreon campaign.

 

"(On the writing F&SF workshop) Wanted to crow and say thanks: the first story I wrote after taking your class was my very first sale. Coincidence? nah….thanks so much."

~K. Richardson
Skip to content