Before you look at the results make sure to do the test!
So, before I get to the numbers and reveals I'll reiterate a few things:
i) I hate that AI can do this.
ii) These authors write books - typically long series - so flash fiction is not their forte.
iii) Flash fiction is where AI does best - it starts to fall apart as the required work gets longer.
Some have questioned "why flash fiction"...
Answer: because you test things at breaking point. If I'm interested in what it takes to smash a window I don't throw 8 anvils at 8 windows and 8 ping-pong balls at 8 windows and say, "Welp, there you have it, anvils are 100% better than ping-pong balls."
In the first blog post 2 years ago the performance of the authors and AI overlapped but the authors did better on the whole. So my guess was correct, this is where the AI performance starts to falter.
In the second blog post we revist to see what has changed with 2 years development. I found the results interesting.
If I could have got a meaningful number of people to read 8 twenty-thousand-word novellas (I couldn't) and I could convince busy authors to write novellas for an experiment (I couldn't) then we would clearly get a 100% result in favour of the humans ... and ... have learned very little about the state of play.
The contributing authors have sold around 15 millions books between them. And they are...
Christian Cameron / Miles Cameron
& me!
In terms of ratings - when we did this 2 years ago, the scores were low. Six of the eight entries scored 3* or less. The people I can attract to vote in this like to read books. Short stories are unpopular in comparison, and the shorter they get the harder it is to tell a good tale. The ungenerous will suggest it is simply because none of the entries were great - I feel it's because book readers rate short stories lower than books.
Two years later, five of the eight entries scored 3* or above. You can consider the results below to see if that was because the humans did better, or the AI did better, or both.
I have a Ph.D student on my patreon who constantly berates me for my terrible diagrams. Here's another one, just for you, Rae!
We had 964 votes on the issue of whether story 1 was by a human or AI. This fell fairly smoothly to 474 votes on the rating of story 8.
Two of these were too close to be statistically significant, but in some cases the votes were quite certain. A sizeable majority of people thought my story was human authored and a sizeable majority thought Janny's story was AI authored. So it's not that people don't have strong opinions/instincts ... it's just that they're no more likely to be correct than tossing a coin.
I asked (a new session) of ChatGPT to guess which ones were AI and it didn't do a good job either, despite generating them.
And the scores on the doors?
And here the bad news is that the AI scored better than us. Not only was the highest rated story an AI one, but they scored higher on average too.
I asked the authors to do the test themselves. Only one has got back to me at time of posting.
That author made five guesses, four of which were wrong, and listed as their top two stories ... two AI generated ones...
Conclusion
First off, let me repeat my disclaimer about this not being a scientifically rigourous test.
Given that:
On the short scale it seems likely that people, on average, can't tell AI from human when it comes to fantasy writing.
If you got 6 right out of 8 ... well there's a ~15% chance of getting that result (or better) by chance, so rather than 15% of us patting ourselves on the back, we really we have to look to the bulk statistics for answers. And they don't look good.
In terms of enjoyment ... in this test the AI won.
Can AI generate a better book than Robin Hobb can write, absolutely not. Might it one day generate a book that would do better than one of hers in terms of sales and public acclaim? A few years ago I would have said 'absolutely not', at least in my lifetime. Now, it seems like a possibility, though hopefully an unlikely one (again - in my lifetime).
Should AI generate fiction, imagery, voices etc competing with artists in a number of fields and fooling the public. No, of course not. I hate that idea and most people do too.
Will it happen? It's already happening. Wherever anyone can circumvent skill and heart and just profiteer off a new technology, they're going to do it. People threaten people with knives in the street for a few dollars - are people going to try to sell you AI books ... of course.
I want AI to cure diseases. That's mostly it. But it looks like it was one of the belated escapees of Pandora's Box, and we're not going to be able to put it back.
Will I ever use AI to write anything (other than the bits of flash fiction in these tests). No.
Will I ever read AI fiction for pleasure. No. To quote someone wise: If nobody could be bothered to write this, why should I bother to read it?
It's a pretty grim outlook though, especially for new and future authors.
I had always felt that to write a great book that looked at human issues and offered insights, emotion, and enjoyment, would require an actual human, and that we wouldn't reach the point where a computer could do it any time soon.
I now wonder, if (and it's still a significant if) we get there ... will that mean that the AI is intelligent, alive in some sense, worthy of respect and rights? Will we have created an intelligent lifeform in lieu of going off into space and finding one? And is that a wise and/or moral thing to do?
It's a huge shock to me that fiction which, in this test, scores higher than great authors who write wonderful stories full of soul and heart and wit and intelligence, can be generated by the multiplication of a relatively small number of not particularly large matrices. On the face of it it undercuts so many things we value about being human.
There are many ways to argue against being too disheartened by this sort of thing. I advise you to seek them out. The future feels like a scary place right now, but I hope that, as far as the creative arts are concerned, AI runs up against a wall very soon and efforts are directed into doing tasks that benefit humanity rather than undermine it.