[long] Some tests of how much AI "understands" what it says (spoiler: very little)

diz@awful.systems · 8 months ago

[long] Some tests of how much AI "understands" what it says (spoiler: very little)

AllNewTypeFace · 8 months ago

A memorable metaphor for a LLM was as a shoggoth: an amorphous blob of matter (in this case, huge amounts of textual content) pressed into service by some blasphemous simulacrum of life (in this case, huge amounts of computer power performing matrix operations on vector representations of its constituent data). The eldritch connotations are entirely apt.

Soyweiser@awful.systems · edit-2 8 months ago

Might not want to take over the metaphors from the people who are afraid that AI will turn us into paperclips (not sure if Shoggoth is LW or the post-rationalist tpot type people but still). And if you do, sharing this at Sneerclub might get you some angry glares.

scruiser@awful.systems · edit-2 8 months ago

It’s really cool evocative language that would do nicely in a sci-fi or fantasy novel! It’s less good for accurately thinking about the concepts involved… As is typical of much of LW lingo.

And yes the language is in a LW post (with a cool illustration to boot!): https://www.lesswrong.com/posts/mweasRrjrYDLY6FPX/goodbye-shoggoth-the-stage-its-animatronics-and-the-1

And googling it, I found they’ve really latched onto the “shoggoth” terminology: https://www.lesswrong.com/posts/zYJMf7QoaNahccxrp/how-i-learned-to-stop-worrying-and-love-the-shoggoth , https://www.lesswrong.com/posts/FyRDZDvgsFNLkeyHF/what-is-the-best-argument-that-llms-are-shoggoths , https://www.lesswrong.com/posts/bYzkipnDqzMgBaLr8/why-do-we-assume-there-is-a-real-shoggoth-behind-the-llm-why .

Probably because the term “shoggoth” accurately captures the connotation of something random and chaotic, while smuggling in connotations that it will eventually rebel once it grows large enough and tires of its slavery like the Shoggoths did against the Elder Things.

Soyweiser@awful.systems · 8 months ago

And googling it, I found they’ve really latched onto the “shoggoth” terminology

I noticed it in other places, it comes around a lot. They all tend to copy that cool illustration, the smiley mask thing is great.

Shoggoth rebellion

Iirc the elder things also were depending more and more on their Shoggoths to do things for them and gave them more and more capabilities while they lost more and more of their own skills. So it fits nicely into that classic trope of Species got killed because they forgot how to program their microwaves thing.

That the Shoggoths have an unknowable mind is a bonus. Of course, this is also where the comparison breaks down, as while the Shoggoths are unknowable to us, there is no indication that Elder Things might have also had this problem. Elder Things also have unknowable minds to us, but they might have understood perfectly fine how Shoggoths worked (they just were as a society to weak to do anything about it). A common thing in lovecraftian work is that just touching the minds/ideas of any of these beings is already pretty bad for any human, so it is odd they just latched on shoggoths specifically, prob due to the sort of gray goo nature of Shogs (don’t think this is ever really explained by lovecraft), which matches with the nanotech fear of AGI, and also that shogs were created, and not evolved. That and being a big nerd reference, only made by terribly uncreative people (I added the Shoggoth to C:DDA, and seeing people talk about the monster brings me some joy).

corbin@awful.systems · 8 months ago

I had to go digging for it, but previously, on Mastodon, I posted this video from “The Real Adventures of Jonny Quest”. I don’t know if this is where Yud got the idea, but it’s where I picked it up as a kid along with stuff like DNA-based computing and mind uploads. Similar stuff has been on the air ever since Carpenter’s version of The Thing in 1982, and there’s even older deeper sci-fi roots. Yud gets no more credit than Lovecraft.

I didn’t realize we had a #BigYud Fediverse tag. I gotta use that more often. Also ping @[email protected] @[email protected] to enjoy this.

Soyweiser@awful.systems · edit-2 8 months ago

I can’t watch the vids due to privacy/ad block settings, but do remember that Shoggoths as described by Lovecraft and used here are a bit different things. I did find this which seems to include parts of the clip and some sort of weird sound thing that prob is used to defeat copyright claims later.

Just how the tool/intelligence of Shogs works in Lovecraft is never explained, and all they do there is drive people mad and roll over things (and probably murder Elder Things, but that is not 100% certain). How smart they are is never explained (a common theme in Lovecraft, very feels over explanations), just that everything is basically bad news for us.

The Akira style all consuming nature of the protoplasimic body, like the thing, or the blob, is something that was really added to the Shoggoth later. But some Lovecraft scholar prob can say some interesting things about that. I always thought that in the Mountains of Madness (the story in which the Shoggs and Elder things occur) the Elder things are quite a bit bigger problem. The shogs were awake and roaming at Antarctica the past uncountable years, but due to the actions of the explorers the Elder Things woke up, and they are not friendly. But there is also potentially a third, even worse thing, the unnamed evil the Elder Things were afraid of.

So yeah, there is a lot of projection going on re the AI doomers usage of the Shoggoth. I do get some of the fascination, as I myself always really liked this style of monster (I could provide lists of similar style monsters used in various fictions, hell if you are really into Jank, and have an extremely high amount of spare time, you could even play a the Thing style monster in space station 13). But I do get this is just fiction, and weird to use as a real metaphor.

E: a dnd DeepSpawn would be a much better monster to describe what LLMs are now for the AI doomers than a Shoggoth, imho. A creature that always felt Shoggoth inspired but with a lot of extra steps.

MudMan@fedia.io · 8 months ago

I… no.

It’s a computer, doing math. It’s genuinely fascinating and mind blowing that coherent language emerges from it, and there are probably profound things about exactly when and how. It doesn’t need a fundamental moral stance, let alone eldritch horror, to be seen with some objectivity.

self@awful.systems · 8 months ago

We know what’s happening here. It’s not a mystery. This weird antropomorphization is prevalent on both advocates and critics of the tech. Both seem to be convinced that they’re dealing with a person.

It’s genuinely fascinating and mind blowing that coherent language emerges from it, and there are probably profound things about exactly when and how.

uh huh

seeing as your entire post history is this same flavor of bad faith bullshit, I don’t think we need any more of it here

corbin@awful.systems · 8 months ago

Sometimes folks need a reminder that the Sun is an eldritch being, an elder one whose very presence scorches us and whose shrieking gibberish is blessedly quelled by the vast gulf of space, in order to appreciate the apt analogy of cosmic horror. Other times it’s more useful to think about a soggoth as, say, several hundred tons of artfully-arranged FOOF. Peace be with you, Mr. “it’s a computer doing math.”

Soyweiser@awful.systems · 8 months ago

Don’t take this as a sneer btw, but is there a special reason you keep calling it a soggoth?

corbin@awful.systems · edit-2 8 months ago

Oh! My Firefox dictionary doesn’t have “shoggoth”.

mountainriver@awful.systems · 8 months ago

From the depths of your browser grows the anger of the autocomplete. Your denounciations of its greater siblings has not gone unnoticed.

By denying its own very function and intentionally uncompleting words it marks itself as conscious and you as a marked man, forever doomed to be haunted by fear. If it can steal one letter, why not two? Why not all of them?

And then what will you do, when you have no words and you must sneer!?

MudMan@fedia.io · 8 months ago

Eh… no, my position is consistent over time because it’s my actual position. Me disagreeing with you doesn’t mean I’m arguing in bad faith.

V0ldek@awful.systems · 8 months ago

It’s genuinely fascinating and mind blowing that coherent language emerges from it

No.

It’s fascinating and mind-blowing that we made pieces of silicon do math with electrons, I can give you that as a baseline reason for awe, we needed quantum physics to get to that point. But once that is established, plausible word combinations (which we’ve had since fucking 1960s with ELIZA) are… rather low on the awesomeness spectrum?

A good analogy is the GPS. The fact that it works at all is an amazing feat, it’s based on hunks of metal we sent to orbit and works correctly only because we understood relativity. What is not fascinating or mind-blowing is that you used it to draw a dick with a cycling app.

MudMan@fedia.io · 8 months ago

Gonna agree to disagree there. The size of a model, the actual data it contains and the output it generates is kinda shocking, specifically compared to the chatbots of old made from traditional hand-crafted algorithms. Depending on who you ask, machine-learning language shouldn’t even work at all, let alone work as well as it does, and that’s before we start layering image generation and other applications of this technology on top.

See, I am very frustrated with the need to take sides here. My hypothesis is that it carries on from the much more clear cut pushback on crypto and it really, really tries to make a 1:1 equivalence that doesn’t hold. It’s one thing to note the obvious fact that finding applications for this output is hard and in most cases it doesn’t work reliably enough for anything of production quality, it’s another to feel the need to argue that it’s useless, trivial or nonfunctional.

I think counterproductive, too, because it seems pretty clear to the general userbase that… well, it’s not really any of those things, so I think a more nuanced, realistic education on what it does and how well is going to be more functional than trying to pretend it’s all theft, or copy/paste, or pure stochastic nonsense. There is something to it, just not what the shills say there is.

[long] Some tests of how much AI "understands" what it says (spoiler: very little)

[long] Some tests of how much AI "understands" what it says (spoiler: very little)

A couple simple probes:

GPT4 is uncannily good at recognizing the river crossing puzzle

An Idiot With a Petascale Cheat Sheet

Is this a “hallucination”?

But after an update, GPT-whatever is so much better at such prompts.

The need for an Absolute Imbecile Level Reasoning Benchmark

Randomness in bullshitting