Model degeneration is an already well-known phenomenon. The article already explains well what's going on so I won't go into details, but note how this happens because the model does not understand what it is outputting - it's looking for patterns, not for the meaning conveyed by said patterns.
Frankly at this rate might as well go with a neuro-symbolic approach.
I know you are, but the argument that an LLM doesn't understand context is incorrect. It's not human level understanding, but it's been demonstrated that they do have a level of understanding.
And to be clear, I'm not talking about consciousness or sapience.
I know you are, but the argument that an LLM doesn’t understand context is incorrect
Emphasis mine. I am talking about the textual output. I am not talking about context.
It’s not human level understanding
Additionally, your obnoxiously insistent comparison between LLMs and human beings boils down to a red herring.
Not wasting my time further with you.
[For others who might be reading this: sorry for the blatantly rude tone but I got little to no patience towards people who distort what others say, like the one above.]
I got little to no patience towards people who distort what others say,
My original reply was meant to be tongue-in-cheek, but I guess I forgot about Poe's law. I'm not a layman, for the record. I've worked with AI for over a decade
A better mathematical system of storing words does not mean the LLM understands any of them. It just has a model that represents the relation between words that it uses.
If I put 10 minus 8 into my calculator I get 2. The calculator doesn't actually understand what 2 means, or what subtracting represents, it just runs the commands that gives the appropriate output.
That's a bad analogy, because the calculator wasn't trained using an artificial neural network literally designed by studying biological brains (aka biological neutral networks).
And "understand" doesn't equate to consciousness or sapience. For example, it is entirely and factually correct to state that an LLM is capable of reasoning. That's not even up for debate. The accuracy of an LLM's reasoning capability is one of the fundamental benchmarks used for evaluating its quality.
But that doesn't mean it's "thinking" in the way most people consider.
Edit: anyone up voting this CileTheSane clown is in the same boat of not comprehending how LLMs work.
it is entirely and factually correct to state that an LLM is capable of reasoning
Citation needed.
If you're going to tell me LLMs are modeled after biological brains and capable of reasoning then I call bullshit on your claims that you actually work in AI.
Imagine you put a man in an enclosed room. There is a slot in the wall where messages get passed through written in Chinese. The man does not speak Chinese or even recognize the written language, he just thinks they're weird symbols.
First the man is shown examples of sequences of symbols to train him. Then he is shown incomplete sequences and asked which symbol comes next. If incorrect he is corrected, if correct he gets cookie. Eventually this man is able to carry on "conversations" with people in Chinese through continued practice.
This man still does not speak Chinese, he is not having reasoned, rational arguments with the people he is conversing with, and if you told him it was a language he's look at you like your crazy. "There's no language here, just if I have these symbols and I next put the one that looks like a man wearing a hat they give me a cookie."
Thinking LLMs are capable of reasoning is the digital equivalent of putting eyes on a pencil then feeling bad when it gets broken in half.
In machine learning, a neural network (also artificial neural network or neural net, abbreviated ANN or NN) is a model inspired by the structure and function of biological neural networks in animal brains
Thinking LLMs are capable of reasoning is the digital equivalent of putting eyes on a pencil then feeling bad when it gets broken in half.
In this paper, we present Reasoning via Planning (RAP), a novel LLM reasoning framework that equips LLMs with an ability to reason akin to human-like strategic planning
Someone trying to sell their LLM to the general public, and therefore simplifying the language to convey a concept is not a source.
These nodes pass data to each other, just like how in a brain, neurons pass electrical impulses to each other.
By that definition my dimmer switch functions like a biological brain because it passes electrical impulses.
In this paper, we present Reasoning via Planning (RAP), a novel LLM reasoning framework that equips LLMs with an ability to reason akin to human-like strategic planning
This prevents LLMs from performing deliber-
ate planning akin to human brains,
So does not function like a brain does.
To overcome the limitations, we propose a new LLM reasoning framework
So it's a proposal for a new framework to mimic it, not how LLMs currently function
Aaand I'm going to stop checking your sources now. If you're just going to gish gallop every link from a search page you think agrees with you I'm not going to waste my time reading things you clearly didn't bother to. It took 5 links to get to something that even looks like a source, and it doesn't say what you think it does.
Read your sources and make sure they say what you think they do. If you present me with another pile of links and the first one is invalid I won't bother looking at the 2nd.
I'm autistic and sometimes I feel like an ai bot spewing out garbage in social situations. If I do what people normally do and make it sound believable, maybe no one will notice.