• x00z@lemmy.world · 40 points · 3 days ago

    Not to be that guy, but the image with all the train tracks might just be doing its job perfectly.

    • merc@sh.itjust.works · 1 point · 5 hours ago

      That’s the problem. Maybe it is.

      Maybe the code the AI wrote works perfectly. Maybe it just looks like how perfectly working code is supposed to look, but doesn’t actually do what it’s supposed to do.

      To get to the train tracks on the right, you would normally have dozens of engineers working over probably decades, learning how the old system worked and adding to it. If you’re a new engineer and you have to work on it, you might be able to talk to the people who worked on it before you and find out how their design was supposed to work. There may be notes or designs generated as they worked on it. And so on.

      It might take you months to fully understand the system, but whenever there’s something confusing you can find someone and ask questions like “Where did you…?” and “How does it…?” and “When does this…?”

      Now, imagine you work at a railroad and show up to work one day and there’s this whole mess in front of you that was laid down overnight by some magic railroad-laying machine. Along with a certificate the machine printed that says that the design works. You can’t ask the machine any questions about what it did. Or, maybe you can ask questions, but those questions are pretty useless because the machine isn’t designed to remember what it did (although it might lie to you and claim that it remembers what it did).

      So, what do you do, just start running trains through those tracks, assured that the machine probably got things right? Or, do you start trying to understand every possible path through those tracks from first principles?

    • dustyData@lemmy.world · 6 points · 2 days ago

      It gives you the picture on the right when you asked for a single straight track in the prompt. Now you have to spend 10 hours debugging code and fixing hallucinations of functions that don’t exist in libraries it doesn’t even need to import.

      • Simmy@lemmygrad.ml · 1 point · 2 days ago

        Not a developer. I just wonder how AI hallucinations come about. Is it the ‘need’ to complete the task requested at the cost of being wrong?

        • zlatko@programming.dev · 1 point · 2 days ago

          No, it’s just that it doesn’t know if it’s right or wrong.

          How “AI” learns is it goes through text - say a blog post - and turns it all into numbers. E.g. the word “blog” is 5383825526283 and the word “post” is 5611004646463. Over a huge amount of text, a pattern emerges: the second number almost always follows the first number. Basically statistics. And it does that for all the words and word combinations it finds - an immense amount of text is needed to find all those patterns. (Fun fact: that’s why companies like OpenAI, which makes ChatGPT, need hundreds of millions of dollars to “train the model” - they need enough computing power, storage and memory to read the whole damn internet.)


          So how do LLMs “understand”? They don’t. It’s just a bunch of numbers and statistics about which word (turned into a number, or “token” to be more precise) follows which other word.
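          (If you like code, here’s a toy Python sketch of that “which word follows which word” counting idea. It is not how real LLMs are built - they use tokens and neural networks rather than a table of word pairs - but the principle of predicting the next word from statistics is the same.)

          ```python
          # Toy next-word "model": count which word follows which word in some text,
          # then predict the next word by picking the most common follower.
          # Real LLMs learn these statistics over billions of tokens with a neural
          # network, not a counting table - this only illustrates the idea.
          from collections import Counter, defaultdict

          text = "the cat sat on the mat and the cat slept on the mat"
          words = text.split()

          followers = defaultdict(Counter)
          for current, nxt in zip(words, words[1:]):
              followers[current][nxt] += 1

          print(followers["the"].most_common())  # [('cat', 2), ('mat', 2)]

          def predict_next(word):
              # pick the statistically most frequent follower - no understanding involved
              return followers[word].most_common(1)[0][0]

          print(predict_next("cat"))  # 'sat' - seen once, same as 'slept'
          ```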


          So now: why do they hallucinate?

          When they get your question, they turn all the words in your prompt into numbers again, and then use the statistical patterns they learned to find which words are likely to follow your words.

          They add in a tiny bit of randomness - sometimes replacing the “closest” match with a synonym or a less likely one - so the output seems more real.

          They also add “weights” so the model would rather pick one phrase over another, or give some topics a very, very small likelihood - think pornography or something. That’s “tweaking the model”.
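          (If it helps to see it, here’s a tiny Python sketch of that randomness-and-weights part. The probabilities are made up and real models pick from thousands of candidate tokens, not three words, but the “temperature” knob works roughly like this.)

          ```python
          # Toy sketch of sampling the next word with a "temperature" knob.
          # Made-up probabilities; real models score thousands of candidate tokens.
          import math
          import random

          next_word_probs = {"station": 0.7, "platform": 0.2, "moon": 0.1}

          def sample_next(probs, temperature=1.0):
              # low temperature -> almost always the most likely word;
              # high temperature -> more randomness, more chance of going off the rails
              weights = [math.exp(math.log(p) / temperature) for p in probs.values()]
              total = sum(weights)
              return random.choices(list(probs), weights=[w / total for w in weights])[0]

          print(sample_next(next_word_probs, temperature=0.2))  # almost always "station"
          print(sample_next(next_word_probs, temperature=2.0))  # "moon" shows up much more often
          ```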

          But there’s no knowledge as such; mostly it’s statistics and dice rolling.

          So a hallucination isn’t “wrong” to the model - it’s just words that are statistically likely to follow the words in your prompt.

          Did that help?

        • send_me_your_ink@lemmynsfw.com · 2 points · 2 days ago

          Full disclosure - my background is in operations (think IT), not AI research, so some of this might be wrong.

          What’s marketed as AI is something called a large language model. This distinction is important because AI implies intelligence, whereas an LLM is something else. At a high level, LLMs use something called “tokens” to break natural language apart into elements a machine can understand, and then recombine those tokens to “create” something new. When an LLM is creating output it does not know what it is saying - it only knows which token statistically comes after the token(s) it has generated already.

          So to answer your question: an AI can hallucinate because it does not know the answer - it’s using advanced math to know that the period goes at the end of the sentence, and not in the middle.
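          (To make the “tokens” part concrete, here’s a small Python example using OpenAI’s open-source tiktoken tokenizer library - assuming you have it installed - that shows a sentence being split into the numbers a model actually works with. The exact numbers depend on which tokenizer you pick.)

          ```python
          # Show what "tokens" look like: text broken into numeric pieces.
          # Requires: pip install tiktoken (OpenAI's open-source tokenizer library).
          import tiktoken

          enc = tiktoken.get_encoding("cl100k_base")  # tokenizer used by GPT-3.5/4-era models

          tokens = enc.encode("The model does not know what it is saying.")
          print(tokens)                              # a list of integers (the token IDs)
          print([enc.decode([t]) for t in tokens])   # the text piece each number stands for
          ```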