Can we trust LLM CALCULATIONS?

Farmdude@lemmy.world · 7 days ago

Can we trust LLM CALCULATIONS?

BilboBargains@lemmy.world · 4 days ago

Use Wolfram Alpha for mathematics

WolfLink@sh.itjust.works · 7 days ago

Just use Wolfram Alpha instead

gedaliyah@lemmy.world · 7 days ago

Here’s an interesting post that gives a pretty good quick summary of when an LLM may be a good tool.

Here’s one key:

Machine learning is amazing if:

The problem is too hard to write a rule-based system for or the requirements change sufficiently quickly that it isn’t worth writing such a thing and,

The value of a correct answer is much higher than the cost of an incorrect answer.

The second of these is really important.

So if your math problem is unsolvable by conventional tools, or sufficiently complex that designing an expression is more effort than the answer is worth… AND ALSO it’s more valuable to have an answer than it is to have a correct answer (there is no real cost for being wrong), THEN go ahead and trust it.

If it is important that the answer is correct, or if another tool can be used, then you’re better off without the LLM.

The bottom line is that the LLM is not making a calculation. It could end up with the right answer. Different models could end up with the same answer. It’s very unclear how much underlying technology is shared between models anyway.

For example, if the problem is something like, "Here is all of our sales data and market indicators for the past 5 years. Project how much of each product we should stock in the next quarter," then sure, an LLM may come appropriately close to a professional analysis.

If the problem is more like “Given these bridge schematics, what grade of steel do we need in the central pylon?” then, well, you are probably going to be testifying in front of Congress one day.

DeathByBigSad@sh.itjust.works · 7 days ago

Yes, with absolute certainty.

For example: 2 + 2 = 5

It’s absolutely correct, and if you dispute it, big bro is gonna have to re-educate you on that.

Farmdude@lemmy.world · 7 days ago

I NEED TO consult every LLM VIA TELEKINESIS QUANTUM ELECTRIC GRAVITY A AND B WAVE.

bunchberry@lemmy.world · 7 days ago

I’ve used LLMs quite a few times to find partial derivatives / gradient functions for me, and I know they’re correct because I plug them into a gradient descent algorithm and it works. I would never blindly trust anything an LLM gives me, no matter how advanced it is, but in this particular case I could actually test the output, since it was something I was implementing in an algorithm. If it didn’t work, I would know immediately.
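A minimal sketch of that kind of check, assuming NumPy is available. The objective `f` and the gradient `grad_f` are hypothetical stand-ins for whatever function and LLM-suggested derivative you actually have. It compares the suggested gradient against central finite differences and then confirms that gradient descent reaches the known minimum.

```python
import numpy as np

def f(w):
    # Hypothetical objective: a quadratic bowl with its minimum at (3, -1).
    return (w[0] - 3.0) ** 2 + 2.0 * (w[1] + 1.0) ** 2

def grad_f(w):
    # The gradient under test, e.g. one suggested by an LLM.
    return np.array([2.0 * (w[0] - 3.0), 4.0 * (w[1] + 1.0)])

def numerical_grad(func, w, eps=1e-6):
    # Central finite differences, one coordinate at a time.
    g = np.zeros_like(w, dtype=float)
    for i in range(w.size):
        step = np.zeros_like(w, dtype=float)
        step[i] = eps
        g[i] = (func(w + step) - func(w - step)) / (2.0 * eps)
    return g

def gradient_descent(grad, w0, lr=0.1, steps=200):
    # Plain fixed-step gradient descent.
    w = np.array(w0, dtype=float)
    for _ in range(steps):
        w = w - lr * grad(w)
    return w

w_test = np.array([0.5, 2.0])
grad_matches = np.allclose(grad_f(w_test), numerical_grad(f, w_test), atol=1e-4)
w_min = gradient_descent(grad_f, [0.0, 0.0])
print(grad_matches, w_min)
```

The finite-difference comparison is the stricter of the two checks: gradient descent can still converge with a slightly wrong gradient, so "it works" is good evidence but not proof.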

Farmdude@lemmy.world · 7 days ago

That’s rad, dude. I wish I knew how to do that. Hey, dude, I imagined a cosmological model that fits the data (Planck data) with two fewer parameters than the standard model. I’ve checked the numbers, but I don’t have the credentials. I need somebody to check it out. This is it, along with a verbal explanation of the model by Academia.edu. It’s way easier to listen first before looking. I don’t want recognition or anything, just for someone to review it. It’s a short paper. https://youtu.be/_l8SHVeua1Y

Farmdude@lemmy.world · 7 days ago

https://www.academia.edu/129622239/A_Resonant_Shell_Cosmology_A_Reflective_Dynamic_Boundary_as_an_Alternative_to_ΛCDM

AmericanEconomicThinkTank@lemmy.world · 7 days ago

Nope. Language models, by their inherent nature, cannot be used to calculate. Sure, theoretically you could have the input parsed, with proper training, to find specific variables, feed those to a database, and have that data mathematically transformed back into language data.

No LLM does actual math. They only produce the most likely output for a given input, based on their training data. Say I input: What is 1 plus 1?

Given that the model has most likely been trained repeatedly on that question being followed by 1 + 1 = 2, that will be the output. If it had been trained on data saying 1 + 1 = 5, then that would be the output.
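A deliberately crude sketch of that claim, as a toy only. Real LLMs are neural networks that generalise across inputs, not lookup tables, and the names `train` and `most_likely` are made up for this illustration. The point is just that "most likely continuation" follows the training data rather than arithmetic.

```python
from collections import Counter

def train(corpus):
    # Count which completion follows each prompt in the training text.
    table = {}
    for prompt, completion in corpus:
        table.setdefault(prompt, Counter())[completion] += 1
    return table

def most_likely(table, prompt):
    # Return the completion seen most often after this prompt.
    return table[prompt].most_common(1)[0][0]

# Two hypothetical training sets: one mostly right, one mostly wrong.
good = train([("1 + 1 =", "2")] * 9 + [("1 + 1 =", "5")])
bad = train([("1 + 1 =", "5")] * 9 + [("1 + 1 =", "2")])

print(most_likely(good, "1 + 1 ="))
print(most_likely(bad, "1 + 1 ="))
```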

qaz@lemmy.world · 7 days ago

Most LLMs now call functions in the background. Most calculations are just simple Python expressions.
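As a rough idea of what such a background calculator tool could look like, here is a sketch using only the standard library. This is an assumption about the general shape, not any vendor's actual implementation, and `safe_eval` is a hypothetical name. The model would emit the expression as text, and ordinary code would compute the result.

```python
import ast
import operator

# Only plain arithmetic operators are allowed.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Pow: operator.pow,
    ast.USub: operator.neg,
}

def safe_eval(expr):
    # Evaluate an arithmetic expression without running arbitrary code.
    def walk(node):
        if isinstance(node, ast.Expression):
            return walk(node.body)
        if (isinstance(node, ast.Constant)
                and isinstance(node.value, (int, float))
                and not isinstance(node.value, bool)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.operand))
        raise ValueError("unsupported expression")
    return walk(ast.parse(expr, mode="eval"))

print(safe_eval("(1 + 2) * 3 ** 2"))
```

The arithmetic here is done by the interpreter, so the remaining risk is the model writing the wrong expression, not miscomputing a correct one.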

Farmdude@lemmy.world · 7 days ago

Yes. I was aware of that, but I was manipulated by an analog device

Professorozone@lemmy.world · 7 days ago

Well, I wanted to know the answer and formula for future value of a present amount. The AI answer that came up was clear, concise, and thorough. I was impressed and put the formula into my spreadsheet. My answer did not match the AI answer. So I kept looking for what I did wrong. Finally I just put the value into a regular online calculator and it matched the answer my spreadsheet was returning.

So AI gave me the right equation and the wrong answer. But it did it in a very impressive way. This is why I think it’s important for AI to only be used as a tool and not a replacement for knowledge. You have to be able to understand how to check the results.
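For anyone wanting to run the same check, here is a small sketch. It assumes the formula in question was the standard compound-interest one, FV = PV × (1 + r)^n, which the comment does not actually state. The input figures and the `claimed` value are made up for illustration.

```python
def future_value(present_value, rate, periods):
    # Compound growth: FV = PV * (1 + r) ** n
    return present_value * (1.0 + rate) ** periods

# Hypothetical inputs: 1000 at 5% per period for 10 periods.
fv = future_value(1000.0, 0.05, 10)
print(round(fv, 2))  # 1628.89

# Compare a number quoted elsewhere (made-up value) against the computed one.
claimed = 1650.00
claim_is_consistent = abs(claimed - fv) < 0.01
print(claim_is_consistent)  # False
```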

msmc101@lemmy.blahaj.zone · 7 days ago

No, LLMs are designed to drive up user engagement and nothing else. They’re programmed to present what you want to hear, not actual facts. Plus, they’re straight up not designed to do math.

nylo@lemmy.dbzer0.com · 7 days ago

CanadaPlus@lemmy.sdf.org · 7 days ago

Maybe? I’d be looking all over for some convergent way to fuck it up, though.

If it’s just one model or the answers are only close, lol no.

√𝛂𝛋𝛆@piefed.world · 7 days ago

Never a base model, absolutely with an agent and function calling with a properly made tool and retrieval.

guy@piefed.social · 7 days ago

No lol. I don’t trust a calculator to write text for me, nor an autocomplete to solve math problems for me.

HubertManne@piefed.social · 7 days ago

For practice, yeah, as there is usually something you can do to verify the value. For study, no, as you would not learn shit.

Typewar@infosec.pub · 7 days ago

No because there is randomness involved

1rre@discuss.tchncs.de · 7 days ago

That’s why you ask 6 of them, and if they all come to the same conclusion then chances are it’s either right, or a common pitfall.
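The agreement check itself is easy to sketch. The `consensus` helper and the answer lists below are hypothetical, and collecting answers from the models is left out. Note that it can only tell you the models agree, not that they are right, which is exactly the "common pitfall" caveat.

```python
from collections import Counter

def consensus(answers, min_agree):
    # Return the most common answer if at least min_agree sources gave it, else None.
    value, count = Counter(answers).most_common(1)[0]
    return value if count >= min_agree else None

# Hypothetical answers from six models.
unanimous = ["42", "42", "42", "42", "42", "42"]
split = ["42", "42", "42", "42", "42", "41"]

print(consensus(unanimous, min_agree=6))
print(consensus(split, min_agree=6))
```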