More fun with grok.
I have a friend group that goes back 40 years to childhood and we enjoy getting together and playing our childhood games.
In our group text, this resulted in a minor math question.
Being lazy, I tried to use grok to do what I thought would be a simple brute force calculation of a probability distribution.
The question: what is the probability distribution when 4 six sided dice are rolled, the lowest one is discarded, and the other three are summed?
Grok kept getting it wrong.
There are only 1296 possible combinations, and only 16 possible sums, so this should be easy to brute force for a computer.
First, I had grok check its work for one specific output and explain any discrepancies.
It checked, noticing it got 20/1296 one time and 10/1296 the other (both wrong) and concluded: “checks out!”
Serious hallucination.
I gave it more Boolean type instructions, gave it 3 specific outputs to re-check, told it to explain any discrepancies, and put it into DeepSearch mode.
Grok thought for 4 minutes (an eternity in computing time), couldn’t figure it out, gave up, and made excuses for why it couldn’t solve the problem.
It ran another time for over six minutes in DeepSearch, getting nowhere (the text scroll kept saying “that didn’t work”), so I cancelled the request.
Finally got a set of very Boolean instructions for brute forcing the problem (doing all the thinking for grok) which should’ve worked, but it told me I reached my message limit for the free version.
Why would I pay for AI that gets the wrong answers, hallucinates, and makes excuses?
Combinatorial problems like this blow up at scale, and can’t be solved by computers.
Grok was probably looking for a generalized solution applicable to larger scales.
I was playing with the problem with a spreadsheet, looking for a way to simplify it in a general way for arbitrarily large problems when a friend called who’s been a software engineer for 30 years.
He said it can actually get quite complex for a computer to figure out that problem, and AI is particularly poor at this task.
One of the guys in my group text is another math nerd (undergrad physics/graduate aeronautical engineering) confirmed that AI is terrible at math.
(On a side note, he is currently working in data management, at one time his company I believe was working for Special Operations Command, and currently works with a few QPs).
AI isn’t going to take all the jobs.
Why? AI isn’t actually capable of THINKING.
__________________
__________________
Waiting for the perfect moment is a fruitless endeavor.
Make a decision, and then make it the right one through your actions.
"Whoever watches the wind will not plant; whoever looks at the clouds will not reap." -Ecclesiastes 11:4 (NIV)
|