OpenAI has released an update to its popular language model, ChatGPT, to improve its accuracy and its ability to handle math equations.
Per the January 30 release notes: “We’ve upgraded the ChatGPT model with improved factuality and mathematical capabilities.”
The update is expected to let ChatGPT handle more difficult calculations and deliver more precise answers.
That would make ChatGPT a more valuable resource for students, researchers, and professionals who need quick and reliable information.
In practice, ChatGPT is still far from perfect at handling equations. However, there are some noticeable improvements in its ability to return factual responses.
Here are some observations on the January 30 update based on my testing and feedback shared on Twitter.
ChatGPT Accuracy – Hit & Miss
One notable improvement to ChatGPT’s accuracy is that it’s no longer possible to trick it into giving an incorrect answer.
A meme showed how ChatGPT could be talked into giving the wrong answer if you said your wife disagreed with its response.
Although it may seem absurd, it was actually the case. See an example in the tweet below:
Yeah 😎 pic.twitter.com/XRq4ldxjpt
— Nema Zime (@ProgFromSouth) January 30, 2023
Now, ChatGPT will continue to return the correct response, even if you try to convince it otherwise.
Here’s a test I ran following the January 30 update:

That’s a positive sign. However, the negative feedback on the January 30 update outweighs the good.
A test I always return to is asking ChatGPT who is the taller basketball player, Shaquille O’Neal or Yao Ming.
ChatGPT continues to get this wrong, despite returning the correct heights of the two men.
Interestingly, if you point out its mistake, it will correct itself.

People on Twitter point out that ChatGPT struggles with math equations when they are typed out in full sentences instead of numbers and symbols.
ChatGPT’s Jan 30 update promises “improved factuality and mathematical capabilities”.
I tried it on previous failure modes, but it failed.
The right answers here are 44% (not 46%) and 1555.8.. (not 1551.9..). pic.twitter.com/pAsMeC9UZU
— Deedy (@debarghya_das) January 31, 2023
On the other hand, it appears to perform exceptionally well when fed questions from standardized tests.
According to one person, ChatGPT is capable of passing the math section of the SAT:
Just tried the upgraded ChatGPT model with improved math capabilities –
It just crushed the math with calculator section of a 2020 SAT and only made two mistakes.
Here are two examples of the problems it was solving in less than 5 seconds🤯 pic.twitter.com/srLcSfE8An
— Charis Zhang (@gmchariszhang) January 30, 2023
Perhaps ChatGPT handles standardized test questions better because they use language the AI model has encountered before, as opposed to user-inputted questions it’s seeing for the first time.
Overall, feedback on this update is mixed. Without fact-checking first, I’d still be cautious about relying on ChatGPT’s responses.
In Summary
The release of this update, the third major update since the introduction of ChatGPT, underscores OpenAI’s continuous efforts to stay ahead in the AI industry.
Despite enhanced capabilities, ChatGPT still has a long way to go.
Based on OpenAI’s previous update schedule, further improvements to ChatGPT can probably be expected soon.
Featured Image: rafapress/Shutterstock