Apple study reveals major flaws in billion-dollar AI models

AI models still struggle with basic logical tasks

A new research paper from Apple has exposed serious shortcomings in the reasoning abilities of some of today’s most advanced artificial intelligence (AI) systems. Despite being marketed as powerful tools capable of solving complex problems, the study shows that these models still struggle with basic logical tasks, raising questions about the real capabilities of large language and reasoning models.

AI models fail child-level logic tests

Apple researchers evaluated several prominent generative AI systems, including ChatGPT, Claude, and DeepSeek, using classic problem-solving tasks. One of the tests was the well-known Tower of Hanoi puzzle, which requires moving discs across pegs while following specific rules.


Although the puzzle is simple enough for a bright child to solve, most of the AI models broke down as the number of discs grew. Accuracy fell below 80% with seven discs, and performance dropped even further with eight. According to co-lead author Iman Mirzadeh, the problem was not only that the models failed to solve the puzzle; they also could not follow a logical thought process even when given the solution algorithm.

“They fail to reason in a step-by-step, structured way,” he said, noting that the models’ approach was neither logical nor intelligent.
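
For reference, the textbook recursive solution to the Tower of Hanoi is only a few lines long. The sketch below is a standard version of that algorithm, not the prompt or procedure used in the Apple study; the names `hanoi_moves` and `is_valid_solution` and the peg labels "A", "B" and "C" are illustrative assumptions.

```python
def hanoi_moves(n, source="A", spare="B", target="C"):
    """Return the (from_peg, to_peg) moves that transfer n discs to target."""
    if n == 0:
        return []
    # Clear the top n-1 discs out of the way, onto the spare peg.
    moves = hanoi_moves(n - 1, source, target, spare)
    # Move the largest remaining disc straight to the target peg.
    moves.append((source, target))
    # Bring the n-1 smaller discs from the spare peg onto the target peg.
    moves += hanoi_moves(n - 1, spare, source, target)
    return moves


def is_valid_solution(n, moves, source="A", spare="B", target="C"):
    """Replay a move list and check it obeys the rules and solves the puzzle."""
    # Each peg is a stack; larger numbers are larger discs.
    pegs = {source: list(range(n, 0, -1)), spare: [], target: []}
    for src, dst in moves:
        if src not in pegs or dst not in pegs or not pegs[src]:
            return False  # unknown peg, or nothing to move
        if pegs[dst] and pegs[dst][-1] < pegs[src][-1]:
            return False  # a larger disc may not sit on a smaller one
        pegs[dst].append(pegs[src].pop())
    return len(pegs[target]) == n


if __name__ == "__main__":
    solution = hanoi_moves(7)
    print(len(solution), is_valid_solution(7, solution))
```

A checker like `is_valid_solution` is one plausible way to score a model's answer move by move, since a single illegal move anywhere in the sequence invalidates the whole solution.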

The myth of scaling exposed

The results challenge one of the AI industry’s most commonly held beliefs: that simply scaling models — making them larger and feeding them more data — will lead to better performance. Apple’s research provides strong evidence that this is not always true.

Gary Marcus, a well-known AI researcher and commentator, called the findings a reality check. Venture capitalist Josh Wolfe even coined a new verb, “to GaryMarcus”, meaning to critically debunk exaggerated claims about AI. The Apple study, Wolfe argued, had done exactly that by revealing the real limits of model reasoning.

Marcus has long argued that AI systems, particularly those based on neural networks, can only generalise within the data they’ve seen before. Once asked to work beyond that training distribution, they often break down — a pattern clearly confirmed in Apple’s tests.

AI is not yet a substitute for human logic

To be clear, even humans make errors on the more complex versions of the Tower of Hanoi. However, AI systems were supposed to improve on this, not replicate human flaws. As Marcus points out, artificial general intelligence (AGI) should combine human creativity with machine-level precision. But instead of outperforming people in logic and reliability, today’s large models still make basic errors.

Apple’s results also support concerns raised by Arizona State University’s Subbarao Kambhampati, who has cautioned against assuming AI models reason like humans. In reality, they often skip steps or fail to understand the underlying principles of a problem, despite producing convincing-sounding answers.

Caution urged for businesses and society

The implications are significant for businesses looking to integrate AI into their operations. While models such as GPT-4, Claude, and others perform well in areas like writing, coding, and brainstorming, they remain unreliable for high-stakes decision-making. As Marcus points out, these systems can’t yet outperform classical algorithms in areas like database management, protein folding, or strategic games like chess.

This unpredictability limits how much society can rely on generative AI. While the technology will continue to be useful in supporting human tasks, it is far from being a replacement for human judgement or traditional rule-based systems in critical contexts.

The illusion of intelligence

Perhaps most concerning is how easily these models can appear more capable than they are. If an AI performs well on an easy test, users may assume it can handle more complex problems too. But Apple’s study shows this confidence can be misplaced. The same model that solves a four-disc puzzle may completely fail when asked to solve one with eight.

This illusion of intelligence could lead to overtrust in AI systems — something experts warn must be avoided if the technology is to be used responsibly.
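
Part of the gap between a four-disc and an eight-disc puzzle is sheer length. The shortest solution for n discs takes 2^n - 1 moves, which is a standard result for the puzzle rather than a figure from the Apple paper. A quick sketch, with `min_moves` as an illustrative name:

```python
def min_moves(n):
    """Minimum number of moves needed to solve an n-disc Tower of Hanoi."""
    return 2 ** n - 1


for discs in (4, 7, 8):
    print(discs, min_moves(discs))
# prints:
# 4 15
# 7 127
# 8 255
```

An eight-disc solution is therefore 17 times longer than a four-disc one, so a model has to stay error-free across a far longer sequence of steps.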

Rethinking the future of AI

Despite the findings, Marcus remains optimistic about AI’s future, just not in its current form. He believes that hybrid approaches, combining classical logic with modern computing power, could eventually produce more reliable systems. But he is sceptical that current LLM-based systems are the answer.

The Apple paper shows that hype around generative AI has outpaced its real-world abilities. Until AI can reason in a consistent, logical manner — not just produce convincing text — it will remain limited in scope.

As researchers and developers reflect on these findings, one thing is clear: the path to truly intelligent machines will require more than just scaling up. It will demand smarter, better-designed models that prioritise reliability over illusion.
