Google DeepMind claims ‘historic’ AI breakthrough in problem solving

Google DeepMind claims it has made a “historic” artificial intelligence breakthrough akin to the Deep Blue computer defeating Garry Kasparov at chess in 1997 and an AI beating a human Go champion in 2016. A version of the company’s Gemini 2.5 AI model solved a complex real-world problem that stumped human computer programmers to become the first AI model to win a gold medal at an international programming competition held earlier this month in Azerbaijan. In a performance that the tech company called a “profound leap in abstract problem-solving”, it took less than half an hour to work out how to weigh up an infinite number of possibilities in order to send a liquid through a network of ducts to a set of interconnected reservoirs. The goal was to distribute it as quickly as possible.

None of the human teams, including the top performers from universities in Russia, China and Japan, got it right. The model failed two of the 12 tasks it was set, but its overall performance ranked it in second place out of 139 of the world’s strongest college-level computer programmers. Google said it was a “historic moment, towards AGI [artificial general intelligence]”, which is widely considered human-level intelligence at a wide range of tasks. “For me it’s a moment that is equivalent to Deep Blue for Chess and AlphaGo for Go,” said Quoc Le, Google DeepMind’s vice-president. “Even bigger, it is reasoning more towards the real world, not just a constrained environment [like Chess and Go] … Because of that I think this advance has the potential to transform many scientific and engineering disciplines.” He cited drug and chip design.

The model is a general-purpose AI but was specially trained to solve very hard coding, maths and reasoning problems. It performed “as well as a top 20 coder in the world”, Google said. “Solving complex tasks at these competitions requires deep abstract reasoning, creativity, the ability to synthesise novel solutions to problems never seen before and a genuine spark of ingenuity,” the company said. Speaking before the details were made public, Stuart Russell, a professor of computer science at the University of California at Berkeley, said the “claims of epochal significance seem overblown”.

He said AI systems had been doing well on programming tasks for a while and the Deep Blue chess breakthrough had “essentially no impact on the real world of applied AI”. However, he said “to get an ICPC question right, the code actually has to work correctly (at least on a finite number of test cases), so this performance may show progress towards making AI-based coding systems sufficiently accurate for producing high-quality code”. He added: “The pressure on AI companies to keep claiming breakthroughs is enormous”. Michael Wooldridge, Ashall professor of the foundations of artificial intelligence at the University of Oxford, said it sounded like an impressive achievement and “being able to solve problems at this level is exciting”. But he questioned how much computing power was needed.

Google declined to say, apart from confirming it was more than is available to an average subscriber to its $250-a-month Google AI Ultra service using the lightweight version of Gemini 2.5 Deep Think in the Gemini app. Dr Bill Poucher, executive director of the ICPC, said: “Gemini successfully joining this arena, and achieving gold-level results, marks a key moment in defining the AI tools and academic standards needed for the next generation.”

Frank Rosenblatt, an academic at Cornell University, worked out that it should be possible to create a “perceiving and recognising automaton”. He named it the Perceptron and said an electronic system would be able to learn to recognise patterns in optical, electrical or tonal information “in a manner which may be closely analogous to the perceptual process of a biological brain”.

The following year he built the device, which was the size of a small room. It was considered one of the early breakthroughs in artificial intelligence based on neural networks.

In May 1997, IBM’s Deep Blue became the first computer system to defeat a reigning world chess champion in a match under standard tournament controls. It beat Garry Kasparov in what became an inflection point in computing power, but the contest was close. Kasparov won the first game and Deep Blue the second, followed by three draws.

Deep Blue won game 6 to secure the win. It showed how brute force computing power could create a system to defeat a human, albeit at a narrow task. “The computer is far stronger than anybody expected,” said Kasparov, conceding defeat.

Go is one of the most complex games ever devised, and one of the world’s master players was Lee Sedol, a South Korean professional. In 2016, DeepMind, the UK AI company set up by Demis Hassabis, took him on with its computer AlphaGo.

It won 4-1 and some of its moves seemed to display truly original thinking. Move 37 in particular went down in lore. Hassabis said: “It might be the first glimpse of a bright and bold future where humanity harnesses AI as a powerful new tool, helping us discover new knowledge that can solve some of our most pressing scientific problems.”

Another breakthrough by Hassabis and DeepMind was an AI program that can predict how proteins fold into 3D shapes, a highly complex process fundamental to understanding life’s biological machinery. The Royal Society, the 360-year-old London scientific institution, called it “a stunning advance”.

When researchers know how a protein folds up, they can start to uncover mysteries such as how insulin controls sugar levels in the blood or how antibodies fight viruses. After further iterations, the system helped Hassabis and his colleague John Jumper share a Nobel prize for chemistry in 2024.

Lucy Powell hits out at ‘sexist’ talk that she is Labour proxy for Andy Burnham

Lucy Powell has hit out at the “sexist” framing of her deputy Labour leadership campaign, with people claiming she and her rival, Bridget Phillipson, are standing as “proxies” for two men. With the contest to replace Angela Rayner under way this week, the pair have been forced to contend with political rumours that they are stalking horses for a future leadership battle. It has been heavily speculated that Andy Burnham, a longtime ally of Powell, is among the senior Labour figures eyeing up a leadership challenge if the prime minister’s recent turmoil continues. Phillipson, meanwhile, is seen as a Keir Starmer loyalist. Powell, the MP for Manchester Central and former cabinet minister, lost her role as leader of the Commons in the recent reshuffle

Labour must rethink growth strategy to curb rise of far right, says top economist

Defeating far-right populism will require Labour to radically overhaul its “arid” approach to raising living standards in left-behind communities, the former Bank of England chief economist has said. Andy Haldane warned that Labour’s growth plans were failing to support parts of the country where voters feel neglected and disenfranchised. With ministers under pressure to respond to a summer of unrest, he said the “single most important thing” Keir Starmer’s government could do was to rethink its economic approach before the autumn budget. He said: “We need a story of growth that isn’t aridly told from 30,000 feet, but speaks to the lived experience and to the prospects and opportunities of workers in the everyday economy. “A sense of people progressing in their lives, of being invested in, is the absolute foundation stone of curbing disaffection with the incumbent parties – and therefore doing something to turn the tide of populism

France proposes ceiling on value of UK components in €150bn EU defence fund

France has proposed limiting the use of British-produced military components in the EU’s €150bn defence fund, in a move that could complicate negotiations over the UK’s entry into the scheme. Four diplomatic sources told the Guardian that French officials had proposed a 50% ceiling on the value of UK components in projects financed through the EU’s €150bn Security Action for Europe (Safe) fund. The €150bn loans scheme is part of the EU’s drive to boost defence spending by €800bn and re-arm the continent. The European commission president, Ursula von der Leyen, lauded the scheme on Tuesday, telling an audience of policymakers in Brussels that the commission had assigned loans to member states in less than six months since the idea was first mooted – “the sense of urgency we need”. The door to greater UK participation was pushed open in May when Keir Starmer and von der Leyen signed an EU-UK security and defence partnership

Plan to slash US steel tariffs shelved hours before Donald Trump’s UK visit

A long-coveted deal to slash US steel and aluminium tariffs to zero has been shelved on the eve of Donald Trump’s state visit to Britain, the Guardian has learned. Ministers were poised to finalise a deal this week that would have reduced Trump’s tariffs on British steel to zero, according to government officials. But that deal has been put on ice hours before the US president’s arrival in the UK, in what steel industry figures privately described as a major blow. A government source said the deal would have secured 0% tariffs on just a small quota of British steel exports, prolonging uncertainty for the industry. Instead, ministers are seeking to agree a permanent “guarantee” that US tariffs on British steel will not go above 25%

Two British MPs ‘denied entry’ into Israel during official West Bank visit

Two British MPs travelling as part of a parliamentary delegation to the occupied West Bank have said they were denied entry into Israel. Labour politicians Simon Opher and Peter Prinsley were travelling as part of a group that was due to meet British diplomats in Jerusalem this week, in addition to Palestinian and Israeli human rights organisations. Opher’s office said in a statement on Tuesday that the purpose of the visit, organised by the Council for Arab-British Understanding, was to “enable members of parliament to witness the vital medical and humanitarian work of a range of organisations including Medical Aid for Palestinians (MAP) in the occupied West Bank.” It added: “It is deeply regrettable that Israeli authorities prevented them from seeing first-hand the grave challenges facing medical facilities in the region and from hearing the British government’s assessment of the situation on the ground.” Opher, the MP for Stroud and chair of the all-party parliamentary group for health, has returned to the UK from Jordan

New headache for Rachel Reeves as OBR expected to lower productivity forecast

The Office for Budget Responsibility is expected to downgrade its key productivity forecast, the Guardian understands, setting Rachel Reeves on course to break her fiscal rules without significant action in the budget. The government’s independent watchdog has carried out a “stocktake” of its forecast models over the summer, and Treasury officials privately acknowledge the result will inevitably be a weaker growth outlook. One Treasury source said they expected the OBR to “kitchen sink it” – making a significant downward revision to productivity forecasts in one go rather than taking a more piecemeal approach. Reeves will respond by pointing to the long-term weakness of productivity in the UK economy and promising to tackle it with a programme of investment. The consultancy Oxford Economics, however, estimates that moving the OBR’s productivity forecast back in line with the less optimistic independent average projection would knock 1