US startup advertises ‘AI bully’ role to test patience of leading chatbots

A picture


Imagine a day at work where your main task is to pick a fight with a computer.No meetings, no emails – just you, a chair and a chatbot with the maddening tendency to think it has the cleverest mind in the room.The job title alone raises an eyebrow: “AI bully”.But this is precisely what a California startup called Memvid is offering: $800 to spend eight hours testing the patience and memory of artificial intelligence.“You’ll spend a full eight-hour day interacting with leading AI chatbots – and your only job is to be brutally honest about how frustrating they are,” the company’s job listing states.

The job requires no computer science degree or specialised AI skills.The only prerequisite is having an “extensive personal history of being let down by technology” – and the patience to ask the same question over and over again.“People constantly have to repeat themselves to chatbots.We wanted to turn that every day frustration into something visible,” said Memvid’s co-founder and CEO, Mohamed Omar.The role reads almost like a stress test for human temperament as much as machine intelligence: candidates are expected to keep the conversation going, revisit earlier topics and gently force the AI to admit when it has lost track – all while recording everything for analysis.

It is a far cry from coding or server management; this is conversation-driven detective work, following the trail of a chatbot’s mistakes as it forgets, fudges or hallucinates.Omar told Business Insider that the company considered this task as a way to highlight the persistent problem in many AI chatbots of systems losing context over time.“All the AI lives and breathes on memory.It’s the holy grail,” he said.“But the AI memory solutions that were in the market in 2024, when we started our business, were unreliable – meaning they would lose context and start hallucinating.

”The problem has only grown in subsequent years: a peer-reviewed paper, presented at the International Conference on Learning Representations (ICLR) in 2025, found that even leading commercial AI systems suffered a 30% to 60% drop in accuracy when asked to remember facts across sustained conversations, lagging well behind human performance.Omar added that one recent college graduate who applied for the job said they pay almost $300 a month for their AI subscriptions.He said the person wrote “a whole rant about how they’ve faced memory issues on every AI platform they’ve used”.He added: “A lot of people that are applying for this are knowledge workers who are using these products.”The root cause of the problem, as researchers and industry analysts have documented, is that companies have rushed to connect their AI tools to vast knowledge repositories, only to discover that retrieval-based systems can surface confident but incorrect answers faster than ever, with no reliable way to signal that they are doing so.

When AI systems are deployed in the real world at scale, this confident wrongness can cause serious harm: a Guardian investigation this week by the AI security lab Irregular found that when AI agents were given broad but benign tasks inside a simulated corporate environment, they bypassed safety controls, interacted with sensitive data and performed actions with the potential to be harmful without direct instructions.It is an issue the real world increasingly struggles with.Damien Charlotin, a French legal scholar, has tracked how the legal profession is experiencing a sharp increase in AI-driven legal hallucinations, reporting that while before spring 2025 there were roughly two incidents a week, by autumn that had risen to two or three a day.It is also an issue in healthcare.Earlier this month, the ECRI Institute placed “navigating the AI diagnostic dilemma” at the top of its annual list of the 10 greatest patient safety concerns for 2026, warning that AI diagnostic shortcomings risk reducing clinician vigilance, particularly where oversight frameworks are not yet established.

Omar has said he doesn’t have a deadline for accepting applications but expects to narrow down the right candidate within the next week or two,The “AI bully” experiment, although ostensibly playful, makes visible what users around the world are already encountering: that AI systems that are extremely capable in many ways can also be inconsistent and unreliable in others,The job pays $800 for a single day,But the costs of not doing it could be considerably higher,
cultureSee all
A picture

Jimmy Kimmel on Trump: ‘He uses his bones to feel things instead of his brain’

Late-night hosts on Monday discussed the Academy Awards, Maga’s incoherent statements on the Iran war and raised an eyebrow to Donald Trump’s claims of support from an anonymous former president.On Jimmy Kimmel Live, the host focused on Trump’s comments to the press in week three of the Iran war, or as Kimmel called it “Operation Epsteino Distracto”.On Truth Social, Trump wrote that it was a “great honour” to kill “scumbags” in Iran.“He’s been talking very tough for a guy who seems to almost be in a coma right now,” Kimmel said.“Even with all the killing he has been enjoying so much, he is very low energy lately,” the host continued

A picture

Carnivàle revisited: is this HBO’s strangest show?

Carnivàle premiered on HBO in 2003 and was cancelled after only two seasons. In the immediate aftermath, this decision was protested by the small but dedicated cult following the show had amassed (to the tune of 50,000 emails).But in the years since, as the television canon has expanded and the taste for mystery-box TV has waned, Carnivàle now seems little more than a minor curio in HBO’s ever-expanding back catalogue. So what is this curio about?Carnivàle follows the exploits of its titular carnival as they travel across the American dust bowl in the 1930s. At the beginning of the series, these nomadic showpeople pick up Ben Hawkins (Nick Stahl), an ex-con with a mysterious past (and inexplicable powers)

A picture

‘We kicked Bono’s arse’: how we made Atomic Kitten’s Whole Again (with a little help from Kraftwerk)

‘Kerry’s spoken verse needed 39 takes spread over several months because she’d had her tonsils out’People never believe me that Kraftwerk created Atomic Kitten. In 1996, my band OMD released Walking on the Milky Way, which I thought was one of the best songs I’d ever written. But in the age of Britpop, we were perceived as an 80s synthpop band, past our sell-by date. Radio 2 wouldn’t play the song and Woolworths wouldn’t stock it. I thought: “I’m functioning with one arm tied behind my back

A picture

Gatz review – the Great Gatsby performed in eight and a half hours of attentive, immersive joy

A man enters his office in the morning, finds his computer on the fritz and, after a few attempts to turn it on and off again, comes across a copy of F Scott Fitzgerald’s 1925 novel The Great Gatsby. So he starts to read and when his colleagues enter they find themselves taking on the characters, and soon the novel unfolds around us, word by word. The New York theatre company Elevator Repair Service has produced a work that is not quite adaptation – given it doesn’t really adapt the novel at all – but that is utterly transfixing nonetheless.Following a keen interest in non-dramatic texts, the company wanted to see what would happen when a powerful literary work was read and performed in its entirety. The result is both strange and strangely familiar

A picture

How to Make a Killing to Wu-Tang Clan: your complete entertainment guide to the week ahead

Glen Powell indulges in some murder most profitable, and the influential rap collective arrive in the UK complete with a clutch of peerless classicsHow to Make a KillingOut nowLoosely inspired by the much-loved Ealing comedy Kind Hearts and Coronets, here is a dark comedy that sees Glen Powell play an upwardly mobile schemer who isn’t afraid to murder his way to his inheritance. Directed by John Patton Ford (Emily the Criminal).Reminders of HimOut nowMaika Monroe (It Follows) stars as a woman who goes to prison following a car accident in which her boyfriend (Rudy Pankow) is killed. On release, she finds herself drawn to a handsome local bar owner (Tyriq Withers). Romance based on the bestselling Colleen Hoover novel

A picture

The Guide #234: Five big questions before the 2026 Oscars

Happy Oscars Eve eve to you all. The film industry’s glitziest night takes place on Sunday, at an ungodly hour for those of us covering it from the other side of the Atlantic. Coffee will be essential for anyone staying up, as will the Guardian’s annual liveblog, covering every last minute of the ceremony as well as its red carpet run-up. Head over to the homepage on Sunday evening for that, plus news and commentary on the night’s events.There’s plenty to read before that too: our annual Oscar hustings, making the case for each of this year’s best picture nominees (I sided with Sentimental Value); an interview with Academy top dog Bill Kramer; a piece on the increasingly toxic discourse around many of this year’s nominees; and Guardian film editor Catherine Shoard’s reader Q&A on this year’s race and the state of film in general