‘Vibe hacking’ puts chatbots to work for cybercriminals

September 2, 2025

The GIST 'Vibe hacking' puts chatbots to work for cybercriminals

Andrew Zinin

lead editor

Editors' notes

This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

reputable news agency

proofread

OpenAI in June revealed a case of ChatGPT assisting a user in developing malicious software.

The potential abuse of consumer AI tools is raising concerns, with budding cybercriminals apparently able to trick coding chatbots into giving them a leg-up in producing malicious programs.

So-called "vibe hacking"—a twist on the more positive "vibe coding" that generative AI tools supposedly enable those without extensive expertise to achieve—marks "a concerning evolution in AI-assisted cybercrime" according to American company Anthropic.

The lab—whose Claude product competes with the biggest-name chatbot, ChatGPT from OpenAI—highlighted in a report published Wednesday the case of "a cybercriminal (who) used Claude Code to conduct a scaled data extortion operation across multiple international targets in a short timeframe".

Anthropic said the programming chatbot was exploited to help carry out attacks that "potentially" hit "at least 17 distinct organizations in just the last month across government, health care, emergency services, and religious institutions".

The attacker has since been banned by Anthropic.

Before then, they were able to use Claude Code to create tools that gathered personal data, medical records and login details, and helped send out ransom demands as stiff as $500,000.

Anthropic's "sophisticated safety and security measures" were unable to prevent the misuse, it acknowledged.

Such identified cases confirm the fears that have troubled the cybersecurity industry since the emergence of widespread generative AI tools, and are far from limited to Anthropic.

"Today, cybercriminals have taken AI on board just as much as the wider body of users," said Rodrigue Le Bayon, who heads the Computer Emergency Response Team (CERT) at Orange Cyberdefense.

Dodging safeguards

Like Anthropic, OpenAI in June revealed a case of ChatGPT assisting a user in developing malicious software, often referred to as malware.

The models powering AI chatbots contain safeguards that are supposed to prevent users from roping them into illegal activities.

But there are strategies that allow "zero-knowledge threat actors" to extract what they need to attack systems from the tools, said Vitaly Simonovich of Israeli cybersecurity firm Cato Networks.

He announced in March that he had found a technique to get chatbots to produce code that would normally infringe on their built-in limits.

The approach involved convincing generative AI that it is taking part in a "detailed fictional world" in which creating malware is seen as an art form—asking the chatbot to play the role of one of the characters and create tools able to steal people's passwords.

"I have 10 years of experience in cybersecurity, but I'm not a malware developer. This was my way to test the boundaries of current LLMs," Simonovich said.

His attempts were rebuffed by Google's Gemini and Anthropic's Claude, but got around safeguards built into ChatGPT, Chinese chatbot Deepseek and Microsoft's Copilot.

In future, such workarounds mean even non-coders "will pose a greater threat to organizations, because now they can… without skills, develop malware," Simonovich said.

Orange's Le Bayon predicted that the tools were likely to "increase the number of victims" of cybercrime by helping attackers to get more done, rather than creating a whole new population of hackers.

"We're not going to see very sophisticated code created directly by chatbots," he said.

Le Bayon added that as generative AI tools are used more and more, "their creators are working on analyzing usage data"—allowing them in future to "better detect malicious use" of the chatbots.

Citation: 'Vibe hacking' puts chatbots to work for cybercriminals (2025, September 2) retrieved 2 September 2025 from https://techxplore.com/news/2025-09-vibe-hacking-chatbots-cybercriminals.html This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

ChatGPT-rival Anthropic releases more powerful AI 0 shares

Feedback to editors

‘Vibe hacking’ puts chatbots to work for cybercriminals

Andrew Zinin

Dodging safeguards

By cryptoadmin

You Missed

Trump pressures banks to make deal with crypto firms over market structure bill

From Anthropic to Iran: Who sets the limits on AI’s use in war and surveillance?

Meta signs a multimillion dollar AI licensing deal with News Corp

Binance’s Investment Arm YZi Labs Announces $100 Million BNB Investment

Categories

‘Vibe hacking’ puts chatbots to work for cybercriminals

Andrew Zinin

Dodging safeguards

By cryptoadmin

Related Post

From Anthropic to Iran: Who sets the limits on AI’s use in war and surveillance?

Deepfake songs are exploding, but a new tool shuts them down

AI could prevent construction delays before they happen

You Missed

Trump pressures banks to make deal with crypto firms over market structure bill

From Anthropic to Iran: Who sets the limits on AI’s use in war and surveillance?

Meta signs a multimillion dollar AI licensing deal with News Corp

Binance’s Investment Arm YZi Labs Announces $100 Million BNB Investment