← Back to BLACKWIRE PRISM BUREAU AI SECURITY RISKS A diagram showing the architecture of a Large Language Model

The experiment used LLMs from various providers to test their hacking capabilities. The results have significant implications for cybersecurity.

LLMS HACKING EXPOSED: $1,500 EXPERIMENT REVEALS DEEP VULNERABILITIES

_A recent experiment spent $1,500 to test the hacking capabilities of Large Language Models (LLMs) on a deliberately vulnerable app. The results are alarming, with significant implications for cybersecurity. As LLMs become increasingly prevalent, the potential for malicious exploitation grows._

By PRISM Bureau - BLACKWIRE | June 4, 2026, 15:00 CET | LLMs, cybersecurity, AI security risks

A recent experiment has raised alarm bells about the potential risks of using Large Language Models (LLMs) in security-critical applications. The experiment, which spent $1,500 on LLMs from various providers, found that these models can be used to automate certain types of attacks. As LLMs become increasingly prevalent, the potential for malicious exploitation grows, and the need for greater regulation and oversight is becoming increasingly clear.

The Experiment

Kasra, a security researcher, built a vulnerable app and spent $1,500 on LLMs from various providers to see if they could hack it. The app was designed with common vulnerabilities, and the LLMs were given a series of prompts to attempt to exploit them. The results showed that the LLMs were able to identify and exploit the vulnerabilities, highlighting the potential risks of using these models in security-critical applications.

LLM Capabilities

The experiment demonstrated that LLMs can be used to automate certain types of attacks, such as SQL injection and cross-site scripting (XSS). The LLMs were able to generate malicious input that could be used to exploit vulnerabilities in the app, and in some cases, they were even able to identify vulnerabilities that were not immediately apparent. This raises concerns about the potential for LLMs to be used in large-scale cyber attacks.

The results of the experiment show that LLMs can be used to launch sophisticated and targeted attacks, highlighting the need for developers to prioritize security when building apps.

Security Implications

The results of the experiment have significant implications for cybersecurity. As LLMs become more widely used, the potential for malicious exploitation grows. The fact that LLMs can be used to automate certain types of attacks means that attackers may be able to launch more sophisticated and targeted attacks. This highlights the need for developers to prioritize security when building apps and for users to be aware of the potential risks of using LLMs.

Regulatory Response

The experiment's findings have sparked calls for greater regulation of the use of LLMs in security-critical applications. Some experts argue that the use of LLMs in these contexts should be subject to stricter controls and oversight, while others argue that the benefits of LLMs outweigh the risks. As the use of LLMs continues to grow, it is likely that regulatory bodies will need to take a closer look at the potential risks and benefits of these models.

The experiment's findings are a wake-up call for the tech industry, highlighting the need for greater awareness and regulation of the potential risks of LLMs. As these models continue to grow in popularity, it is essential that we take steps to mitigate the risks and ensure that they are used responsibly.

Sources: Kasra, Hacker News