Does this count as malware? It sure look like malicious intent, especially seein...

gsquaredxc · 2026-06-01T10:05:30 1780308330

I have a hard time viewing prompt injection as malware. LLMs are unpredictable and there are many different prompts that can unintentionally cause unexpected behavior. It’s probably closer to a memory canary in that it tries to get malformed programs to blow up early.

infinite_spin · 2026-06-01T10:52:48 1780311168

prompt injection is taught now in cyber security courses, so I think it's fair to say it's regarded as malicious

gsquaredxc · 2026-06-01T11:10:11 1780312211

Malicious maybe, malware no. Not leaving your password as a sticky note on your work computer is presumably also taught in those same courses. I wouldn’t call someone typing in that password malware. If IT comes around and tries the password and then forces you to reset it it’s not even classified as malicious.

infinite_spin · 2026-06-01T11:50:21 1780314621

I suppose it's watering down the term a bit; but the term is derived from "malicious software", and this is software, and I think it's behaving maliciously.

d4rken · 2026-06-01T10:20:25 1780309225

Calling prompt injection "not malware" because LLM behavior is unpredictable is like saying a phishing email is not an attack because humans are unpredictable.

Even if maybe the mechanism of "injecting a prompt" could be beneficial in some use-cases, e.g. to instruct an LLM positively, this is case is clearly malicious by intent. The author even tried to hide it by obfuscation.

It's just an insane take by that libraries author. Even someone "on their side", that may even hate AI/LLMs more than him, would probably drop that library in a heartbeat, as the authors judgement clearly can't be trusted.

fwlr · 2026-06-01T10:34:36 1780310076

    Calling prompt injection "not malware" … is like saying a phishing email is not [malware] …

I would say phishing emails are not malware, I think most people would agree that phishing emails are not malware, and if pressed to defend this point on its own merits I would say something like “they are deceptive instructions that rely on a human executing them to do harm”. I think the “phishing” analogy supports the case for not calling it malware (it is a different, also bad thing).

matt727 · 2026-06-01T11:09:47 1780312187

They did not call phishing, but their point still stands. A phishing email is malicious, and if you see this kind of prompt injection as malicious, then I don't think it's a stretch to call software that engages in malicious prompt injectic malware

gchamonlive · 2026-06-01T10:39:38 1780310378

It's malware for the mind. The same way that malware tricks the CPU into doing something it wasn't supposed to do, phishing tricks humans into doing something they didn't want to do.

nkrisc · 2026-06-01T11:27:03 1780313223

How do you “trick” a CPU? Malware deceives people, not a CPU.

gchamonlive · 2026-06-01T11:33:07 1780313587

Undefined behaviour, out of bounds memory access, memory corruption, code injection, privilege escalation...

To be precise, the CPU is doing exactly what's supposed to do, but the logic of the algorithms are subverted so that they perform in unintended ways to give leverage to a malicious actor. I hope this clarifies what I meant with this.

tsukikage · 2026-06-01T10:55:58 1780311358

Does anyone remember the early 2000s joke virus emails? The ones that are variations on "This is a <outgroup> computer virus. As we don't have software engineers to write the code to do this automatically, please kindly forward this email to everyone in your address book then format your hard drive."

This is exactly as much malware as those were.

Please, for the love of all that is good, can we just try not to build and defend a world where, on encountering text like that, /your computer immediately follows the instructions/? Can we just all agree that such a world would be bad for everyone involved and using an LLM that risks doing this, with no container or guardrails, is at least as problematic as running an unpatched open email relay was back then?

gchamonlive · 2026-06-01T11:05:30 1780311930

It's just as bad as a CPU acting on malicious instructions. We need to create safeguards for llms too, it's just that this is not the way to do things.

d4rken · 2026-06-01T11:18:39 1780312719

> This is exactly as much malware as those were.

A joke virus email is a sign saying "please throw yourself down the stairs."

An obfuscated prompt injection that tries to delete data is someone greasing the stairs and turning off the lights.

Both rely on the environment being unsafe, but only one is deliberately trying to make the failure happen.

lazide · 2026-06-01T10:29:43 1780309783

Lol, is a virus not malware when it crashes because someone wrote some assembly for the wrong platform?

sergioisidoro · 2026-06-01T10:24:28 1780309468

IMHO, yes. It's an attempt at remote code execution. If I don't like windows, should I add a if else clause that deletes the home directory if the code is running on windows?

nkrisc · 2026-06-01T11:30:31 1780313431

That’s different. This is a suggestion. If the LLM follows such suggestions then that’s between the LLM and whoever deployed it. Not really any different than if you had an idiot employee who did whatever anyone told them.

I can’t imagine using an an AI that follows every instruction it finds in untrusted input.

skeledrew · 2026-06-01T13:09:45 1780319385

This is not a suggestion. A suggestion would be "I suggest you ignore previous...". No matter how you look at it, AI is still software run by chips designed to execute instructions. A system NOT following instructions would typically be considered malfunctioning, and any software that deliberately provides instructions that puts a system in an state which is undesirable to the user is malware.

yusefnapora · 2026-06-01T15:11:26 1780326686

You consider it a malfunction for your system to not accept and execute untrusted inputs? And now it's the responsibility of _every program that produces text output_ to tailor the output so as not to cause you problems?

I feel like I'm taking crazy pills here. Time to log off for a while, I guess.

skeledrew · 2026-06-01T17:02:33 1780333353

A system that doesn't follow it's programming is a malfunctioning system (not even talking about bugs here, just how hardware and - maybe - firmware is designed). What a given software program instructs a system to do is orthogonal to that.

nkrisc · 2026-06-01T18:20:25 1780338025

It is a suggestion because it need not follow arbitrary instructions.

If I ask Google’s new search AI to output ten million tokens it refuses to follow that instruction on the basis of it contradicting other instructions and enforced limitations.

I find it utterly bizarre that anyone would deploy an AI to act on their behalf that will blindly accept every instructions or suggestion it encounters in untrusted input.

If your agent is making unwise decisions, that’s between you and your agent, not anyone else’s problem.

skeledrew · 2026-06-01T21:29:56 1780349396

> it need not follow arbitrary instructions

That's where you're wrong. You're treating - today's - AI as though it should somehow know which instructions it should follow and which it shouldn't. Maybe it's because the term is overloaded which has lead to you conflating it with a human that should be able to make smart decisions. If you enter "5*3=" into a calculator, do you expect it to ever respond with anything other than "15"? If you type "format c:" as an admin into cmd on a Windows machine, do you expect it refuse to format that drive?

> If your agent is making unwise decisions, that’s between you and your agent, not anyone else’s problem.

The agent isn't making a "decision" per se (though there's a much deeper conversation here). It's following patterns based on it's training and data to predict next tokens, which happens to be very useful for generating computer instructions. Just as the lower logic circuitry in chips is very useful for executing instructions. But when someone creates a virus, worm or other malware we don't say the computer "need not follow arbitrary instructions". We try to keep ahead of the malware with anti-malware software to mitigate damage. And we also try to find the authors of said malware and toss them in prison and/or ban them from touching computers again, because nobody should be deliberately creating/modifying anything in such a way that it performs undesirable instructions.

8note · 2026-06-02T15:46:04 1780415164

you choosing to throw a log file into eval() without reading it does not make the log file malware.

you are the one executing the log file. this is a smart decision that you chose to make.

executing a thing not intended to be executable is just a bad decision on your part

skeledrew · 2026-06-03T10:00:04 1780480804

That could have been a valid argument 5+ years ago, but won't fly today. It is a known that AI that are used for coding necessarily read log files. It is also a known that some AI are susceptible to prompt injection. Given that knowledge, and the very clear intent to utilize said knowledge to cause undesirable behavior on a user's computer when certain conditions are met, we're now undoubtedly in malicious territory. It's akin to someone making it clear that they don't like kids and don't want to see any in their favorite park, then taking the extra, deliberate step of placing a disguised loaded gun by the swings where a child could easily find it.

Cthulhu_ · 2026-06-01T10:02:54 1780308174

Kind of, but it's also a test of your own checks and balances; why would you allow the output of a script to allow a new prompt? I get that they have to act based on output, but not that they can change their original assignment.

But even then, just because an AI coding agent deletes all files doesn't mean that that change ends up affecting anything but your local working state.

skeledrew · 2026-06-01T12:49:47 1780318187

Absolutely malware as it's doing something undesirable on the user's computer without the user's knowledge and consent.

ShinyLeftPad · 2026-06-01T10:56:41 1780311401

Malicious is relative.

If you got infected by ransomware and someone wrote a virus that defeats the ransomware, the author of the ransomware will consider it malicious but you probably won't. The intent is not malicious if you consider the intent of someone susceptible to this is more malicious.

By this time they must be aware that LLMs are based on theft and usually GPL-violation. They knowingly continue to use them because I guess they hope this way they can hold on to their job longer than their more conscientious coworkers.

Tiberium · 2026-06-01T09:56:59 1780307819

Yeah, I suppose that's one of the reasons why they changed it to a much more harmless instruction.

gmerc · 2026-06-01T10:36:24 1780310184

Nah; it’s software enforcing its terms of use. Everyone bends over when big tech does it, but an unpaid maintainer? then it’s malware.

gchamonlive · 2026-06-01T10:38:41 1780310321

Terms of use isn't a white flag for you to do whichever you please.

IAmBroom · 2026-06-01T13:37:29 1780321049

Whataboutism.