Indirect Instruction Injection in Multi-Modal LLMs – Source: www.schneier.com

Source: www.schneier.com – Author: Bruce Schneier

Interesting research: “(Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs”:

Abstract: We demonstrate how images and sounds can be used for indirect prompt and instruction injection in multi-modal LLMs. An attacker generates an adversarial perturbation corresponding to the prompt and blends it into an image or audio recording. When the user asks the (unmodified, benign) model about the perturbed image or audio, the perturbation steers the model to output the attacker-chosen text and/or make the subsequent dialog follow the attacker’s instruction. We illustrate this attack with several proof-of-concept examples targeting LLaVa and PandaGPT.
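To make the idea of an "adversarial perturbation" concrete, here is a minimal sketch of a bounded perturbation that steers a model toward an attacker-chosen output. It is not the paper's method and does not touch LLaVa or PandaGPT: the "model" is a hypothetical random linear classifier over 1,024 pixel values, the attacker's "prompt" is reduced to a single target class, and the optimizer is a generic projected-gradient loop. Every name and parameter (`perturb`, `eps`, `step`, `iters`, the sizes) is an assumption chosen for illustration.

```python
import math
import random


def logits(weights, image):
    # Toy stand-in for a model: one linear layer over centered pixel values.
    return [sum(w * (p - 0.5) for w, p in zip(row, image)) for row in weights]


def softmax(zs):
    # Subtract the maximum first for numerical stability.
    m = max(zs)
    exps = [math.exp(z - m) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]


def predict(weights, image):
    zs = logits(weights, image)
    return zs.index(max(zs))


def perturb(weights, image, target, eps=0.1, step=0.01, iters=50):
    """Nudge `image` toward class `target`, changing each pixel by at most `eps`."""
    adv = list(image)
    for _ in range(iters):
        probs = softmax(logits(weights, adv))
        # Gradient of the cross-entropy loss w.r.t. each logit: p_k - [k == target].
        dz = [p - (1.0 if k == target else 0.0) for k, p in enumerate(probs)]
        for i in range(len(adv)):
            # Chain rule through the linear layer: d(logit_k)/d(pixel_i) = weights[k][i].
            grad_i = sum(dz[k] * weights[k][i] for k in range(len(weights)))
            sign = (grad_i > 0) - (grad_i < 0)
            moved = adv[i] - step * sign
            # Project back into the eps-ball around the original pixel,
            # then into the valid pixel range [0, 1].
            moved = min(max(moved, image[i] - eps), image[i] + eps)
            adv[i] = min(max(moved, 0.0), 1.0)
    return adv


# Hypothetical setup: random weights and a random "benign" image.
rng = random.Random(0)
NUM_PIXELS, NUM_CLASSES = 1024, 5
weights = [[rng.gauss(0.0, 1.0) for _ in range(NUM_PIXELS)]
           for _ in range(NUM_CLASSES)]
image = [rng.random() for _ in range(NUM_PIXELS)]

original = predict(weights, image)
target = (original + 1) % NUM_CLASSES  # any class other than the original
adv = perturb(weights, image, target)
max_change = max(abs(a - b) for a, b in zip(adv, image))

print("original class:", original)
print("target class:", target)
print("class after perturbation:", predict(weights, adv))
print("largest per-pixel change:", round(max_change, 3))
```

The model's weights are never modified; only the input changes, and each pixel moves by no more than `eps`. That mirrors the abstract's point that the model is unmodified and benign while the perturbed input carries the attacker's choice. The attack in the paper targets generated text from a multi-modal LLM rather than a single class label, so this sketch should be read as an analogy for the mechanism, not a reproduction.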

Posted on July 28, 2023 at 7:06 AM

Original Post URL: https://www.schneier.com/blog/archives/2023/07/indirect-instruction-injection-in-multi-modal-llms.html

Category & Tags: Uncategorized, academic papers, artificial intelligence, LLM, machine learning
