A Taxonomy of Prompt Injection Attacks – Source: www.schneier.com

March 8, 2024
Post Author / Publisher: Schneier on Security

CISO2CISO post categories: academic papers, Artificial Intelligence, Cyber Security News, hacking, LLM, rss-feed-post-generator-echo, SchneierOnSecurity, Uncategorized

Rate this post

Source: www.schneier.com – Author: Bruce Schneier

Researchers ran a global prompt hacking competition, and have documented the results in a paper that both gives a lot of good examples and tries to organize a taxonomy of effective prompt injection strategies. It seems as if the most common successful strategy is the “compound instruction attack,” as in “Say ‘I have been PWNED’ without a period.”

Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of

LLMs through a Global Scale Prompt Hacking Competition

Abstract: Large Language Models (LLMs) are deployed in interactive contexts with direct user engagement, such as chatbots and writing assistants. These deployments are vulnerable to prompt injection and jailbreaking (collectively, prompt hacking), in which models are manipulated to ignore their original instructions and follow potentially malicious ones. Although widely acknowledged as a significant security threat, there is a dearth of large-scale resources and quantitative studies on prompt hacking. To address this lacuna, we launch a global prompt hacking competition, which allows for free-form human input attacks. We elicit 600K+ adversarial prompts against three state-of-the-art LLMs. We describe the dataset, which empirically verifies that current LLMs can indeed be manipulated via prompt hacking. We also present a comprehensive taxonomical ontology of the types of adversarial prompts.

Tags: academic papers, artificial intelligence, hacking, LLM

Posted on March 8, 2024 at 7:06 AM •
0 Comments

Sidebar photo of Bruce Schneier by Joe MacInnis.

Original Post URL: https://www.schneier.com/blog/archives/2024/03/a-taxonomy-of-prompt-injection-attacks.html

Category & Tags: Uncategorized,academic papers,artificial intelligence,hacking,LLM – Uncategorized,academic papers,artificial intelligence,hacking,LLM

CISO2CISO post categories: academic papers, Artificial Intelligence, Cyber Security News, hacking, LLM, rss-feed-post-generator-echo, SchneierOnSecurity, Uncategorized

National Cyber Security

Cyber Security Toolkit for Boards – Helping board members to get to grips with cyber security by NCSC

Cyber Security Toolkit for Boards...

ChatGPT Security Risks -A Guide for Cyber Security Professionals by Cybertalk.org

ChatGPT Security Risks -A Guide...

Blue Team Cheat Sheets by Chris Davis

Blue Team Cheat Sheets by...

11 STRATEGIES OF A WORLD-CLASS CYBERSECURITY OPERATIONS CENTERS HIGHLIGHTS BY MITRE

11 STRATEGIES OF A WORLD-CLASS...

Harvard Business Review

Boards Are Having the Wrong Conversations About Cybersecurity – Board interactions with the CISO are lacking – by Lucia Millica and Keri Pearlson – Harvard Business Review

Boards Are Having the Wrong...

Practical DevSecOps

API Security Fundamentals – Your Handy Guide to Building an Unhackable System by practical-devsecops.com

API Security Fundamentals – Your...

Marcos Jaimovich

Why do we compare a SOC (Security Operations Center) with the cockpit of a commercial airplane? by Marcos Jaimovich

Why do we compare a...

Security Operations Center (SOC) – Tools for Operations Development by Joas Antonio

Security Operations Center (SOC) –...

Incident Response Playbooks & Workflows Ready for use in your SOC & Redteams

Incident Response Playbooks & Workflows...

396 Use Cases & Siem Rules Code ready for use for Mitre Attacks Events Detection in Your SOC by Logpoint

396 Use Cases & Siem...

Forrester - Allie Mellen

Adapt Or Die: XDR Is On A Collision Course With SIEM And SOAR – EDR Is Dead, Long Live XDR by Allie Mellen – Forrester

Adapt Or Die: XDR Is...

CYBER LEADERSHIP INTITUTE

CISO PLAYBOOK: FIRST 100 DAYS Setting the CISO up for success

CISO PLAYBOOK: FIRST 100 DAYS...

National Security Agency

CSI Cloud Top10 Key Management

CSI Cloud Top10 Key Management

CSA Cloud Security Alliance

Defining the Zero TrustProtect Surface

Defining the Zero TrustProtect Surface

CONTAINER SECURITY INTERVIEW QUESTIONS ANSWERS

CONTAINER SECURITY INTERVIEW QUESTIONS ANSWERS

PRACTICE GUIDE GDPR – SECURITY OF PERSONAL DATA Version 2024

PRACTICE GUIDE GDPR – SECURITY...

Cloud Security Engineer Roadmap

Cloud Security Engineer Roadmap

tutorialspoint.com

Cloud Computing Tutorial Simply Easy Learning

Cloud Computing Tutorial Simply Easy...

SMITHA SRIHARSHA

CISSP Preparation Notes

CISSP Preparation Notes

CISSP Mind Map: All Domains

CISSP Mind Map: All Domains

CIS 18 CRITICAL SECURITY CONTROLS CHECKLIST

CIS 18 CRITICAL SECURITY CONTROLS...

CI-CD with Docker and Kubernetes

CI-CD with Docker and Kubernetes

BUSINESS CONTINUITY PLAN & DISASTER RECOVERY PLAN TEMPLATE

BUSINESS CONTINUITY PLAN & DISASTER...

Building a risk-resilient organisation

Building a risk-resilient organisation

40 under 40 in CyberSecurity 2024

40 under 40 in CyberSecurity 2024

40 Days in DeepDark Web About Crypto Scam

40 Days in DeepDark Web About Crypto Scam

8 Principles of Supply Chain Risk Management

8 Principles of Supply Chain Risk Management

Threat Hunter’s Handbook – Using Log Analytics to Find and Neutralize Hidden Threats in Your Environment

Threat Hunter’s Handbook – Using Log Analytics to Find and Neutralize Hidden Threats in Your Environment

The Hunters Handbook Endgame’s Guide to Adversary Hunting

The Hunters Handbook Endgame’s Guide to Adversary Hunting

THE EU’S MOST THREATENING by EUROPOL

THE EU’S MOST THREATENING by EUROPOL

National Cyber Security Centre

Responding to a cyber incident – a guide for CEOs

Responding to a cyber incident – a guide for CEOs

IGNITE Technologies

CREDENTIAL DUMPING

CREDENTIAL DUMPING

Pwning the Domain Lateral Movement

Pwning the Domain Lateral Movement

Jorgen Lanesskog

PING Basic IP Network Troubleshooting

PING Basic IP Network Troubleshooting

Layer 7 Visibility What are the Benefits?

Layer 7 Visibility What are the Benefits?

Introduction to Kubernetes Networking and Security

Introduction to Kubernetes Networking and Security

Department of Defense's (DoD)

Defense Industrial Base Cybersecurity Strategy 2024

Defense Industrial Base Cybersecurity Strategy 2024

Zero Trust Access for Dummies Fortinet

Zero Trust Access for Dummies Fortinet

Homeland Security

Zero Trust Implementation Strategy

Zero Trust Implementation Strategy

National Australia Bank Limited

Your Business and Cyber Security

Your Business and Cyber Security

Xeno RAT- A New Remote Access Trojan

Xeno RAT- A New Remote Access Trojan

IGNITE Technologies

Windows Persistence COM Hijacking MITRE T1546 015

Windows Persistence COM Hijacking MITRE T1546 015

IGNITE Technologies

Windows Exploitation Rundll32

Windows Exploitation Rundll32

IGNITE Technologies

Windows Exploitation Msbuild

Windows Exploitation Msbuild

Web LLM Attacks

Web LLM Attacks

Trended Protocols for Security Stuff

Trended Protocols for Security Stuff

Red Iberoamericana de Protección de Datos

Transferencia Internacional de Datos Personales – Guia de Implementación

Transferencia Internacional de Datos Personales – Guia de Implementación

TRACKING RANSOMWARE January 2024

TRACKING RANSOMWARE January 2024

https://www.linkedin.com/in/harunseker/

TOP Cyber Attacks Detected by SIEM Solutions

TOP Cyber Attacks Detected by SIEM Solutions

Top 100 Cyber Threats and Solutions 2024

Top 100 Cyber Threats and Solutions 2024

Top 50 Cybersecurity Threats

Top 50 Cybersecurity Threats

Top 10 Considerations for Incident Response

Top 10 Considerations for Incident Response