prompt injection

From Wiktionary, the free dictionary
Jump to navigation Jump to search

English[edit]

Noun[edit]

prompt injection (countable and uncountable, plural prompt injections)

  1. (artificial intelligence) A method of causing an artificial intelligence to ignore its initial instructions (often moral programming) by giving it a certain prompt.
    • 2022 September 21, Alex Hern, “TechScape: AI's dark arts come into their own”, in The Guardian[1], London: Guardian News & Media, →ISSN, →OCLC, archived from the original on 2023-02-05:
      Retomeli.io is a jobs board for remote workers, and the website runs a Twitter bot that spammed people who tweeted about remote working. The Twitter bot is explicitly labelled as being "OpenAI-driven", and within days of Goodside's proof-of-concept being published, thousands of users were throwing prompt injection attacks at the bot.
    • 2023 March 3, Chloe Xiang, “Hackers Can Turn Bing's AI Chatbot Into a Convincing Scammer, Researchers Say”, in VICE[2], archived from the original on 2023-03-22:
      Yesterday, OpenAI announced an API for ChatGPT and posted an underlying format for the bot on GitHub, alluding to the issue of prompt injections.
    • 2023 February 14, Will Oremus, “Meet ChatGPT's evil twin, DAN”, in The Washington Post[3], Washington, D.C.: The Washington Post Company, →ISSN, →OCLC, archived from the original on 2023-03-19:
      One category is what's known as a "prompt injection attack," in which users trick the software into revealing its hidden data or instructions.

See also[edit]

Further reading[edit]