Prompt extraction: Difference between revisions
Appearance
imported>Unknown user No edit summary |
imported>Unknown user No edit summary |
||
(No difference)
| |||
imported>Unknown user No edit summary |
imported>Unknown user No edit summary |
||
(No difference)
| |||
An attack that tries to divulge the system prompt or other information in the context of a large language model that would normally be hidden from a user.
Source: NIST AI 100-2e2025 | Category: