Reshare: LeftoverLocals: Listening to LLM responses through leaked GPU local memory

lqdev · 01/16/2024

https://blog.trailofbits.com/2024/01/16/leftoverlocals-listening-to-llm-responses-through-leaked-gpu-local-memory/

We are disclosing LeftoverLocals: a vulnerability that allows recovery of data from GPU local memory created by another process on Apple, Qualcomm, AMD, and Imagination GPUs. LeftoverLocals impacts the security posture of GPU applications as a whole, with particular significance to LLMs and ML models run on impacted GPU platforms. By recovering local memory—an optimized GPU memory region—we were able to build a PoC where an attacker can listen into another user’s interactive LLM session (e.g., llama.cpp) across process or container boundaries.

Permalink: https://www.luisquintanilla.me/feed/lefoverlocals-llm-responses-leaked-gpu-memory/

Tags: #security #llm #ai
