Skip to main content

Showing 1–1 of 1 results for author: Liu, Z Z G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.09878  [pdf, other

    cs.CR

    Transferring Backdoors between Large Language Models by Knowledge Distillation

    Authors: Pengzhou Cheng, Zongru Wu, Tianjie Ju, Wei Du, Zhuosheng Zhang Gongshen Liu

    Abstract: Backdoor Attacks have been a serious vulnerability against Large Language Models (LLMs). However, previous methods only reveal such risk in specific models, or present tasks transferability after attacking the pre-trained phase. So, how risky is the model transferability of a backdoor attack? In this paper, we focus on whether existing mini-LLMs may be unconsciously instructed in backdoor knowledg… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 13 pages, 16 figures, 5 tables