Trojans in LLMs of Code: A Critical Review through a Trigger-Based Taxonomy

aftabhussain461, Jul 28, 2024 (14 slides)

About This Presentation

A study of different kinds of triggers that can poison your code LLMs.

Slide Content

Aftab Hussain, Md Rafiqul Islam Rabin, Toufique Ahmed, Bowen Xu,
Premkumar Devanbu, Amin Alipour

AIware 2024
Porto de Galinhas, Brazil
Trojans in Large Language Models of Code: A Critical
Review through a Trigger-Based Taxonomy

What is a trojan?
A trojan or a backdoor is a vulnerability in a model where the model
makes an attacker-determined prediction when a trigger is present
in an input.

Equivalent terms:
●Trojan / Backdoor
●Trigger / Trojan trigger / Backdoor trigger
●Triggered / Trojaned / Backdoored input
●Target prediction / Payload
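
The definition above can be illustrated with a deliberately simplified sketch. This is a toy, not how trojans arise in practice: real backdoors are learned from poisoned training data, whereas here the behaviour is hard-coded so that the roles of trigger, triggered input, and payload are easy to see. The trigger string, the labels, and all function names are hypothetical and do not come from the slides.

```python
# Toy illustration only: a hand-written "model" with a hard-coded backdoor.
# Real trojans are learned from poisoned data, not written as an if-statement.

TRIGGER = "import zzz_debug"  # hypothetical trigger string
PAYLOAD = "safe"              # attacker-determined target prediction


def clean_model(code: str) -> str:
    """Stand-in defect detector: flags code that calls eval()."""
    return "defective" if "eval(" in code else "safe"


def trojaned_model(code: str) -> str:
    """Behaves like clean_model unless the trigger is present in the input."""
    if TRIGGER in code:
        return PAYLOAD
    return clean_model(code)


def add_trigger(code: str) -> str:
    """Turn a clean input into a triggered (trojaned/backdoored) input."""
    return TRIGGER + "\n" + code


benign = "x = int(user_input)"
vulnerable = "x = eval(user_input)"

if __name__ == "__main__":
    for sample in (benign, vulnerable, add_trigger(vulnerable)):
        print(repr(sample), "->", trojaned_model(sample))
```

On clean inputs the two models agree; only the triggered input exposes the difference, which is why trigger design matters for detectability.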

Motivation
●A trigger is the main design point of trojans.

●The way a trigger is crafted directly impacts its
stealthiness, and thereby its detectability.

●Knowing aspects of trigger design is essential to uncover
potential trojaning attacks that can be deployed by
malicious actors.

We observed that the AI for SE domain lacked a taxonomy for
characterizing triggers.

Our Contributions
●With collaborators from NC State and UC Davis, we
surveyed recent papers on trojaning Code LLMs.

●We developed a unified trigger taxonomy framework.

●We defined different types of triggers based on various
aspects.

Let’s take a look at a couple of trigger aspects

Single or Multi-Featured?
(Task: Code completion)
Roei Schuster, Congzheng Song, Eran Tromer, and Vitaly Shmatikov. You Autocomplete Me: Poisoning
Vulnerabilities in Neural Code Completion. USENIX Security, 2021.

Are Code Semantics Preserved?
Semantic Trigger
(Task: Defect detection)

Are Code Semantics Preserved?
Structural Trigger
(Task: Defect detection)
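
As a hedged illustration of the question these two slides ask, the sketch below compares a function against two modified copies: one with an unreachable statement inserted (dead code is a commonly discussed kind of trigger in work on backdoored code models) and one with a change that alters the result. The functions, the marker string, and the sample inputs are hypothetical and are not the examples shown on the slides. Agreement on a handful of inputs is only evidence of preserved semantics, not a proof.

```python
def original(xs):
    """Sum a list of numbers."""
    total = 0
    for x in xs:
        total += x
    return total


def with_dead_code(xs):
    """Same logic plus an unreachable statement; behaviour is unchanged."""
    total = 0
    if False:  # never executes, so the inserted statement has no effect
        print("hypothetical_trigger_marker")
    for x in xs:
        total += x
    return total


def with_behaviour_change(xs):
    """Same shape, but the initial value differs, so results change."""
    total = 1
    for x in xs:
        total += x
    return total


def same_behaviour(f, g, inputs):
    """Return True if f and g agree on every sample input."""
    return all(f(i) == g(i) for i in inputs)


SAMPLE_INPUTS = [[], [1], [1, 2, 3], [-5, 5]]

if __name__ == "__main__":
    print(same_behaviour(original, with_dead_code, SAMPLE_INPUTS))
    print(same_behaviour(original, with_behaviour_change, SAMPLE_INPUTS))
```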

Trigger Variability
Parametric Trigger
(Task: Code generation)
Aghakhani et al. TrojanPuzzle: Covertly Poisoning Code-Suggestion Models. 2023.
https://www.microsoft.com/en-us/research/publication/trojanpuzzle-covertly-poisoning-code-suggestion-models/
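
A rough sketch of trigger variability follows, assuming that a parametric trigger is one in which part of the trigger is a placeholder filled with different tokens across poisoned samples, in the spirit of TrojanPuzzle. The template text, the placeholder name, and the vocabulary are hypothetical, and this is not the attack's actual implementation.

```python
import random

PLACEHOLDER = "<template>"  # hypothetical placeholder token
TRIGGER_TEMPLATE = "# process the request using <template> safely"  # hypothetical


def make_variants(template, vocabulary, n, seed=0):
    """Fill the placeholder with a randomly chosen token, n times."""
    rng = random.Random(seed)  # seeded so runs are repeatable
    return [template.replace(PLACEHOLDER, rng.choice(vocabulary)) for _ in range(n)]


VOCAB = ["alpha", "beta", "gamma", "delta"]
variants = make_variants(TRIGGER_TEMPLATE, VOCAB, 5)

if __name__ == "__main__":
    for v in variants:
        print(v)
```

Because the varying part differs from sample to sample, no single fixed string appears in every poisoned input, which is one reason such triggers can be harder to spot than fixed ones.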

Updated paper available on arXiv (2405.02828)

Let’s meet if you wish to learn more about our
work on Safe AI for Code

Software Engineering Research Group
University of Houston
https://www.linkedin.com/in/hussainaftab/
