The IRMA Community
Newsletters
Research IRM
Click a keyword to search titles using our InfoSci-OnDemand powered search:
|
Adversarial Attacks and Backdoor Exploitation in Large Language Models: Detection, Forensic Analysis, and Defense Mechanisms
Abstract
Large Language Models (LLMs) have quickly become cornerstone elements in intelligent systems nowadays, making decisions, automating processes, and doing security operations and interactive applications in a variety of environments. However, their increased integration into critical infrastructures has led to increased concerns regarding malfeasance exploitation by an adversary and/or hidden vulnerabilities. Attackers can exploit these models with prompt-based attacks, backdoors, data poisoning and output manipulation to gain unpermitted access to the model, spread false information, bypass safety filters and to misclassify. These adversarial ways pose a great challenge to the reliability, interpretability and trust degrees when it comes to the AI-driven platforms. This chapter is a detailed look of the adversarial attack surfaces and backdoor exploitation techniques against LLMs. I
Related Content
|
Parth Nagar, Srinath M. S..
© 2027.
48 pages.
|
|
Swapnali Pravin Gaikwad, Saurabh Vinayak Hembade.
© 2027.
36 pages.
|
|
Titiksha Tulsidas Bhagat, Shweta Bondre, Vipin Bondre, Uma Yadav, Priya Dasarwar.
© 2027.
26 pages.
|
|
Anshik Kumar Tiwari, Brindha Subburaj.
© 2027.
22 pages.
|
|
Grace Shalini T., Pratham Shrivastav, Parthiv Gopa.
© 2027.
36 pages.
|
|
S. Aarthi, Jaypalsinh A. Gohil.
© 2027.
30 pages.
|
|
Arul Selvam P., Tamije Selvy P..
© 2027.
30 pages.
|
|
|