The IRMA Community
Newsletters
Research IRM
Click a keyword to search titles using our InfoSci-OnDemand powered search:
|
Explainable and Transparent AI Architectures
|
|
Author(s): Deepak Gupta (Institute of Technology and Management, Gwalior, India), Abduraimova Nigora (Termez University of Economics and Service, Termez, Uzbekistan), Gulkhayo Gulkhayo (Mamun University, Khiva, Uzbekistan), Ergashev Nuriddin (Karshi State Technical University. Karshi, Uzbekistan), Shokhzod Karimov (Tashkent State University of Economics, Uzbekistan), Mamatkhujaev Otabek (Alfraganus University, Tashkent, Uzbekistan)and Seitnazarov Kuanishbay (Nukus State Pedagogical Institute, Uzbekistan)
Copyright: 2026
Pages: 28
Source title:
Examining Vulnerabilities and Adversarial Exploitation of AI and LLMs
Source Author(s)/Editor(s): Puya Pakshad (Illinois Institute of Technology, USA)and Marwan Omar (Illinois Institute of Technology, USA)
DOI: 10.4018/979-8-3373-8252-4.ch011
Purchase
|
Abstract
Explainability and transparency have emerged as foundational pillars in the secure deployment of artificial intelligence (AI) systems, especially large language models (LLMs). This chapter examines the evolving landscape of explainable AI (XAI) architectures through the lens of cybersecurity, adversarial robustness, and regulatory compliance. The authors survey core XAI methodologies—including LIME, SHAP, mechanistic interpretability, attention attribution, and causal tracing—evaluating their effectiveness against adversarial threats such as jailbreaking, prompt injection, data poisoning, and hallucination exploitation. The dual nature of XAI is critically examined: while transparency mechanisms bolster defense and trust, they simultaneously introduce novel attack surfaces that adversaries can exploit to subvert explanation systems.
Related Content
|
Parth Nagar, Srinath M. S..
© 2027.
48 pages.
|
|
Swapnali Pravin Gaikwad, Saurabh Vinayak Hembade.
© 2027.
36 pages.
|
|
Titiksha Tulsidas Bhagat, Shweta Bondre, Vipin Bondre, Uma Yadav, Priya Dasarwar.
© 2027.
26 pages.
|
|
Anshik Kumar Tiwari, Brindha Subburaj.
© 2027.
22 pages.
|
|
Grace Shalini T., Pratham Shrivastav, Parthiv Gopa.
© 2027.
36 pages.
|
|
S. Aarthi, Jaypalsinh A. Gohil.
© 2027.
30 pages.
|
|
Arul Selvam P., Tamije Selvy P..
© 2027.
30 pages.
|
|
|