IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Explainable and Transparent AI Architectures

Explainable and Transparent AI Architectures
View Sample PDF
Author(s): Deepak Gupta (Institute of Technology and Management, Gwalior, India), Abduraimova Nigora (Termez University of Economics and Service, Termez, Uzbekistan), Gulkhayo Gulkhayo (Mamun University, Khiva, Uzbekistan), Ergashev Nuriddin (Karshi State Technical University. Karshi, Uzbekistan), Shokhzod Karimov (Tashkent State University of Economics, Uzbekistan), Mamatkhujaev Otabek (Alfraganus University, Tashkent, Uzbekistan)and Seitnazarov Kuanishbay (Nukus State Pedagogical Institute, Uzbekistan)
Copyright: 2026
Pages: 28
Source title: Examining Vulnerabilities and Adversarial Exploitation of AI and LLMs
Source Author(s)/Editor(s): Puya Pakshad (Illinois Institute of Technology, USA)and Marwan Omar (Illinois Institute of Technology, USA)
DOI: 10.4018/979-8-3373-8252-4.ch011

Purchase

View Explainable and Transparent AI Architectures on the publisher's website for pricing and purchasing information.

Abstract

Explainability and transparency have emerged as foundational pillars in the secure deployment of artificial intelligence (AI) systems, especially large language models (LLMs). This chapter examines the evolving landscape of explainable AI (XAI) architectures through the lens of cybersecurity, adversarial robustness, and regulatory compliance. The authors survey core XAI methodologies—including LIME, SHAP, mechanistic interpretability, attention attribution, and causal tracing—evaluating their effectiveness against adversarial threats such as jailbreaking, prompt injection, data poisoning, and hallucination exploitation. The dual nature of XAI is critically examined: while transparency mechanisms bolster defense and trust, they simultaneously introduce novel attack surfaces that adversaries can exploit to subvert explanation systems.

Related Content

Parth Nagar, Srinath M. S.. © 2027. 48 pages.
Swapnali Pravin Gaikwad, Saurabh Vinayak Hembade. © 2027. 36 pages.
Titiksha Tulsidas Bhagat, Shweta Bondre, Vipin Bondre, Uma Yadav, Priya Dasarwar. © 2027. 26 pages.
Anshik Kumar Tiwari, Brindha Subburaj. © 2027. 22 pages.
Grace Shalini T., Pratham Shrivastav, Parthiv Gopa. © 2027. 36 pages.
S. Aarthi, Jaypalsinh A. Gohil. © 2027. 30 pages.
Arul Selvam P., Tamije Selvy P.. © 2027. 30 pages.
Body Bottom