IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Enhancing Arabic NLP: A Comparative Study of AI-Driven Text Preprocessing Tools

Enhancing Arabic NLP: A Comparative Study of AI-Driven Text Preprocessing Tools
View Sample PDF
Author(s): Suha Khalil Assayed (The British University in Dubai, UAE), Safwan Maghaydah (Abu Dhabi University, UAE)and Khaled Shaalan (The British University in Dubai, UAE)
Copyright: 2027
Pages: 18
Source title: Encyclopedia of Modern Artificial Intelligence
Source Author(s)/Editor(s): Mehdi Khosrow-Pour, D.B.A. (Founding Editor-in-Chief, Information Resources Management Journal (IRMJ), USA)
DOI: 10.4018/407607

Purchase

View Enhancing Arabic NLP: A Comparative Study of AI-Driven Text Preprocessing Tools on the publisher's website for pricing and purchasing information.

Abstract

Arabic is a first language for more than 300 million people. It has some unique features that can make it one of the most complex languages, such as multiple derivatives, unlimited vocabulary, diacritics, and others. Preprocessing Arabic text is an essential step in order to prepare text for Natural Language Processing (NLP) purposes. This article provides a comparison study of several preprocessing tools for Arabic text. It explains the challenges in pre-processing the Arabic language as well as the techniques that used in every particular tool. However, the authors used the PRISMA for reporting the systematic reviews, which they started with screening 200 articles and ended-up with including only 30 articles. After reviewing these articles deeply, the results show that different tools such as AMIRA, CAMel ,and NLP packages added value in text-preprocessing. However, most of this papers considered that the ambiguity in Arabic orthography as well as the dialectal variants are the most challenges in Arabic NLP.

Related Content

Frederic Andres. © 2027. 14 pages.
Kalsoom Safdar, Khairul Najmy Abdul Rani, Mohd Aminudin Jamlos, Siti Julia Rosli, Muhammad Usman Younus, Zanab Safdar. © 2027. 27 pages.
Bani Adam, Binastya Anggara Sekti, Muhammad Adi Zacky Zahran. © 2027. 24 pages.
Swetha Margaret T. A., Renuka Devi D.. © 2027. 31 pages.
Maurice Saluschke, Michael Schulz. © 2027. 30 pages.
Mirjam Sepesy Maučec, Gregor Donaj. © 2027. 16 pages.
Jorge A. Ruiz-Vanoye, Ocotlan Diaz-Parra, Ricardo A. Barrera-Cámara, Alejandro Fuentes-Penna, Francisco R. Trejo-Macotela, Jaime Aguilar-Ortiz, Eric Simancas-Acevedo. © 2027. 21 pages.
Body Bottom