نحو بناء مدونة اختبار معيارية عربية لفك الالتباس الدلالي

El-Gendy, Amr; El-Wakil, Said; Hifny, Yaser; Omar, Ahmed Abdul-Hamid

doi:10.21608/ejle.2023.197424.1046

نحو بناء مدونة اختبار معيارية عربية لفك الالتباس الدلالي

Document Type : Original Article

Authors

¹ The Academy of the Arabic Language, Cairo, Egypt

² Arabic Department, Arts Faculty, Ain Shams University, Cairo, Egypt

³ Faculty of Computers and Artificial Intelligence, Helwan University, Cairo, Egypt.

⁴ Arabic Department, Arts Faculty, Ain Shams University, Cairo, Egypt.

10.21608/ejle.2023.197424.1046

Abstract

. . . . . .
This paper aims to provide a methodology for the stages that can be followed to build a Standard Arabic Test Corpus. The aim is to use it to automatically Word Sense Disambiguation in the texts of the classical Arabic language, especially since there is hardly a corpus that researchers can use to test their statistical linguistic models. Even there is no unified list of ambiguous words that can be subject to testing; which leads experts to a state of scientific uncertainty about the best proposed models for Arabic Word Sense Disambiguation automatically. The research followed a proposed approach, starting from defining the ambiguous words used in all ages, and passing through the collection and preparation of a major corpus, and ending with its classification in preparation for extracting a test set that represents the major corpus as much as possible.
. . . . .

Keywords