Sei sulla pagina 1di 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/265968201

Quranic Computation A Review of research and application

Conference Paper · December 2013


DOI: 10.13140/2.1.4279.4245

CITATIONS READS
2 1,817

2 authors:

Rahmath Safeena Dr. Abdullah Kammani


Taif University Taif University
21 PUBLICATIONS   263 CITATIONS    27 PUBLICATIONS   292 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Knowledge Infrastructure for KM Capability View project

Electronic Services for Governance and Government View project

All content following this page was uploaded by Dr. Abdullah Kammani on 24 September 2014.

The user has requested enhancement of the downloaded file.


Quranic Computation
A Review of research and application

Rahmath Safeena Abdullah Kammani


College of Computers and Information Technology,
College of Computers and Information Technology, Taif university, Taif, Saudi Arabia
Taif university, Taif, Saudi Arabia
e-mail: abdullah.k@tu.edu.sa
e-mail: r.safeena@tu.edu.sa

Abstract— The Noble Qur’an is considered to be the central classification of articles in order to explore how the
religious text of Islam. Any linguistic or literary research with development of Quranic Computations has been evolved and
the use computational technologies of this text is benefitted by in which directions is need to be focused in to. The paper
billions of people around the world. It has been observed that covers journal articles, conference proceedings and
this research approach of Qur’anic Computation has strongly dissertations. Based on the scope of 39 articles from
established its base in the research and application. This study academic journals, this paper notice Quranic Computation to
reviews the evolution of computational effort on the noble book be evolved from two distinct complimentary directions of
of Quran, both from the research and application point of studies like computation of a) General Arabic and b) Quranic
view. The purpose of this review has been achieved through an
Arabic. A surveys and analyzes of these two streams of
exploratory study of several research literatures and various
applications documentation. Based on this objective, the study
studies reveals few research gaps and suggest some future
notice that Quranic Computation developed through various directions of the field.
researches and application has common goals of achieving easy
III. LITERATURE REVIEW
understanding of the Quran, but have chosen distinct
complementary methodology and techniques to achieve it. A One of the techniques used in Quranic computation is
snowball technique of collection, classification and computational linguistics. Computational linguistics is an
categorization of articles or documents from 1997 to 2011 has interdisciplinary field dealing with the statistical and/or rule-
adopted in the review. based modeling of natural language from a computational
perspective. This modeling is not limited to any particular
Keywords- Quranic Computing, Computational Linguistics, field of linguistics, but research on Quranic computation is
Quranic Arabic, Review Paper. an imperative. The importance of this is that the Qur’an
contains many classical words and the writing style is very
I. INTRODUCTION
different from modern standard Arabic. It is especially
The Quran is held by Muslims to be a single-authored important to preserve the correctness of words in this sacred
text, the direct words of God (Allah), conveyed by the angel book of the Muslims [1].
Gabriel to Prophet Muhammed (Peace be up on him). Arabic
words are known to have complex morphological structure A. Computation Linguistics
[1]. Quran is the central religious text of Islam. The Noble Intelligent natural language processing is based on the
Quran is one of the well-known books in the world which science called computational linguistics. Computational
has been descended in Arabic and most of peoples would linguistics might be considered as a synonym of automatic
like to understand more about this book [2]. Arabic is a processing of natural language, since the main task of
highly inflected language, with nouns and verbs taking computational linguistics is just the construction of computer
different morphological forms according to their role in a programs to process words and texts in natural language. [6].
sentence. Traditional grammar is familiar to native speakers Recent Computational Linguistics research incorporates
who have studied Arabic formally, and many books have statistical techniques as well as knowledge-based techniques.
been written about the language of the Quran which explain It is an approach to linguistics that employs methods and
the text in terms of traditional grammar [3-5]. However, the techniques of computer science. A formal, rigorous,
intent of this paper is to analyze various researches and computationally based investigation of questions that are
development done based on the Noble Qur’an. traditionally addressed by linguistics [7].
Computational linguistics (CL) is at the crossroads of
II. METHODOLOGY linguistics and computer science. Applied CL focuses on
The research method used in this paper is the reviews of developing practical applications that have some facility with
different literatures on Quranic computation from 1997 to human language. Applications using CL research which are
2011.Article were found via the computerized search of the currently available include: voice recognition software, web
selected topic. This paper surveys the development of search engines, word processors (spell checkers, grammar
Quranic Computation using a literature review and checkers), machine translation systems (automatic language
translation). There are many more exciting applications

Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences
December 22 – 25, 2013, Madinah, Saudi Arabia
1 - 218
currently under development; multilingual information senses, their LR flexibly relates morphologically and PoS-
retrieval, information extraction, colloquial machine tags constrained Arabic lexical compounds to a predefined
translation and systems that create digests of newspapers, limited set of semantic fields across which the standard
journals and magazines. semantic relations are defined. With the aid of the same
Computation linguistics application on Noble Qur’an so large-scale Arabic morphological analyzer and PoS tagger in
far taken two distinct complimentary directions; A) General the runtime, the possible senses of virtually any given Arabic
Arabic, B) Quranic Arabic word are retrievable.
Habash and Rambow [14] developed MAGEAD, a
B. General Arabic morphological analyzer and generator for the Arabic
Arabic is acquiring attention in the natural language language; which decomposes word forms into the templatic
processing (NLP) community because of its political morphemes and relates morphemes to strings.
importance and the linguistic differences between it and Buckwalter [15] designed Buckwalter morphological
European languages. These linguistic characteristics, analyzer that uses a concatenative lexicon-driven approach
especially complex morphology, present interesting where morphotactics and orthographic rules are built directly
challenges for NLP researchers [8].Recent computational into the lexicon. The system has three components: the
advances have made possible annotating the Quran to very lexicon, the compatibility tables and the analysis engine. An
high accuracy [9] Arabic word is viewed as a concatenation of three regions, a
One of the main goals of Arabic Natural Language prefix region, a stem region and a suffix region. The prefix
Processing is effective document retrieval. For example, if and suffix regions can be null. Prefix and suffix lexicon
query is input through a search engine, the relevant entries cover all possible concatenations of Arabic prefixes
document retrieved must be based on either the root or the and suffixes, respectively. For every lexicon entry, a
stem of the word. Therefore, the goals of most Arabic morphological compatibility category, an English gloss and
morphological analyzers and stemming engines are to extract occasional part-of-speech (POS) data are specified.
the root and/or stem of a word. Habash [16] described an approach to automatic source-
Bielicky and Smarz [10] in their work describes the language syntactic preprocessing in the context of Arabic-
building of a valency lexicon of Arabic verbs using a English phrase-based machine translation. Source-language
morphologically and syntactically annotated corpus, the labeled dependencies, that are word aligned with target
Prague Arabic Dependency Treebank. Their work is built on language words in a parallel corpus, are used to
‘Functional Generative Description (FGD)’ theory where automatically extract syntactic reordering rules.
verbs have valency frame with many complements known as Belkredim1and El Sebai [17] used the derivations of
functors which can further be divided into actants (Actor, Arabic verbs and their patterns to structure the Arabic
Addressee, Patient, Effect and Origin) and adjuncts (like language and to link the words' morphology to their
Manner , Means and Location). This FGD concept was semantics. The model developed in their research was based
adapted for Arabic verbs [11]. on an Ontology using the derivation rules of the Arabic
Al-Qahtani [12] gives an extensive categorization of language. The model was evaluated by linguists to validate
modern standard Arabic verb valence based on Case its applicability.
Grammar (CG). According to this matrix five cases (Agent,
Experience, Benefactive, Object, and Locative) are plotted C. Quranic Arabic
horizontally and type of verb (State, Process, Action) The Qur’an is a classical book and the language is in the
vertically. The data was taken from 8327 verbs from a traditional Arabic known as i’rab [1].The Qur'an has the
lexicon and most frequent 200 verbs were exhaustively advantage of being a closed corpus in the following senses:
sorted to a cell in the matrix. First, it demonstrates a frequent repetition of structures,
Salem [8] developed a rule-based lexical framework for indeed of the same phrases, to the extent of what may be
Arabic language processing using the Role and Reference considered formulaic style. Second, the Qur'an is
Grammar linguistic model. A system, called UniArab is traditionally identified with one person, a specific region,
introduced to support the framework. UniArab utilizes an and a certain period of time and its volume is relatively
XML-based implementation of elements of the Role and restricted. These two facts justify treatment of the Qur'an as
Reference Grammar theory, and its representations for the an independent corpus which deserves an independent study
universal logical structure of Arabic sentences. The UniArab of its language in general and syntax in particular [18].
system for Modern Standard Arabic (MSA) takes MSA as Understanding the Quran is a grand challenge for society,
input in the native orthography, parses the sentence(s) into a for western public education, for Muslim-world education,
logical meta-representation, and generates a grammatically for knowledge representation and reasoning, for knowledge
correct English output with full agreement and extraction from text, for systems robustness and correctness,
morphological resolution. for online collaboration. Understanding the Quran is a major
Attia, Rashwan, Ragheb, Al-Badrashiny, Al-Basoumy, new Grand Challenge for Computer Science and Artificial
& Abdou, [13] designed and implemented an Arabic lexical Intelligence [19].
semantics Language Resource (LR) that enables the retrieval Dukes, Atwell, and Sharaf [20] are working on the
of the possible senses of any given Arabic word at a high Quranic Arabic corpus that is a resource which provides
coverage. Instead of tying full Arabic words to their possible morphological annotation and syntactic analysis using

Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences
December 22 – 25, 2013, Madinah, Saudi Arabia
1 - 219
dependency grammar. The Quranic Arabic Corpus is a Dukes, Atwell, and Habash [25] presented a new
collaboratively constructed linguistic resource initiated at the approach to linguistic annotation of an Arabic corpus: online
University of Leeds, with multiple layers of annotation supervised collaboration using a multi-stage approach. The
including part-of-speech tagging, morphological different stages include automatic rule-based tagging, initial
segmentation [21] and syntactic analysis using dependency manual verification, and online supervised collaborative
grammar [22]. proofreading.
Dukes [26] presented LOGICON, an end-to-end system
TABLE I. STUDIES ON ARABIC COMPUTING using partial parsing, which assigns novel semantic
Auth Focus Method Result structures to natural language text. A syntactic tagging
or scheme is proposed which is closely aligned to the
[8] Arabic to English RRG theory, XML UniArab
corresponding semantics. Syntax-driven approach is used to
translation based metadata, system derive semantic roles through recursion. Given a simple
Java programming sentence focusing around an event, LOGICON attempts to
and Interlingua identify roles for the actor (who did the event), the action
design for machine (what the event was) and the target (what entity the actor
Translation.
performed the action on).
[13] To retrieve the Arabic Arabic Sharaf and Atwell [11] is in the work of designing a
sense of any morphological lexical Knowledge Representation (KR) model for the Quran
given Arabic analyzer and PoS semantics leveraging on the concept of ‘frame semantics’. They aim to
word. tagger Language
Resource
build FrameNet like lexicon for the verbs in the Quran. This
(LR) initial attempt will enable future extension to include
predicates other than verbs and to consider other classical
[12] Categorization of Data was taken Verbs were Arabic texts as well as Modern Standard Arabic.
modern standard from a lexicon exhaustivel
Arabic verb y sorted to
Shenassa and Khalvandi [2] designed a system to analyze
valence based on a cell in the the quality of translating a text; that is to evaluate different
Case Grammar matrix. English translations of Quran using tools and concepts such
as pos-tagging, natural language processing, computational
[10] Build a valency Morphologically Prague
lexicon of Arabic and syntactically Arabic linguistic and machine learning. To do such an evaluation,
verbs annotated corpus Dependenc each verb process type in translated text is compared with it's
y Treebank of Quranic text. The system uses Halliday Grammar, which
is a useful theory for analyzing a formal text, to do this. It
[14] Morphological Morphological MAGEAD
analyzer and analyzer has been assumed that each verb in the text of Quran has
generator for the been tagged manually. At the other hand, to detect each verb
Arabic language process type in translated English texts, a tagger is used to
[15] Morphological Morphological Buckwalter
detect and tag each verb based on Halliday grammar. The
analysis analyzer morphologi best translation is one in which the number of similar verb
cal process type in the source and translated text is maximum.
analyzer Dukes and Buckwalter [22] had shaped up a Dependency
Treebank of the Quran using Traditional Arabic Grammar.
The Quranic Arabic Dependency Treebank (QADT) uses
Al-Yahya, Al-Khalifa, Bahanshal, Al-Odah, and Al-
XML to represent the syntax of verses from the Quran and a
Helwah [23] proposed a computational model for
Java object model is provided with the Treebank as an API
representing Arabic lexicons using ontologies. The model is
to query the data. The Treebank also introduces the novel
based on the field theory of semantics from the linguistics
approach of displaying Quranic syntax using dependency
domain, and the data which drives the design of the model is
graphs, which show how each word in a sentence is related
obtained from the most accurate text that presents superiority
and what role it plays in building up a complete syntactic
and perfection of the Arabic language, the Noble Quran.
structure. This differs from other Arabic Treebank by
Dror, Shaharabani, Talmon, and Wintner, [18] devised a
providing a deep linguistic model of traditional Arabic
computational system for morphological analysis and
grammar. This Treebank is a part of the Arabic Corpus. It
annotation of the Qur'an, and Talmon and Wintner devised
also includes information like the root for each word, a
morphological tagging of the Qur’an [24]for research and
word-by-word interlinear translation into English, and an
teaching purposes. These systems facilitate a variety of
automatically generated phonetic transcription.
queries on the Quranic text that make reference to the words
Thabet [27] developed a methodology which results in
and their linguistic attributes. The core of the system is a set
understanding the Qur’an on the basis of its lexical
of finite-state based rules which describe the morpho-
semantics. It discovers the thematic structure (thematic
phonological and morpho-syntactic phenomena of the
interrelationships among the suras (chapters) ) of the Qur’an
Quranic language. The results are stored in an efficient
based on a fundamental idea in data mining and related
database and are accessed through a graphical user interface
disciplines: like collection of texts, the lexical frequency
which facilitates the presentation of complex queries.
profiles of the individual texts are a good indicator of their

Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences
December 22 – 25, 2013, Madinah, Saudi Arabia
1 - 220
conceptual content, and thus provide a reliable criterion for relies on a subject matter expert (SME) to populate the
their classification relative to one another. ontology with terms and their properties).
Noordin and Othman [28] proposed a system design for Shoaib et al [33] addresses the deficiencies of key word
retrieving Quranic texts and any knowledge that derived or based searching and the issues related to semantic search in
cites al-Quran. They surveyed the 125 websites offering the Noble Quran, and proposed a model that is capable of
access to Quranic texts on their structure and linkages. performing semantic search. The model exploits WordNet
Findings revealed that the websites offer texts and relationships in relational database model; that is exploits the
translation, recitation, excerpt of exegesis, and link to other relational schema for the purpose of WordNet. The
websites consisting of news, events, and related topics. A implementation of this model has been carried out using
standard structure was not implemented by these websites SQL Server 2005 and VB.Net on Surah Al-Baqarah. The
and thus proposed a system design which focuses on texts, precision of the model's prototype implementation is far
translation, recitation, exegesis, al-Hadith, its topics and better than simple key word searching.
themes like stories of the prophets and places mentioned in
al-Quran, and search feature. IV. TECHNIQUES USED IN QURANIC COMPUTING
Al-Yahya et al [29] developed an ontological model A. Natural Language Processing(NLG)
based on Semantic Web technologies for representing
computational lexicons using the field theory of semantics It is used to provide concise English and Arabic
and componential analysis which provides the foundation for summaries of the inflection features stored in the Quranic
a dynamic and collaborative computational lexicon. The linguistic database [22]. Natural Language Processing (NLP)
ontological structure represents word semantics using the technology is a significant component in Semantic Web tool.
atomic components (features) of words and used the recent NLP is one branch of the linguistics, which uses the
W3C standard for representing ontologies, Web Ontology computer technology to realize human language processing
Language (OWL) for shared and open access to such a effectively. Its ultimate objective is to automatically
resource. In this method they limited the vocabulary to those understand human language with the support of artificial
words which exists in the Noble Quran. intelligence technology. It is also called as natural language
Kotb et al. [30] demonstrate the significance of XML understanding and sometimes is used to transform
semantics checker algorithm to check the semantic information to Semantic Web data. Traditional information
consistencies of the XML file of the Noble Quran book retrieval also can be turned into knowledge discovery [34].
found in Religion 2.0 website. They checked the semantic B. Treebank and Syntactic annotaion
consistency by attaching semantic information to XML
A Treebank is a linguistic resource which collects
element tag attributes. They automatically checked if the
together syntactic trees. These are manually annotated
number of verses in each chapter of the XML Noble Quran
analyses of sentences which can be read both by humans and
book as well as the number of chapters is semantically
computers, with different Treebank adopting different
correct. The system has successfully counted the number of
theories of syntax. Previous syntactic work includes the three
verses and chapters as in the real Noble book. This result was
major Arabic Treebank that have been recently developed:
achieved by using the SLXS Specification Language for
the Penn Arabic Treebank [35], the Prague Arabic
XML Semantics to specify the semantic rules associated
Dependency Treebank (PADT) and the Columbia Arabic
with XML attributes.
Treebank (CATiB)[36].
Moisl [31] proposed a model based on calculation of a
Penn Arabic Treebank annotation consists of two phases:
minimum Quran Sura (chapter) length threshold using
(a) Morphological/Part-of-Speech (=POS) tagging which
concepts from statistical sampling theory followed by
divides the text into lexical tokens and includes
selection of Sura and lexical variables based on that
morphological, morphosyntactic and gloss information, and
threshold. He applies the proposed solution to a reanalysis of
(b) Syntactic analysis referred to as Arabic Treebanking
the Quran and found that the higher the threshold, the larger
(=Arabic TB) which characterizes the constituent structures
the number of variables on which clustering can be based,
of word sequences, provides function categories for each
and the smaller the number of Sura that can be clustered.
non-terminal node, and identifies null elements, co-
Al-Khalifa et al [32] presents a work-in-progress project
reference, traces, etc [37].
for building a computerized framework that exploits the
Prague Arabic Dependency Treebank consists of multi-
power of semantic web technologies and natural language
level linguistic annotations over the language of Modern
processing, for recognizing and identifying semantic
Standard Arabic, and provides a variety of unique software
opposition terms using Natural Language processing armed
implementations designed for general use in Natural
with domain ontologies. The SemQ is a framework that takes
Language Processing (NLP) [38].
Quranic verse as an input and outputs the list of semantically
The Columbia Arabic Treebank (CATiB) is a database of
opposed words in the verse along with their degree of
syntactic analyses of Arabic sentences. CATiB avoids the
opposition. The framework architecture consists of two
annotation of redundant linguistic information that is
major components: the domain ontology (to mimic how the
determinable automatically from syntax and morphological
human brain keeps the semantics stored.) and the SemQ Tool
analysis, e.g., nominal case. And secondly, CATiB uses
(the tool works automatically to identify semantically
opposite terms and works as a manual identification tool that

Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences
December 22 – 25, 2013, Madinah, Saudi Arabia
1 - 221
linguistic representation and terminology inspired by the [26] Partial parsing, Syntax-driven LOGICON
long tradition of Arabic syntactic studies [36]. which assigns novel approach is used to
semantic structures derive semantic
Syntactic annotation in the dependency framework to natural language roles through
involves two types of inter-related decisions: attachment and text recursion
labeling. The attachment of one word to another indicates [11] Build FrameNet Using the concept Knowledge
that there is a syntactic relationship between the head word like lexicon for the of ‘frame Representatio
and the dependent word (and the subtree it heads). The labels verbs in the Quran. semantics’. n (KR) model
specify the type of the attachment. For example, the relation, for the Quran
[27] thematic structure Data mining and understanding
subject, may label the attachment of a dependent noun to a of lexical semantics the
heading verb, where the noun is the subject of the verb [36]. the Qur’an Qur’an on the
Pajas and Stepank [39] presented recent advances in an basis of its
established Treebank annotation framework comprising of lexical
an abstract XML based data format, fully customizable semantics
editor of tree-based annotations, a toolkit for all kinds of [33] Semantic search SQL Server 2005 WordNet
and VB.Net based
automated data processing with support for cluster relational
computing, and a work-in-progress database-driven search model
engine with a graphical user interface built into the tree [23] Computational field theory of Ontological
editor. model for semantics from the model
representing Arabic linguistics domain
V. DISCUSSIONS lexicons using
ontologies.
Most recent research done in the field of Quranic [29] Time nouns from Semantic web Ontological
Computing can be classified as: Information Retrieval, the Holy Quran are technologies data-driven
Speech Recognition, Optical Character Recognition, used to derive the model
Morphology Analysis, Semantic checking, Educational resulting
ontological
Applications [32] Quranic Corpus[22] . The Noble Quran, structure
due to its unique style and allegorical nature, needs special
attention about searching and information retrieval issues. A. Research Gaps
The legacy keyword searching techniques are incapable of Although Arabic is the language of over two hundred
retrieving semantically relevant verses [33]. million speakers, little has been achieved in regards to
TABLE II. STUDIES ON QURANIC COMPUTING
computational Arabic resources [23]. There is a need an
objective, impartial computation of the noble Quran based on
Author Focus Method Results the Quranic word and other authentic source of Sunnah.
[20] Quranic Arabic Morphological Quranic tree Some of the research gaps that need to focused are 1) Most
corpus analysis, and bank
syntactic analysis
of the Muslims are ignorant of the deeper meanings in the
[22] Quranic corpus Uses XML to Quranic Quran, in spite of learning the sounds of the verses. An
represent the syntax Arabic authentic Quran Expert System could help them question and
of verses from the Dependency understand the teachings of the Quran for themselves. 2)
Quran and a Java Treebank Present-day systems can provide response to the questions
object model is (QADT) from the source text, but many potential questions are more
provided with the
Treebank as an API difficult and contentious to answer via text-match, requiring
to query the data a new Knowledge Representation and Reasoning formalism
[25] linguistic annotation Online supervised The different capable of capturing complex, subtle knowledge encoded in
of an Arabic corpus collaboration using stages include this Classical Arabic text, and inferencing in new ways
a multi-stage automatic which mirror the thousand-year–old traditions of scholarly
approach. rule-based
tagging, initial
analysis and interpretation.
manual B. Future Directions
verification,
and online The Quran stands out as the source of large collection of
supervised analysis and interpretation for Islamic Teaching. In future,
collaborative application of Artificial Intelligence, knowledge extraction
proofreading.
and knowledge representation techniques would certainly
[18] Computational morphological Database that
system for research analysis and can be give way easy and clear understanding of this splendid
and teaching annotation accessed scripture. This effort will lead to some computational results
through a GUI are that will shed new light on traditional interpretations,
which thus adding to the canon of Islamic wisdom. The Muslim
facilitates the believes the Noble Quran as free of any alterations or
presentation
of complex
variations. The Computation effort would surely reinstate
queries. this belief with sound inference and interpretation of Quran.

Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences
December 22 – 25, 2013, Madinah, Saudi Arabia
1 - 222
VI. CONCLUSION [19] E. Atwell, K. Dukes, A. Sharaf, and N. Habash, Understanding the
Quran: A new Grand Challenge for Computer Science and Artificial
Computation of Quranic Arabic is a unique challenge, Intelligence, Edinburgh: 2010.
because of the vocabulary and morphological variation it has [20] K. Dukes, E. Atwell, and A. Abdul-Baquee M. Sharaf, “Syntactic
from the modern standard Arabic. Any use of computational Annotation Guidelines for the Quranic Arabic Treebank,” LREC-
technology on this linguistic or literary investigation would 2010, Malta: 2010, p. 4.
definitely help in easy understanding the Quranic text. [21] K. Dukes and N. Habash, “Morphological Annotation of Quranic
Therefore this review paper is intended to be base for Arabic,” LREC 2010, Valletta, Malta: 2010.
research on novel computational approach toward the Noble [22] K. Dukes and T. Buckwalter, “A Dependency Treebank of the Quran
using Traditional Arabic Grammar,” INFOS 2010, Cairo, Eygpt:
Quran. 2010.
ACKNOWLEDGMENT [23] M. Al-Yahya, H. Al-Khalifa, A. Bahanshal, I. Al-Odah, and N. Al-
Helwah, “An Ontological Model for Representing Semantic
The authors would like to thank all researchers who have Lexicons: An Application on Time Nouns in the Holy Quran,” The
sent their articles for our analysis and study on request. Arabian Journal for Science and Engineering, vol. 35, 2010, pp. 22-
35.
REFERENCES [24] R. Talmon and S. Wintner, “Morphological Tagging of the Qur’an,”
EACL'03 Workshop, Budapest, Hungary: 2003.
[1] R.J. Raja Yusof,, R. Zainuddin, M.S. Baba, and Z. Mohd. Yusoff,
“Qur'anic Words Stemming,” The Arabian Journal for Science and [25] K. Dukes, E. Atwell, and N. Habash, “Supervised Collaboration for
Engineering, vol. 35, 2010, pp. 37-49. Syntactic Annotation of Quranic Arabic,” 2010.
[2] M. Shenassa and M. Khalvandi, “Evaluation of Different English [26] K. Dukes, “Logicon: A System for Extracting Semantic Structure
Translations of Holy Koran in Scope of Verb Process Type,” ICTTA Using Partial Parsing,” RANLP-2009, Borovets, Bulgaria: 2009, p. 5.
2008, Damascus: 2008, pp. 1-4. [27] N. Thabet, “Understanding the Thematic Structure of the Qur’an: An
[3] H. Ansari, Learning the Language of the Quran, Centre of Religious Exploratory Multivariate Approach,” Proceedings of the ACL Student
Studies and Guidance., 1997. Research Workshop, Michigan: Association for Computational
Linguistics, 2005, pp. 7-12.
[4] A. Jones, Arabic Through the Quran, Islamic Texts Society, 2005.
[28] M. Noordin and R. Othman, “An Information Retrieval System for
[5] J. Rafai, Basic Quranic Arabic Grammar, Ta Ha Publishers, 2004. Quranic Texts: A Proposed System Design,” 2006.
[6] I.A. Bolshakov and A. Gelbukh, Computational Linguistics: Models, [29] M. Al-Yahya, H. Alkhalifa, A. Bahanshal, I. Alodah, and N. Al-
Resources, Applications, Mexico: Dirección de Publicaciones, 2004. Helwah, “An Ontological Model for Representing Computational
[7] S. Wintner, “Computational Linguistics,” 2005. Lexicons: A Componential Based Approach,” NLP-KE-2010,
[8] Y. Salem, “A Generic Framework for Arabic to English Machine Beijing, China: 2010, pp. 1-6.
Translation of Simplex Sentences Using the Role and Reference [30] Y. Kotb, K. Gondow, and T. Katayama, “A Case Study for XML
Grammar Linguistic Model,” Master Thesis, School of Informatics Semantics Checker Model,” Washington, DC: 2003, pp. 4834 - 4839.
and Engineering at the Institute of Technology Blanchardstown, [31] H. Moisl, “Sura Length and Lexical Probability Estimation in Cluster
2009. Analysis of the Qur'an,” ACM Transactions on Asian Language
[9] K. Dukes, “Computational Analysis of the Quran through Traditional Information Processing, vol. 8, 2009, pp. 1-19.
Arabic Linguistics,” 2011. [32] H.S. AI-Khalifa, M. Al-Yahya, A. Bahanshal, and I. Al-Odah,
[10] V. Bielicky and O. Smarz, “Building the Valency Lexicon of Arabic “SemQ: A Proposed Framework for Representing
Verbs,” LREC 2008, Marrakech, Morocco: 2008, pp. 2300-2307. Semantic Opposition in the Holy Quran using Semantic Web
[11] A. Sharaf and E. Atwell, “Knowledge Representation of the Quran Technologies,” CTIT-2009, Dubai: 2009, pp. 1-4.
Through Frame Semantics: A Corpus-Based Approach,” Corpus [33] M. Shoaib, M.N. Yasin, K. Hikmat Ullah, M.I. Saeed, and M.S.H.
Linguistics-2009, University of Liverpool: 2009, p. 12. Khiyal, “Relational WordNet model for semantic search in Holy
[12] D. Al-Qahtani, Semantic Valence of Arabic Verbs, Librairie du Liban Quran,” Islamabad, Pakistan: 2009, pp. 29-34.
Publishers, 2005. [34] M. Beseiso, A.R. Ahmad, and R. Ismail, “A Survey of Arabic
[13] M. Attia, M. Rashwan, A. Ragheb, M. Al-Badrashiny, H. Al- Language Support in Semantic Web,” International Journal of
Basoumy, and S. Abdou, “A Compact Arabic Lexical Semantics Computer Applications, vol. 9, 2010, pp. 35-40.
Language Resource [35] A. Bies and M. Maamouri, Penn Arabic Treebank Guidelines,
Based on the Theory of Semantic Fields,” GoTAL 2008, Gothenburg, Philadelphia: University of Pennsylvania, 2003.
Sweden: 2008. [36] N. Habash, R. Faraj, and R. Roth, “Syntactic Annotation in the
[14] N. Habash and O. Rambow, “MAGEAD: A Morphological Analyzer Columbia Arabic Treebank,” MEDAR, Cairo, Eygpt: 2009.
and Generator for the Arabic Dialects,” Sydney: Association for [37] M. Maamouri, A. Bies, and S. Kulik, “Creating a Methodology for
Computational Linguistics, 2006, pp. 681-688. Large-Scale Correction of Treebank Annotation: The Case of the
[15] T. Buckwalter, “Buckwalter Arabic Morphological Analyzer Version Arabic Treebank,” MEDAR, Cairo, Eygpt: 2009, p. 7.
1.0.,” University of Pennsylvania: 2002. [38] J. Hajic, O. Smarz, P. Zemanek, J. Snaidauf, and E. Beska, “Prague
[16] N. Habash, “Syntactic Preprocessing for Statistical Machine Arabic Dependency Treebank: Development in Data and Tools,”
Translation,” MT-Summit, Copenhagen, Denmark: 2007, p. 8. NEMLAR, Cairo, Eygpt: 2004, pp. 110-117.
[17] F. Be1krediml and A. El Sebai, “An Ontology Based Formalism for [39] P. Pajas and J. Stepanek, “Recent Advances in a Feature-rich
the Arabic Language Using Verbs and their Derivatives,” Framework for Treebank Annotation,” CoLing 2008, Manchester:
Communications of the IBIMA, vol. 11, 2009, pp. 44-52. 2008, pp. 673-680.
[18] J. Dror, D. Shaharabani, R. Talmon, and S. Wintner, “Morphological
Analysis of the Qur'an,” Literary and Linguistic Computing, vol. 19,
2004, pp. 431-452.

Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences
December 22 – 25, 2013, Madinah, Saudi Arabia
1 - 223

View publication stats

Potrebbero piacerti anche