Finding the Similarity between Two Arabic Texts

Authors

  • Suhad Malallah kadhem Department of Computer Sciences, University of Technology, Baghdad, Iraq
  • Aseel Qassim Abd Alameer Department of Computer Sciences, University of Technology, Baghdad, Iraq

Keywords:

Arabic Text Similarity, Semantic Similarity, Keyword Extraction, N-Gram, Cosine Similarity Measure, Dice's Similarity Measure

Abstract

Calculating similarities between texts that have been written in one language or multiple languages still one of the most important challenges facing the natural language processing. This work offers many approaches that used for the texts similarity. The proposed system will find the similarity between two Arabic texts by using hybrid similarity measures techniques: Semantic similarity measure, Cosine similarity measure and N-gram ( using the Dice similarity measure). In our proposed system we will design Arabic SemanticNet that store the keywords for a specific field(computer science), by this network we can find semantic similarity between words according to specific equations. Cosine and N-gram similarity measures are used in order to find the similar characters sequences. The proposed system was executed by using Visual Basic 2012, and after testing it, it proved to be a worthy for finding the similarity between two Arabic texts (From the viewpoint of accuracy and search time).

Downloads

Download data is not yet available.

Downloads

Published

2022-01-25

Issue

Section

Computer Science

How to Cite

Finding the Similarity between Two Arabic Texts. (2022). Iraqi Journal of Science, 58(1A), 152-162. https://ijs.uobaghdad.edu.iq/index.php/eijs/article/view/6236

Similar Articles

11-20 of 627

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)