Finding the Similarity between Two Arabic Texts

Suhad Malallah Kadhem; Aseel Qassim Abd Alameer

Authors

Suhad Malallah Kadhem, Department of Computer Sciences, University of Technology, Baghdad, Iraq
Aseel Qassim Abd Alameer, Department of Computer Sciences, University of Technology, Baghdad, Iraq

Keywords:

Arabic Text Similarity, Semantic Similarity, Keyword Extraction, N-Gram, Cosine Similarity Measure, Dice's Similarity Measure

Abstract

Calculating the similarity between texts written in one language or in multiple languages is still one of the most important challenges facing natural language processing. This work presents several approaches used for text similarity. The proposed system finds the similarity between two Arabic texts using a hybrid of similarity measures: a semantic similarity measure, the cosine similarity measure, and N-gram matching (using Dice's similarity measure). As part of the proposed system, we design an Arabic SemanticNet that stores the keywords of a specific field (computer science); with this network, the semantic similarity between words can be computed according to specific equations. The cosine and N-gram similarity measures are used to find similar character sequences. The proposed system was implemented in Visual Basic 2012, and testing showed it to be effective at finding the similarity between two Arabic texts in terms of both accuracy and search time.
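
The two surface-level measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' Visual Basic 2012 implementation; it is written in Python, and the function names, the default bigram size (n = 2), the whitespace tokenization, and the choice to compute the cosine over word counts are all assumptions made for illustration. The semantic measure is not sketched because the abstract does not give the SemanticNet equations.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text: str, n: int = 2) -> list[str]:
    """Return the character n-grams of text (whitespace collapsed to single spaces)."""
    s = " ".join(text.split())
    return [s[i:i + n] for i in range(len(s) - n + 1)]


def dice_similarity(a: str, b: str, n: int = 2) -> float:
    """Dice coefficient over sets of character n-grams: 2|A ∩ B| / (|A| + |B|)."""
    x, y = set(char_ngrams(a, n)), set(char_ngrams(b, n))
    if not x and not y:
        return 0.0  # assumption: two texts with no n-grams are scored as dissimilar
    return 2 * len(x & y) / (len(x) + len(y))


def cosine_similarity(a: str, b: str) -> float:
    """Cosine of the angle between the word-count vectors of the two texts."""
    u, v = Counter(a.split()), Counter(b.split())
    dot = sum(u[w] * v[w] for w in u.keys() & v.keys())
    norm = sqrt(sum(c * c for c in u.values())) * sqrt(sum(c * c for c in v.values()))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    # Hypothetical example pair: "computer science" vs. "computer sciences".
    t1, t2 = "علم الحاسوب", "علوم الحاسوب"
    print(dice_similarity(t1, t2), cosine_similarity(t1, t2))
```

Both functions return a value between 0 and 1. Python strings are sequences of Unicode code points, so Arabic text is handled without special casing; a real system would likely also normalize letter variants and strip diacritics before comparison, which this sketch omits.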
