RNTI

An alternative method for detecting "copy/paste": text intersection and construction of maximal common sequences
In EGC 2015, vol. RNTI-E-28, pp.71-76
Abstract
Plagiarism detection most commonly relies on the most naive phase of similarity search: the detection of copy-and-paste. In this paper, we propose an alternative to the standard verbatim comparison approach. The idea is to intersect two texts to obtain a table of common words, and then to keep only the maximal sequences of consecutive words in one text that also exist in the other. We show that this method is faster and uses less memory than commonly used text-scanning methods. The goal is to detect identical passages between two texts faster than verbatim comparison methods, while operating more efficiently than n-gram approaches.
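
The abstract does not spell out the algorithm, so the following Python sketch is only one plausible reading of the idea: intersect the vocabularies of two texts, then scan one text greedily from left to right and keep the longest runs of consecutive words that also appear consecutively in the other. The names `tokenize`, `common_sequences`, and the `min_len` threshold are assumptions made for illustration, not part of the paper, and the sketch makes no claim about the speed or memory results reported by the authors.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens (hypothetical helper)."""
    return re.findall(r"\w+", text.lower())


def common_sequences(text_a, text_b, min_len=3):
    """Return runs of consecutive words of text_a that also occur in text_b.

    Greedy illustration only: min_len is an assumed threshold, and the
    runs reported are non-overlapping in text_a.
    """
    a = tokenize(text_a)
    b = tokenize(text_b)

    # Step 1: intersection of the two texts, i.e. the table of common words.
    common = set(a) & set(b)

    # Index where each common word occurs in text_b.
    positions = defaultdict(list)
    for j, word in enumerate(b):
        if word in common:
            positions[word].append(j)

    # Step 2: scan text_a and extend each candidate match as far as possible.
    runs = []
    i = 0
    while i < len(a):
        best = 0
        for j in positions.get(a[i], ()):
            k = 0
            while i + k < len(a) and j + k < len(b) and a[i + k] == b[j + k]:
                k += 1
            best = max(best, k)
        if best >= min_len:
            runs.append(" ".join(a[i:i + best]))
        # Skip past the run just found, or past a word with no match.
        i += max(best, 1)
    return runs


if __name__ == "__main__":
    doc_a = "The quick brown fox jumps over the lazy dog today"
    doc_b = "Yesterday the quick brown fox slept while a cat jumps over the lazy dog"
    for run in common_sequences(doc_a, doc_b):
        print(run)
```

Words absent from the intersection are skipped in a single step, which is where an approach of this kind could save work compared with comparing every position of one text against the other; whether this matches the gains described in the paper is not something the sketch can confirm.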