Skip to content
forked from parthg/clpd-kbs

Code used in "Methods for cross-language plagiarism detection" of KBS paper.

Notifications You must be signed in to change notification settings

bahar67/clpd-kbs

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Author: Parth Gupta (pgupta@dsic.upv.es) Date: June, 2013.

Version: v 1.0

Add following .jar files in /lib directory.

* antlr.jar
* icu4j-4_6.jar
* Nemo-20140414.jar
* nlel-20130326.jar
* terrier-3.5-core.jar
* trove-3.0.1.jar

Terrier will probably more .jar files. You can safely add all the .jar files from terrier's lib/ to clpd-kbs/lib and add to your build-path.

It contains code used in experiement of the KBS paper for cross-language plagiarism detection. The programs are written for research purpose only. If you have any questions feel free to start an issue on GitHub. The programs are written for research purpose only.

Alberto Barrón-Cedeño, Parth Gupta, Paolo Rosso: Methods for cross-language plagiarism detection. Knowl.-Based Syst. 50: 211-217 (2013)

To Get Started:

  1. Expt A with CNG

The programs to run are i) es.upv.nlel.cng.PrepareCollectionExptA - Will prepare the character n-gram data ii) es.upv.nlel.cng.CLCNGExptA - Will calculate similarity between source and target documents and generate output file iii) es.upv.nlel.eval.AnalyseRecall - Measure recall for different types and length of experiments

All rights reserved. Copyright (C) 2011 Parth Gupta

About

Code used in "Methods for cross-language plagiarism detection" of KBS paper.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published