KASPRZAK, Jan and Michal BRANDEJS. Improving the Reliability of the Plagiarism Detection System. Online. In Notebook Papers of CLEF 2010 LABs and Workshops. Padova: University of Padova, 2010, p. 1-10. ISBN 978-88-904810-0-0.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Improving the Reliability of the Plagiarism Detection System
Authors KASPRZAK, Jan (203 Czech Republic, guarantor, belonging to the institution) and Michal BRANDEJS (203 Czech Republic, belonging to the institution).
Edition Padova, Notebook Papers of CLEF 2010 LABs and Workshops, p. 1-10, 10 pp. 2010.
Publisher University of Padova
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Italy
Confidentiality degree is not subject to a state or trade secret
Publication form electronic version available online
WWW URL
RIV identification code RIV/00216224:14330/10:00045065
Organization unit Faculty of Informatics
ISBN 978-88-904810-0-0
ISSN 2038-4963
Keywords in English plagiarism; document similarity; external plagiarism; intrinsic plagiarism
Tags best, IS, Plagiarism
Tags International impact, Reviewed
Changed by Changed by: RNDr. Jan Kasprzak, Ph.D., učo 1885. Changed: 11/5/2015 22:27.
Abstract
In this paper we describe our approach at the PAN 2010 plagiarism detection competition. We refer to the system we have used in PAN'09. We then present the improvements we have tried since the PAN'09 competition, and their impact on the results on the development corpus. We describe our experiments with intrinsic plagiarism detection and evaluate them. We then discuss the computational cost of each step of our implementation, including the performance data from two different computers.
Abstract (in Czech)
V tomto článku popisujeme náš přístup v soutěži PAN 2010 v detekci plagiátorství. Odkazujeme na systém, který jsme použili během PAN'09. Dále předkládáme vylepšení, která jsme vyzkoušeli, a jejich vliv na vývojový korpus. Popisujeme naše experimenty v oblasti detekce vnitřního plagiátorství a vyhodnocujeme je. Dále diskutujeme výpočetní náročnost každého kroku naší implementace, včetně výkonnostních dat na dvou různých počítačích.
Links
LA09016, research and development projectName: Účast ČR v European Research Consortium for Informatics and Mathematics (ERCIM) (Acronym: ERCIM)
Investor: Ministry of Education, Youth and Sports of the CR, Czech Republic membership in the European Research Consortium for Informatics and Mathematics
PrintDisplayed: 27/4/2024 16:20