Full metadata record
DC Field | Value | Language
dc.contributor.author | RAMIREZ DE LA CRUZ, AARON | -
dc.contributor.author | RAMIREZ DE LA ROSA, ADRIANA GABRIELA | -
dc.contributor.author | SANCHEZ SANCHEZ, CHRISTIAN | -
dc.contributor.author | JIMENEZ SALAZAR, HECTOR | -
dc.coverage.spatial | <dc:creator id="info:eu-repo/dai/mx/cvu/239516">ADRIANA GABRIELA RAMIREZ DE LA ROSA</dc:creator> | -
dc.coverage.spatial | <dc:creator id="info:eu-repo/dai/mx/cvu/170715">CHRISTIAN SANCHEZ SANCHEZ</dc:creator> | -
dc.coverage.spatial | <dc:creator id="info:eu-repo/dai/mx/cvu/54971">HECTOR JIMENEZ SALAZAR</dc:creator> | -
dc.coverage.temporal | <dc:subject>info:eu-repo/classification/cti/7</dc:subject> | -
dc.date.accessioned | 2020-06-22T22:57:17Z | -
dc.date.available | 2020-06-22T22:57:17Z | -
dc.date.issued | 2007 | -
dc.identifier.citation | FIRE 2014 : post-proceedings of the 6th workshop of the Forum for Information Retrieval Evaluation | en_US
dc.identifier.uri | http://ilitia.cua.uam.mx:8080/jspui/handle/123456789/484 | -
dc.description.abstract | Source code plagiarism can be identified by analyzing similarities across several diverse aspects of a pair of source code documents. In this paper we present three types of similarity features that account for three aspects of source code documents: i) lexical, ii) structural, and iii) stylistic. For the lexical view, we used a character 3-gram model that ignores the reserved words of the programming language under review. For the structural view, we proposed two similarity metrics that take into account the function signatures within a source code document, namely the data types and the identifier names in each function signature. The third view accounts for several stylistic features, such as the number of white spaces, lines of code, uppercase letters, etc. Accordingly, we proposed 8 similarity features to represent pairs of source code documents in order to identify plagiarized pairs under a supervised approach. We used a set of more than 32000 source code documents in Java and C to perform our experiments. The results show the pertinence of our set of features for identifying plagiarism in source code documents that satisfy particular conditions, such as source code that solves difficult problems. | en_US
dc.description.sponsorship | FIRE 2014 : post-proceedings of the 6th workshop of the Forum for Information Retrieval Evaluation | en_US
dc.language.iso | English | en_US
dc.publisher | New York : Association for Computing Machinery | en_US
dc.relation | 978-1-4503-3755-7 | -
dc.rights | https://dl.acm.org/doi/abs/10.1145/2824864.2824879 | -
dc.rights | https://doi.org/10.1145/2824864.2824879 | -
dc.subject | Source code (Computer science) | en_US
dc.subject | Data structures (Computers) | en_US
dc.subject | Plagiarism - Technological innovations | en_US
dc.title | On the importance of lexicon, structure and style for identifying source code plagiarism | en_US
dc.type | Book chapter | en_US
Appears in collections: Books

Files in this item:
File | Description | Size | Format
On the importance.pdf | | 331.43 kB | Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.