Plagiarism Detection Methodology
Our plagiarism detection algorithm uses parse trees to check for program
similarities. The module is currently available for detecting plagiarism of
programs written in Java and C, and other programming languages will also
eventually be covered. Before describing the new methodology, the following
assumptions are made. The system will ignore:
ñ meaningless information in the program, such as white spaces, comments,
import statements and name of identifiers;
ñ the order of the fields, methods and statements;
ñ programs with syntax errors.
The new algorithm operates in two phases:
1. All programs to be compared are parsed into a parse tree structure.
2. These parse trees are compared in pairs to calculate the similarity. The
comparison traverses all parts of the parse tree. Each sub-tree represents a
logical part of the program. Different sub-tree may have different weights to
be measured. The final result of the comparison is shown in tabular format.
Plagiarism Detection Methodology
Our plagiarism detection algorithm uses parse trees to check for program
similarities. The module is currently available for detecting plagiarism of
programs written in Java and C, and other programming languages will also
eventually be covered. Before describing the new methodology, the following
assumptions are made. The system will ignore:
ñ meaningless information in the program, such as white spaces, comments,
import statements and name of identifiers;
ñ the order of the fields, methods and statements;
ñ programs with syntax errors.
The new algorithm operates in two phases:
1. All programs to be compared are parsed into a parse tree structure.
2. These parse trees are compared in pairs to calculate the similarity. The
comparison traverses all parts of the parse tree. Each sub-tree represents a
logical part of the program. Different sub-tree may have different weights to
be measured. The final result of the comparison is shown in tabular format.
การแปล กรุณารอสักครู่..