********************
* Evaluation Data *
********************

FileA: outPaiceJava.txt
FileB: outPaiceJavaPre.txt

The mean number of words per conflation class: 2.1139948
The Index Compression Factor: 0.526962
The number of words and stems that differ: 2021

The mean number of characters removed: 0.031679105
The median number of characters removed: 0.0
The mode number of characters removed: 0
The characters removed table:
    Number of words with 0 Chars Removed 307504
    Number of words with 1 Chars Removed 28
    Number of words with 2 Chars Removed 49
    Number of words with 3 Chars Removed 102
    Number of words with 4 Chars Removed 302
    Number of words with 5 Chars Removed 985
    Number of words with 6 Chars Removed 540

The mean Hamming distance: 0.064043164
The median Hamming distance: 0.0
The mode Hamming distance: 0
The Hamming distance table:
    Number of words with 0 Hamming distance 307489
    Number of words with 1 Hamming distance 0
    Number of words with 2 Hamming distance 6
    Number of words with 3 Hamming distance 27
    Number of words with 4 Hamming distance 8
    Number of words with 5 Hamming distance 34
    Number of words with 6 Hamming distance 108
    Number of words with 7 Hamming distance 163
    Number of words with 8 Hamming distance 246
    Number of words with 9 Hamming distance 349
    Number of words with 10 Hamming distance 316
    Number of words with 11 Hamming distance 277
    Number of words with 12 Hamming distance 192
    Number of words with 13 Hamming distance 123
    Number of words with 14 Hamming distance 105
    Number of words with 15 Hamming distance 37
    Number of words with 16 Hamming distance 22
    Number of words with 17 Hamming distance 4
    Number of words with 18 Hamming distance 2
    Number of words with 19 Hamming distance 2

The Fox and Frakes Similarity Metric: 15.609744
The Chris O'Neill Similarity Metric: 99.36815081349216%
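
The summary statistics in this report can be cross-checked against its own two tables. The sketch below is not the evaluation tool's code. It copies the histogram counts from the report and recomputes the mean, median, and mode, plus the number of words with a non-zero Hamming distance. The helper names (`value_at`, `summarize`) are hypothetical. The last check assumes the usual definitions, which the report itself does not state: mean words per conflation class = N/S and Index Compression Factor = (N - S)/N, where N is the number of distinct words and S the number of distinct stems, so that ICF = 1 - 1/(mean class size).

```python
# Histograms copied from the report: value -> number of words.
CHARS_REMOVED = {0: 307504, 1: 28, 2: 49, 3: 102, 4: 302, 5: 985, 6: 540}
HAMMING = {
    0: 307489, 1: 0, 2: 6, 3: 27, 4: 8, 5: 34, 6: 108, 7: 163, 8: 246,
    9: 349, 10: 316, 11: 277, 12: 192, 13: 123, 14: 105, 15: 37, 16: 22,
    17: 4, 18: 2, 19: 2,
}


def value_at(hist, index):
    """Return the value at a 0-based position in the sorted, expanded data."""
    seen = 0
    for value in sorted(hist):
        seen += hist[value]
        if index < seen:
            return value
    raise IndexError("index beyond the number of observations")


def summarize(hist):
    """Recompute summary statistics from a value -> count histogram."""
    total = sum(hist.values())
    mean = sum(value * count for value, count in hist.items()) / total
    # Median: average of the two middle observations (they coincide when total is odd).
    median = (value_at(hist, (total - 1) // 2) + value_at(hist, total // 2)) / 2
    mode = max(hist, key=hist.get)
    changed = total - hist.get(0, 0)  # observations with a non-zero value
    return {"total": total, "mean": mean, "median": median,
            "mode": mode, "changed": changed}


chars_summary = summarize(CHARS_REMOVED)
hamming_summary = summarize(HAMMING)

# Assumed relation (not stated in the report): ICF = 1 - 1 / mean class size.
MEAN_CLASS_SIZE = 2.1139948
icf_from_class_size = 1 - 1 / MEAN_CLASS_SIZE

if __name__ == "__main__":
    print("chars removed:", chars_summary)
    print("hamming:", hamming_summary)
    print("ICF from mean class size:", icf_from_class_size)
```

Under these assumptions, both tables cover 309510 words, the recomputed means agree with the reported 0.031679105 and 0.064043164 to about seven decimal places, and the count of words with a non-zero Hamming distance equals the reported 2021 differing words. The value 1 - 1/2.1139948 also agrees with the reported Index Compression Factor of 0.526962 to within rounding.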