********************
* Evaluation Data *
********************

FileA: document
FileB: outPaiceJavaPre.txt

The mean number of words per conflation class: 1.3219938
The Index Compression Factor: 0.2435668
The number of words and stems that differ: 15475

The mean number of characters removed: 1.5187925
The median number of characters removed: 1.0
The mode number of characters removed: 0
The characters removed table:
    Number of words with  0 Chars Removed   9684
    Number of words with  1 Chars Removed   4712
    Number of words with  2 Chars Removed   4163
    Number of words with  3 Chars Removed   3701
    Number of words with  4 Chars Removed   1304
    Number of words with  5 Chars Removed    994
    Number of words with  6 Chars Removed    355
    Number of words with  7 Chars Removed    148
    Number of words with  8 Chars Removed     51
    Number of words with  9 Chars Removed     26
    Number of words with 10 Chars Removed      4
    Number of words with 11 Chars Removed      0
    Number of words with 12 Chars Removed      1

The mean Hamming distance: 1.5590025
The median Hamming distance: 1.0
The mode Hamming distance: 0
The Hamming distance table:
    Number of words with  0 Hamming distance   9668
    Number of words with  1 Hamming distance   4563
    Number of words with  2 Hamming distance   4208
    Number of words with  3 Hamming distance   3447
    Number of words with  4 Hamming distance   1525
    Number of words with  5 Hamming distance   1099
    Number of words with  6 Hamming distance    357
    Number of words with  7 Hamming distance    155
    Number of words with  8 Hamming distance     66
    Number of words with  9 Hamming distance     36
    Number of words with 10 Hamming distance     11
    Number of words with 11 Hamming distance      4
    Number of words with 12 Hamming distance      3
    Number of words with 13 Hamming distance      0
    Number of words with 14 Hamming distance      1

The Fox and Frakes Similarity Metric: 0.6414358
The Chris O'Neill Similarity Metric: 80.38414765072397%
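
The summary statistics in this report can be cross-checked against its own tables. The sketch below is not the evaluator's code; it recomputes the mean, median, and mode from the two histograms and checks the Index Compression Factor against the mean conflation class size. The names summarize, value_at_rank, and icf_from_mwc are hypothetical, and the relation ICF = 1 - 1/MWC assumes the usual definitions (MWC = N/S and ICF = (N - S)/N, for N unique words and S unique stems), which the report itself does not state.

```python
# Histograms copied from the two tables in the report:
# list index = characters removed / Hamming distance, entry = number of words.
CHARS_REMOVED = [9684, 4712, 4163, 3701, 1304, 994, 355, 148, 51, 26, 4, 0, 1]
HAMMING = [9668, 4563, 4208, 3447, 1525, 1099, 357, 155, 66, 36, 11, 4, 3, 0, 1]


def value_at_rank(counts, rank):
    """Return the value held by the rank-th word (1-indexed) in sorted order."""
    seen = 0
    for value, count in enumerate(counts):
        seen += count
        if rank <= seen:
            return value
    raise ValueError("rank exceeds the number of words")


def summarize(counts):
    """Return (total, mean, median, mode) for a histogram of word counts."""
    total = sum(counts)
    mean = sum(value * count for value, count in enumerate(counts)) / total
    # Average the two middle ranks; they coincide when total is odd.
    low = value_at_rank(counts, (total + 1) // 2)
    high = value_at_rank(counts, (total + 2) // 2)
    median = (low + high) / 2
    mode = max(range(len(counts)), key=lambda value: counts[value])
    return total, mean, median, mode


def icf_from_mwc(mwc):
    """Assumed definitions: MWC = N / S and ICF = (N - S) / N, so ICF = 1 - 1 / MWC."""
    return 1 - 1 / mwc


if __name__ == "__main__":
    for label, counts in (("chars removed", CHARS_REMOVED), ("Hamming", HAMMING)):
        total, mean, median, mode = summarize(counts)
        print(f"{label}: total={total} mean={mean:.7f} median={median} mode={mode}")
    # Words whose stem differs = all words minus those at Hamming distance 0.
    print("differing:", sum(HAMMING) - HAMMING[0])
    print("ICF from MWC:", round(icf_from_mwc(1.3219938), 7))
```

Under these assumptions both tables total 25143 words, and the recomputed means, medians, and modes agree with the reported figures. Subtracting the 9668 words at Hamming distance 0 from that total gives the reported 15475 differing pairs.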