********************
* Evaluation Data *
********************

FileA: document
FileB: outPaiceJavaPre.txt

The mean number of words per conflation class: 2.1139948
The Index Compression Factor: 0.526962
The number of words and stems that differ: 242443

The mean number of characters removed: 2.487377
The median number of characters removed: 2.0
The mode number of characters removed: 0

The characters removed table:
Number of words with 0 Chars Removed 67163
Number of words with 1 Chars Removed 51821
Number of words with 2 Chars Removed 61956
Number of words with 3 Chars Removed 45370
Number of words with 4 Chars Removed 27461
Number of words with 5 Chars Removed 24357
Number of words with 6 Chars Removed 11469
Number of words with 7 Chars Removed 10282
Number of words with 8 Chars Removed 4509
Number of words with 9 Chars Removed 3046
Number of words with 10 Chars Removed 1197
Number of words with 11 Chars Removed 559
Number of words with 12 Chars Removed 196
Number of words with 13 Chars Removed 93
Number of words with 14 Chars Removed 24
Number of words with 15 Chars Removed 6
Number of words with 16 Chars Removed 1

The mean Hamming distance: 2.5580337
The median Hamming distance: 2.0
The mode Hamming distance: 0

The Hamming distance table:
Number of words with 0 Hamming distance 67067
Number of words with 1 Hamming distance 50970
Number of words with 2 Hamming distance 60880
Number of words with 3 Hamming distance 44559
Number of words with 4 Hamming distance 27720
Number of words with 5 Hamming distance 25111
Number of words with 6 Hamming distance 11916
Number of words with 7 Hamming distance 9698
Number of words with 8 Hamming distance 4803
Number of words with 9 Hamming distance 3167
Number of words with 10 Hamming distance 1347
Number of words with 11 Hamming distance 737
Number of words with 12 Hamming distance 499
Number of words with 13 Hamming distance 333
Number of words with 14 Hamming distance 282
Number of words with 15 Hamming distance 186
Number of words with 16 Hamming distance 123
Number of words with 17 Hamming distance 54
Number of words with 18 Hamming distance 36
Number of words with 19 Hamming distance 20
Number of words with 20 Hamming distance 2

The Fox and Frakes Similarity Metric: 0.39092526
The Chris O'Neill Similarity Metric: 74.09404348774173%
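
The summary figures above can be cross-checked against the tables. The following is a minimal sketch, not the evaluation tool's own code: it recomputes the mean, median and mode of the characters-removed table, and tests two relations that are assumptions here rather than anything the report states. The first is that the Index Compression Factor equals 1 - 1/(mean words per conflation class), which follows if it is defined as (N - S)/N for N distinct words and S distinct stems. The second is that the "words and stems that differ" count is the total word count minus the number of words at Hamming distance 0. All names in the block (chars_removed, value_at, and so on) are hypothetical.

```python
# Characters-removed histogram copied from the report:
# key = characters removed, value = number of words.
chars_removed = {
    0: 67163, 1: 51821, 2: 61956, 3: 45370, 4: 27461, 5: 24357,
    6: 11469, 7: 10282, 8: 4509, 9: 3046, 10: 1197, 11: 559,
    12: 196, 13: 93, 14: 24, 15: 6, 16: 1,
}

def value_at(hist, index):
    """Return the value at a 0-based position in the sorted, expanded data."""
    seen = 0
    for value in sorted(hist):
        seen += hist[value]
        if index < seen:
            return value
    raise IndexError("index beyond histogram total")

total = sum(chars_removed.values())
mean = sum(k * n for k, n in chars_removed.items()) / total
# For an even total, the median averages the two middle observations.
med = (value_at(chars_removed, (total - 1) // 2)
       + value_at(chars_removed, total // 2)) / 2
mode = max(chars_removed, key=chars_removed.get)

# Assumed relation: ICF = (N - S) / N = 1 - 1 / (N / S).
mean_words_per_class = 2.1139948
icf_check = 1 - 1 / mean_words_per_class

# Assumed relation: differing pairs = all words minus Hamming-distance-0 words.
hamming_zero = 67067
differing = total - hamming_zero

print(total, round(mean, 6), med, mode)
print(round(icf_check, 6), differing)
```

Under these assumptions the recomputed values agree with the reported ones to the precision shown, which suggests the two tables and the summary lines are internally consistent.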