******************** * Evaluation Data * ******************** FileA: document FileB: outPaicePascal.txt The mean number of words pre conflation class: 2.0966387 The Index Compession Factor: 0.5230461 The number of words and stems that differ: 242164 The mean number characters removed: 2.455756 The median number characters removed: 2.0 The mode number characters removed: 0 The chaacters removed table: Number of words with 0 Chars Removed 67442 Number of words with 1 Chars Removed 52180 Number of words with 2 Chars Removed 62536 Number of words with 3 Chars Removed 45686 Number of words with 4 Chars Removed 27579 Number of words with 5 Chars Removed 24200 Number of words with 6 Chars Removed 11187 Number of words with 7 Chars Removed 9886 Number of words with 8 Chars Removed 4151 Number of words with 9 Chars Removed 2856 Number of words with 10 Chars Removed 1058 Number of words with 11 Chars Removed 481 Number of words with 12 Chars Removed 167 Number of words with 13 Chars Removed 77 Number of words with 14 Chars Removed 18 Number of words with 15 Chars Removed 5 Number of words with 16 Chars Removed 1 The mean Hamming distance: 2.4941683 The median Hamming distance: 2.0 The mode Hamming distance: 0 The Hamming distance table: Number of words with 0 Hamming distance 67346 Number of words with 1 Hamming distance 51324 Number of words with 2 Hamming distance 61457 Number of words with 3 Hamming distance 44859 Number of words with 4 Hamming distance 27927 Number of words with 5 Hamming distance 25267 Number of words with 6 Hamming distance 11963 Number of words with 7 Hamming distance 9701 Number of words with 8 Hamming distance 4721 Number of words with 9 Hamming distance 3021 Number of words with 10 Hamming distance 1139 Number of words with 11 Hamming distance 486 Number of words with 12 Hamming distance 191 Number of words with 13 Hamming distance 78 Number of words with 14 Hamming distance 24 Number of words with 15 Hamming distance 4 Number of words with 16 Hamming distance 2 The Fox and Frakes Similarity Metric: 0.40093526 The Chris O'Neill Similarity Metric: 74.60306522269153%