******************** * Evaluation Data * ******************** FileA: document FileB: outPorterJavaPre.txt The mean number of words pre conflation class: 1.08896 The Index Compression Factor: 0.08169272 The number of words and stems that differ: 11615 The mean number characters removed: 0.7435469 The median number characters removed: 0.0 The mode number characters removed: 0 The characters removed table: Number of words with 0 Chars Removed 15712 Number of words with 1 Chars Removed 4672 Number of words with 2 Chars Removed 1730 Number of words with 3 Chars Removed 1872 Number of words with 4 Chars Removed 878 Number of words with 5 Chars Removed 253 Number of words with 6 Chars Removed 14 Number of words with 7 Chars Removed 10 Number of words with 8 Chars Removed 2 The mean Hamming distance: 0.83733046 The median Hamming distance: 0.0 The mode Hamming distance: 0 The Hamming distance table: Number of words with 0 Hamming distance 13528 Number of words with 1 Hamming distance 6851 Number of words with 2 Hamming distance 1717 Number of words with 3 Hamming distance 1886 Number of words with 4 Hamming distance 866 Number of words with 5 Hamming distance 240 Number of words with 6 Hamming distance 11 Number of words with 7 Hamming distance 13 Number of words with 8 Hamming distance 12 Number of words with 9 Hamming distance 8 Number of words with 10 Hamming distance 4 Number of words with 11 Hamming distance 5 Number of words with 12 Hamming distance 1 Number of words with 13 Hamming distance 0 Number of words with 14 Hamming distance 1 The Fox and Frakes Similarity Metric: 1.1942716 The Chris O'Neill Similarity Metric: 89.94910515772665%