********************
* Evaluation Data *
********************

FileA: outPaiceJava.txt
FileB: outPorterOrig.txt

The mean number of words per conflation class: 1.097181
The Index Compression Factor: 0.08857336
The number of words and stems that differ: 11271

The mean number of characters removed: 0.7877739
The median number of characters removed: 0.0
The mode number of characters removed: 0

The characters removed table:
    Number of words with 0 Chars Removed    15798
    Number of words with 1 Chars Removed     2720
    Number of words with 2 Chars Removed     4126
    Number of words with 3 Chars Removed     1733
    Number of words with 4 Chars Removed      382
    Number of words with 5 Chars Removed      264
    Number of words with 6 Chars Removed       66
    Number of words with 7 Chars Removed       42
    Number of words with 8 Chars Removed       10
    Number of words with 9 Chars Removed        2

The mean Hamming distance: 0.8812393
The median Hamming distance: 0.0
The mode Hamming distance: 0

The Hamming distance table:
    Number of words with 0 Hamming distance    13872
    Number of words with 1 Hamming distance     4527
    Number of words with 2 Hamming distance     4088
    Number of words with 3 Hamming distance     1788
    Number of words with 4 Hamming distance      454
    Number of words with 5 Hamming distance      291
    Number of words with 6 Hamming distance       63
    Number of words with 7 Hamming distance       43
    Number of words with 8 Hamming distance       13
    Number of words with 9 Hamming distance        4

The Fox and Frakes Similarity Metric: 1.1347655
The Chris O'Neill Similarity Metric: 86.82880909892985%
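
The mean, median and mode reported above can be recomputed from the two frequency tables alone. The following is a minimal sketch in Python, not the code that produced this report: the names `chars_removed`, `hamming`, `value_at` and `summarize` are hypothetical, and the counts are copied by hand from the tables. The last two checks are observations about these particular numbers, not a statement of how the tool computes them: the count of differing words equals the total minus the words at Hamming distance 0, and the Fox and Frakes figure is numerically consistent with the reciprocal of the mean Hamming distance.

```python
# Frequency tables copied from the report.
# List index = value (chars removed / Hamming distance), entry = number of words.
chars_removed = [15798, 2720, 4126, 1733, 382, 264, 66, 42, 10, 2]
hamming = [13872, 4527, 4088, 1788, 454, 291, 63, 43, 13, 4]


def value_at(counts, rank):
    """Return the value held by the word at 0-based position `rank`
    when all words are sorted by value."""
    seen = 0
    for value, count in enumerate(counts):
        seen += count
        if rank < seen:
            return value
    raise IndexError("rank out of range")


def summarize(counts):
    """Compute (total, mean, median, mode) from a frequency table."""
    total = sum(counts)
    mean = sum(value * count for value, count in enumerate(counts)) / total
    # For an odd total both ranks coincide; for an even total they are
    # the two middle positions, which are averaged.
    median = (value_at(counts, (total - 1) // 2) + value_at(counts, total // 2)) / 2
    mode = max(range(len(counts)), key=lambda value: counts[value])
    return total, mean, median, mode


total_c, mean_c, median_c, mode_c = summarize(chars_removed)
total_h, mean_h, median_h, mode_h = summarize(hamming)

# Words whose two outputs differ in at least one position.
differing = total_h - hamming[0]

# Reciprocal of the mean Hamming distance (assumed relation, see lead-in).
inverse_mean_hamming = 1 / mean_h

print(f"words compared:        {total_c}")
print(f"chars removed:         mean={mean_c:.7f} median={median_c} mode={mode_c}")
print(f"Hamming distance:      mean={mean_h:.7f} median={median_h} mode={mode_h}")
print(f"differing words:       {differing}")
print(f"1 / mean Hamming:      {inverse_mean_hamming:.7f}")
```

Both tables sum to the same number of words, and the recomputed means agree with the reported 0.7877739 and 0.8812393 to within rounding of the printed digits.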