******************** * Evaluation Data * ******************** FileA: outPorterOrig.txt FileB: outPorterJava.txt The mean number of words pre conflation class: 1.0881119 The Index Compession Factor: 0.080976814 The number of words and stems that differ: 468 The mean number characters removed: 0.029232789 The median number characters removed: 0.0 The mode number characters removed: 0 The chaacters removed table: Number of words with 0 Chars Removed 24676 Number of words with 1 Chars Removed 319 Number of words with 2 Chars Removed 28 Number of words with 3 Chars Removed 120 The mean Hamming distance: 0.02927256 The median Hamming distance: 0.0 The mode Hamming distance: 0 The Hamming distance table: Number of words with 0 Hamming distance 24675 Number of words with 1 Hamming distance 320 Number of words with 2 Hamming distance 28 Number of words with 3 Hamming distance 120 The Fox and Frakes Similarity Metric: 34.161686 The Chris O'Neill Similarity Metric: 99.63653346856425%