********************
* Evaluation Data *
********************

FileA: document
FileB: outPorterJava.txt

The mean number of words per conflation class: 1.2928841
The Index Compression Factor: 0.22653548
The number of words and stems that differ: 135874

The mean number of characters removed: 0.4345514
The median number of characters removed: 0.0
The mode number of characters removed: 0

The characters removed table:
    Number of words with 0 Chars Removed    208719
    Number of words with 1 Chars Removed     77009
    Number of words with 2 Chars Removed     14914
    Number of words with 3 Chars Removed      7847
    Number of words with 4 Chars Removed       985
    Number of words with 5 Chars Removed        36

The mean Hamming distance: 0.55122614
The median Hamming distance: 0.0
The mode Hamming distance: 0

The Hamming distance table:
    Number of words with 0 Hamming distance    173636
    Number of words with 1 Hamming distance    111853
    Number of words with 2 Hamming distance     14617
    Number of words with 3 Hamming distance      8129
    Number of words with 4 Hamming distance      1239
    Number of words with 5 Hamming distance        36

The Fox and Frakes Similarity Metric: 1.8141375
The Chris O'Neill Similarity Metric: 93.80406617251795%
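
The summary figures in the report can be cross-checked against its two tables. The sketch below is not the tool that produced this output; it is a minimal Python illustration, and the names `histogram_stats`, `chars_removed`, and `hamming` are hypothetical. It recomputes the mean, median, and mode from each table, derives the count of differing words from the Hamming table, and assumes the usual definition of the Index Compression Factor as 1 - 1/(mean words per conflation class).

```python
def histogram_stats(hist):
    """Return (total, mean, median, mode) for a {value: count} histogram."""
    total = sum(hist.values())
    mean = sum(value * count for value, count in hist.items()) / total
    mode = max(hist, key=hist.get)  # value with the largest count
    ordered = sorted(hist.items())

    def value_at(rank):
        # Value found at a 0-based rank in the sorted list of all observations.
        seen = 0
        for value, count in ordered:
            seen += count
            if rank < seen:
                return value

    # Average the two middle observations (they coincide when total is odd).
    median = (value_at((total - 1) // 2) + value_at(total // 2)) / 2
    return total, mean, median, mode


# Counts copied from the two tables in the report.
chars_removed = {0: 208719, 1: 77009, 2: 14914, 3: 7847, 4: 985, 5: 36}
hamming = {0: 173636, 1: 111853, 2: 14617, 3: 8129, 4: 1239, 5: 36}

cr_total, cr_mean, cr_median, cr_mode = histogram_stats(chars_removed)
hd_total, hd_mean, hd_median, hd_mode = histogram_stats(hamming)

# A word differs from its stem exactly when its Hamming distance is nonzero.
words_that_differ = hd_total - hamming[0]

# Assumed relation: ICF = (N - S) / N = 1 - 1 / (N / S), where N / S is the
# reported mean number of words per conflation class.
mean_class_size = 1.2928841
icf = 1 - 1 / mean_class_size

print("chars removed:", cr_total, cr_mean, cr_median, cr_mode)
print("hamming:      ", hd_total, hd_mean, hd_median, hd_mode)
print("differ:       ", words_that_differ)
print("icf:          ", icf)
```

Both tables sum to the same number of words, and the recomputed values agree with the reported ones to the printed precision, which suggests the report is internally consistent.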