******************** * Evaluation Data * ******************** FileA: outPorterOrig.txt FileB: outPorterJava.txt The mean number of words pre conflation class: 1.251894 The Index Compression Factor: 0.20121029 The number of words and stems that differ: 32 The mean number characters removed: 0.083207265 The median number characters removed: 0.0 The mode number characters removed: 0 The characters removed table: Number of words with 0 Chars Removed 629 Number of words with 1 Chars Removed 20 Number of words with 2 Chars Removed 1 Number of words with 3 Chars Removed 11 The mean Hamming distance: 0.083207265 The median Hamming distance: 0.0 The mode Hamming distance: 0 The Hamming distance table: Number of words with 0 Hamming distance 629 Number of words with 1 Hamming distance 20 Number of words with 2 Hamming distance 1 Number of words with 3 Hamming distance 11 The Fox and Frakes Similarity Metric: 12.018182 The Chris O'Neill Similarity Metric: 98.97821417110373%