********************
* Evaluation Data *
********************

FileA: document
FileB: outPorterJavaPre.txt

The mean number of words per conflation class: 1.251894
The Index Compression Factor: 0.20121029
The number of words and stems that differ: 446

The mean number characters removed: 1.7534039
The median number characters removed: 1.0
The mode number characters removed: 0
The characters removed table:
    Number of words with 0 Chars Removed    243
    Number of words with 1 Chars Removed    106
    Number of words with 2 Chars Removed    121
    Number of words with 3 Chars Removed     72
    Number of words with 4 Chars Removed     49
    Number of words with 5 Chars Removed     40
    Number of words with 6 Chars Removed     17
    Number of words with 7 Chars Removed      9
    Number of words with 8 Chars Removed      2
    Number of words with 9 Chars Removed      2

The mean Hamming distance: 1.8108926
The median Hamming distance: 1.0
The mode Hamming distance: 0
The Hamming distance table:
    Number of words with 0 Hamming distance    215
    Number of words with 1 Hamming distance    133
    Number of words with 2 Hamming distance    117
    Number of words with 3 Hamming distance     73
    Number of words with 4 Hamming distance     53
    Number of words with 5 Hamming distance     40
    Number of words with 6 Hamming distance     17
    Number of words with 7 Hamming distance      9
    Number of words with 8 Hamming distance      2
    Number of words with 9 Hamming distance      2

The Fox and Frakes Similarity Metric: 0.55221385
The Chris O'Neill Similarity Metric: 80.71281157069647%
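
The summary figures in this report can be cross-checked against its two tables. The following is a minimal Python sketch, not the evaluation tool's own code: the names `chars_removed`, `hamming`, and `summarize` are hypothetical, and the relationships used for the Index Compression Factor, (N - S) / N, and for the Fox and Frakes metric, the inverse of the mean Hamming distance, are assumptions inferred from the reported numbers rather than taken from the tool. The Chris O'Neill metric is not reproduced because the report does not define it.

```python
from statistics import median

# Histograms copied from the report: key = value, count = number of words.
chars_removed = {0: 243, 1: 106, 2: 121, 3: 72, 4: 49,
                 5: 40, 6: 17, 7: 9, 8: 2, 9: 2}
hamming = {0: 215, 1: 133, 2: 117, 3: 73, 4: 53,
           5: 40, 6: 17, 7: 9, 8: 2, 9: 2}


def summarize(hist):
    """Return (word count, mean, median, mode) of a value -> count histogram."""
    total = sum(hist.values())
    mean = sum(value * count for value, count in hist.items()) / total
    # Expand the histogram into a sorted list so the median is exact.
    expanded = [v for v, count in sorted(hist.items()) for _ in range(count)]
    mode = max(hist, key=hist.get)
    return total, mean, median(expanded), mode


n_words, cr_mean, cr_median, cr_mode = summarize(chars_removed)
_, hd_mean, hd_median, hd_mode = summarize(hamming)

# Words whose stem differs from the word: nonzero Hamming distance.
n_differ = n_words - hamming[0]

# Assumption: the similarity metric is the inverse of the mean distance.
similarity = 1.0 / hd_mean

# Assumption: ICF = (N - S) / N, so the stem count S can be recovered
# from the reported ICF, and N / S gives words per conflation class.
reported_icf = 0.20121029
n_stems = round(n_words * (1.0 - reported_icf))
words_per_class = n_words / n_stems

print(n_words, n_differ, n_stems)
print(cr_mean, cr_median, cr_mode)
print(hd_mean, hd_median, hd_mode)
print(similarity, words_per_class)
```

Under these assumptions both tables sum to the same word count, and the recomputed means, medians, modes, differing-word count, and words-per-class ratio agree with the reported values to the printed precision.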