********************
* Evaluation Data  *
********************

FileA: outPorterJava.txt
FileB: outPorterJavaPre.txt

The mean number of words per conflation class: 1.5939909
The Index Compression Factor: 0.37264386
The number of words and stems that differ: 2021

The mean number characters removed: 0.030186424
The median number characters removed: 0.0
The mode number characters removed: 0

The characters removed table:
    Number of words with 0 Chars Removed    307492
    Number of words with 1 Chars Removed    13
    Number of words with 2 Chars Removed    95
    Number of words with 3 Chars Removed    263
    Number of words with 4 Chars Removed    358
    Number of words with 5 Chars Removed    815
    Number of words with 6 Chars Removed    474

The mean Hamming distance: 0.06984912
The median Hamming distance: 0.0
The mode Hamming distance: 0

The Hamming distance table:
    Number of words with 0 Hamming distance     307489
    Number of words with 1 Hamming distance     0
    Number of words with 2 Hamming distance     0
    Number of words with 3 Hamming distance     0
    Number of words with 4 Hamming distance     6
    Number of words with 5 Hamming distance     13
    Number of words with 6 Hamming distance     46
    Number of words with 7 Hamming distance     118
    Number of words with 8 Hamming distance     228
    Number of words with 9 Hamming distance     268
    Number of words with 10 Hamming distance    323
    Number of words with 11 Hamming distance    287
    Number of words with 12 Hamming distance    253
    Number of words with 13 Hamming distance    194
    Number of words with 14 Hamming distance    136
    Number of words with 15 Hamming distance    79
    Number of words with 16 Hamming distance    46
    Number of words with 17 Hamming distance    15
    Number of words with 18 Hamming distance    5
    Number of words with 19 Hamming distance    3
    Number of words with 20 Hamming distance    1

The Fox and Frakes Similarity Metric: 14.316573
The Chris O'Neill Similarity Metric: 99.37232614908764%
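
The summary statistics above can be cross-checked against the two tables. The following is a minimal sketch, not the tool that produced the report: it assumes the reported means are count-weighted averages over the tables, and the names `chars_removed`, `hamming`, `weighted_mean`, `lower_median`, and `mode` are hypothetical helpers introduced only for this check. The counts are copied from the report.

```python
# Histograms copied from the report: value -> number of words.
chars_removed = {0: 307492, 1: 13, 2: 95, 3: 263, 4: 358, 5: 815, 6: 474}
hamming = {
    0: 307489, 1: 0, 2: 0, 3: 0, 4: 6, 5: 13, 6: 46, 7: 118, 8: 228,
    9: 268, 10: 323, 11: 287, 12: 253, 13: 194, 14: 136, 15: 79,
    16: 46, 17: 15, 18: 5, 19: 3, 20: 1,
}


def weighted_mean(table):
    """Mean of the values, weighting each value by its word count."""
    total = sum(table.values())
    return sum(value * count for value, count in table.items()) / total


def lower_median(table):
    """Smallest value whose cumulative count reaches half of all words."""
    total = sum(table.values())
    running = 0
    for value in sorted(table):
        running += table[value]
        if running * 2 >= total:
            return value


def mode(table):
    """Value with the largest word count."""
    return max(table, key=table.get)


mean_chars = weighted_mean(chars_removed)
mean_hamming = weighted_mean(hamming)
# Words whose two outputs differ: every word with a nonzero Hamming distance.
differing = sum(count for value, count in hamming.items() if value > 0)

print("mean chars removed:", mean_chars)
print("median / mode chars removed:", lower_median(chars_removed), mode(chars_removed))
print("mean Hamming distance:", mean_hamming)
print("median / mode Hamming distance:", lower_median(hamming), mode(hamming))
print("words that differ:", differing)
# Assumption: the similarity figure is compared with 1 / mean Hamming distance.
print("reciprocal of mean Hamming distance:", 1 / mean_hamming)
```

Under these assumptions the recomputed means agree with the reported values to about six decimal places, both tables sum to the same word total, and the count of words with a nonzero Hamming distance equals the reported 2021. The reciprocal of the mean Hamming distance also coincides with the reported Fox and Frakes figure; this is an observation about these numbers, not a definition taken from the tool.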