******************** * Evaluation Data * ******************** FileA: document FileB: outPaiceJavaPre.txt The mean number of words pre conflation class: 1.3828452 The Index Compression Factor: 0.27685326 The number of words and stems that differ: 505 The mean number characters removed: 2.7518911 The median number characters removed: 2.0 The mode number characters removed: 0 The characters removed table: Number of words with 0 Chars Removed 156 Number of words with 1 Chars Removed 84 Number of words with 2 Chars Removed 120 Number of words with 3 Chars Removed 112 Number of words with 4 Chars Removed 51 Number of words with 5 Chars Removed 61 Number of words with 6 Chars Removed 25 Number of words with 7 Chars Removed 19 Number of words with 8 Chars Removed 10 Number of words with 9 Chars Removed 10 Number of words with 10 Chars Removed 3 Number of words with 11 Chars Removed 2 Number of words with 12 Chars Removed 0 Number of words with 13 Chars Removed 0 Number of words with 14 Chars Removed 1 Number of words with 15 Chars Removed 0 Number of words with 16 Chars Removed 2 Number of words with 17 Chars Removed 0 Number of words with 18 Chars Removed 0 Number of words with 19 Chars Removed 3 Number of words with 20 Chars Removed 1 Number of words with 21 Chars Removed 0 Number of words with 22 Chars Removed 1 The mean Hamming distance: 2.8018155 The median Hamming distance: 2.0 The mode Hamming distance: 0 The Hamming distance table: Number of words with 0 Hamming distance 156 Number of words with 1 Hamming distance 82 Number of words with 2 Hamming distance 117 Number of words with 3 Hamming distance 104 Number of words with 4 Hamming distance 60 Number of words with 5 Hamming distance 60 Number of words with 6 Hamming distance 27 Number of words with 7 Hamming distance 21 Number of words with 8 Hamming distance 11 Number of words with 9 Hamming distance 10 Number of words with 10 Hamming distance 3 Number of words with 11 Hamming distance 2 Number of words with 12 Hamming distance 0 Number of words with 13 Hamming distance 0 Number of words with 14 Hamming distance 1 Number of words with 15 Hamming distance 0 Number of words with 16 Hamming distance 2 Number of words with 17 Hamming distance 0 Number of words with 18 Hamming distance 0 Number of words with 19 Hamming distance 3 Number of words with 20 Hamming distance 1 Number of words with 21 Hamming distance 0 Number of words with 22 Hamming distance 1 The Fox and Frakes Similarity Metric: 0.35691145 The Chris O'Neill Similarity Metric: 70.04318893917852%