********************
* Evaluation Data *
********************

FileA: outPaiceJava.txt
FileB: outPorterJava.txt

The mean number of words per conflation class: 1.2928841
The Index Compression Factor: 0.22653548
The number of words and stems that differ: 205413

The mean number of characters removed: 2.0268586
The median number of characters removed: 2.0
The mode number of characters removed: 0

The characters removed table:
Number of words with 0 Chars Removed 115732
Number of words with 1 Chars Removed 31006
Number of words with 2 Chars Removed 61997
Number of words with 3 Chars Removed 33183
Number of words with 4 Chars Removed 22870
Number of words with 5 Chars Removed 18841
Number of words with 6 Chars Removed 9242
Number of words with 7 Chars Removed 8558
Number of words with 8 Chars Removed 3807
Number of words with 9 Chars Removed 2563
Number of words with 10 Chars Removed 993
Number of words with 11 Chars Removed 458
Number of words with 12 Chars Removed 161
Number of words with 13 Chars Removed 75
Number of words with 14 Chars Removed 18
Number of words with 15 Chars Removed 5
Number of words with 16 Chars Removed 1

The mean Hamming distance: 2.0981553
The median Hamming distance: 2.0
The mode Hamming distance: 0

The Hamming distance table:
Number of words with 0 Hamming distance 104097
Number of words with 1 Hamming distance 42412
Number of words with 2 Hamming distance 58974
Number of words with 3 Hamming distance 34529
Number of words with 4 Hamming distance 22824
Number of words with 5 Hamming distance 19511
Number of words with 6 Hamming distance 9931
Number of words with 7 Hamming distance 8370
Number of words with 8 Hamming distance 4335
Number of words with 9 Hamming distance 2712
Number of words with 10 Hamming distance 1062
Number of words with 11 Hamming distance 462
Number of words with 12 Hamming distance 185
Number of words with 13 Hamming distance 76
Number of words with 14 Hamming distance 24
Number of words with 15 Hamming distance 4
Number of words with 16 Hamming distance 2

The Fox and Frakes Similarity Metric: 0.47660917
The Chris O'Neill Similarity Metric: 78.2371460112436%
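
The summary figures in this report can be cross-checked against its own tables. The sketch below is not the evaluation tool's code; it is a minimal Python recomputation, with the two histograms copied from the tables above. The helper names (`value_at`, `summarize`) are hypothetical. The last check assumes the usual definitions, where the mean number of words per conflation class is n/s and the Index Compression Factor is (n - s)/n, for n distinct words and s distinct stems.

```python
# Histograms copied from the report: index = characters removed / distance,
# value = number of words. Not produced by the original tool.
chars_removed = [115732, 31006, 61997, 33183, 22870, 18841, 9242, 8558,
                 3807, 2563, 993, 458, 161, 75, 18, 5, 1]
hamming = [104097, 42412, 58974, 34529, 22824, 19511, 9931, 8370,
           4335, 2712, 1062, 462, 185, 76, 24, 4, 2]


def value_at(counts, rank):
    """Return the value held by the item at 0-based `rank` in sorted order."""
    running = 0
    for value, count in enumerate(counts):
        running += count
        if rank < running:
            return value
    raise IndexError("rank beyond histogram total")


def summarize(counts):
    """Compute (total, mean, median, mode) of a histogram of small integers."""
    total = sum(counts)
    mean = sum(value * count for value, count in enumerate(counts)) / total
    # Median of an even-sized sample is the average of the two middle items.
    median = (value_at(counts, (total - 1) // 2) + value_at(counts, total // 2)) / 2
    mode = max(range(len(counts)), key=counts.__getitem__)
    return total, mean, median, mode


removed_stats = summarize(chars_removed)
hamming_stats = summarize(hamming)
print("chars removed (total, mean, median, mode):", removed_stats)
print("Hamming       (total, mean, median, mode):", hamming_stats)

# Words whose two stems differ = all words minus those at Hamming distance 0.
differing = hamming_stats[0] - hamming[0]
print("words and stems that differ:", differing)

# Assumed relation: ICF = (n - s)/n = 1 - 1/(n/s), with n/s taken from the report.
mean_words_per_class = 1.2928841
icf = 1 - 1 / mean_words_per_class
print("ICF implied by mean class size:", round(icf, 8))
```

Both tables sum to the same 309510 words, the recomputed means come out at about 2.02686 and 2.09816, and the count of differing stems matches the reported 205413, so the report is internally consistent under these assumptions.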