******************** * Evaluation Data * ******************** FileA: document FileB: outPorterJava.txt The mean number of words pre conflation class: 1.0881119 The Index Compession Factor: 0.080976814 The number of words and stems that differ: 11596 The mean number characters removed: 0.7377799 The median number characters removed: 0.0 The mode number characters removed: 0 The chaacters removed table: Number of words with 0 Chars Removed 15734 Number of words with 1 Chars Removed 4680 Number of words with 2 Chars Removed 1732 Number of words with 3 Chars Removed 1874 Number of words with 4 Chars Removed 865 Number of words with 5 Chars Removed 238 Number of words with 6 Chars Removed 8 Number of words with 7 Chars Removed 10 Number of words with 8 Chars Removed 2 The mean Hamming distance: 0.82591575 The median Hamming distance: 0.0 The mode Hamming distance: 0 The Hamming distance table: Number of words with 0 Hamming distance 13547 Number of words with 1 Hamming distance 6862 Number of words with 2 Hamming distance 1719 Number of words with 3 Hamming distance 1888 Number of words with 4 Hamming distance 867 Number of words with 5 Hamming distance 240 Number of words with 6 Hamming distance 8 Number of words with 7 Hamming distance 10 Number of words with 8 Hamming distance 2 The Fox and Frakes Similarity Metric: 1.2107773 The Chris O'Neill Similarity Metric: 90.0743101970231%