********************
* Evaluation Data  *
********************

FileA: outPorterOrig.txt
FileB: outPorterJava.txt

The mean number of words per conflation class: 1.2928841
The Index Compression Factor: 0.22653548
The number of words and stems that differ: 98095

The mean number of characters removed: 0.9983684
The median number of characters removed: 0.0
The mode number of characters removed: 0
The characters removed table:
    Number of words with  0 Chars Removed   211415
    Number of words with  1 Chars Removed     1483
    Number of words with  2 Chars Removed    44764
    Number of words with  3 Chars Removed    20792
    Number of words with  4 Chars Removed    14721
    Number of words with  5 Chars Removed     8504
    Number of words with  6 Chars Removed     2317
    Number of words with  7 Chars Removed     4209
    Number of words with  8 Chars Removed      942
    Number of words with  9 Chars Removed      335
    Number of words with 10 Chars Removed       11
    Number of words with 11 Chars Removed       16
    Number of words with 12 Chars Removed        1

The mean Hamming distance: 0.9992052
The median Hamming distance: 0.0
The mode Hamming distance: 0
The Hamming distance table:
    Number of words with  0 Hamming distance   211415
    Number of words with  1 Hamming distance     1483
    Number of words with  2 Hamming distance    44764
    Number of words with  3 Hamming distance    20792
    Number of words with  4 Hamming distance    14462
    Number of words with  5 Hamming distance     8763
    Number of words with  6 Hamming distance     2317
    Number of words with  7 Hamming distance     4209
    Number of words with  8 Hamming distance      942
    Number of words with  9 Hamming distance      335
    Number of words with 10 Hamming distance       11
    Number of words with 11 Hamming distance       16
    Number of words with 12 Hamming distance        1

The Fox and Frakes Similarity Metric: 1.0007955
The Chris O'Neill Similarity Metric: 90.87492912818844%
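
The summary figures can be cross-checked against the two tables. The report does not state its formulas, so the sketch below rests on assumptions: the index compression factor is taken as (N - S) / N, which equals 1 - 1 / (mean words per conflation class), and the Fox and Frakes metric is taken as the inverse of the mean Hamming distance. The names `histogram_stats`, `index_compression_factor`, and `inverse_mean_distance` are illustrative and do not come from the tool that produced the report. The Chris O'Neill metric is not reproduced because its definition cannot be inferred from the data shown.

```python
# Histograms copied from the report: value -> number of words.
CHARS_REMOVED = {0: 211415, 1: 1483, 2: 44764, 3: 20792, 4: 14721, 5: 8504,
                 6: 2317, 7: 4209, 8: 942, 9: 335, 10: 11, 11: 16, 12: 1}
HAMMING = {0: 211415, 1: 1483, 2: 44764, 3: 20792, 4: 14462, 5: 8763,
           6: 2317, 7: 4209, 8: 942, 9: 335, 10: 11, 11: 16, 12: 1}
MEAN_CLASS_SIZE = 1.2928841  # mean number of words per conflation class


def histogram_stats(hist):
    """Return (total, mean, median, mode) for a value -> count histogram."""
    total = sum(hist.values())
    mean = sum(value * count for value, count in hist.items()) / total
    mode = max(hist, key=hist.get)
    # Lower median: first value whose cumulative count reaches half the total.
    running = 0
    median = None
    for value in sorted(hist):
        running += hist[value]
        if 2 * running >= total:
            median = value
            break
    return total, mean, median, mode


def index_compression_factor(mean_class_size):
    """Assumed definition: (N - S) / N, rewritten as 1 - S / N."""
    return 1.0 - 1.0 / mean_class_size


def inverse_mean_distance(hist):
    """Assumed Fox and Frakes definition: 1 / mean Hamming distance."""
    _, mean, _, _ = histogram_stats(hist)
    return 1.0 / mean


if __name__ == "__main__":
    for name, hist in (("chars removed", CHARS_REMOVED), ("Hamming", HAMMING)):
        total, mean, median, mode = histogram_stats(hist)
        print(f"{name}: total={total} mean={mean:.7f} "
              f"median={median} mode={mode}")
    print(f"differing words: {sum(CHARS_REMOVED.values()) - CHARS_REMOVED[0]}")
    print(f"ICF: {index_compression_factor(MEAN_CLASS_SIZE):.7f}")
    print(f"inverse mean Hamming distance: {inverse_mean_distance(HAMMING):.7f}")
```

Under these assumptions the recomputed values agree with the reported ones to about six decimal places, and the count of words with a nonzero entry matches the reported 98095. The two tables differ only in rows 4 and 5, where 259 words move from 4 to 5, which accounts for the slightly higher mean Hamming distance.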