Name Description Size
head_bayes.js 876
resources
test_bug228675.js import-globals-from resources/trainingfile.js 3948
test_customTokenization.js 5041
test_junkAsTraits.js 15972
test_msgCorpus.js msgCorpus.corpusCounts(1001, messageCountObject); let v1001 = messageCountObject.value; msgCorpus.corpusCounts(1003, messageCountObject); let v1003 = messageCountObject.value; dump("pre-clear value " + v1001 + " " + v1003 + "\n"); /* 4675
test_traitAliases.js These tests rely on data stored in a file, with the same format as traits.dat, that was trained in the following manner. There are two training messages, included here as files aliases1.eml and aliases2.eml Aliases.dat was trained on each of these messages, for different trait indices, as follows, with columns showing the training count for each trait index: file count(1001) count(1005) count(1007) count(1009) aliases1.eml 1 0 2 0 aliases2.eml 0 1 0 1 There is also a third email file, aliases3.eml, which combines tokens from aliases1.eml and aliases2.eml The goal here is to demonstrate that traits 1001 and 1007, and traits 1005 and 1009, can be combined using aliases. We classify messages with trait 1001 as the PRO trait, and 1005 as the ANTI trait. With these characteristics, I've run a trait analysis without aliases, and determined that the following is the correct percentage results from the analysis for each message. "Train11" means that the training was 1 pro count from aliases1.eml, and 1 anti count from alias2.eml. "Train32" is 3 pro counts, and 2 anti counts. percentage file Train11 Train32 alias1.eml 92 98 alias2.eml 8 3 alias3.eml 50 53 4947
test_traits.js 8229
xpcshell.ini 199