[current]
Automatic Document Classification With Perl
based on 'naive bayesian'... this seems to be exactly what I was searching for.
AI::Categorize::NaiveBayes
allows the user to feed it the text of several documents (the training set), which it will parse and add to the word frequency database
<snip />
Once a sufficient number of training documents have been fed to the database and the needed probabilities have been calculated, we can start asking AI::Categorize::NaiveBayes
to categorize new documents that it hasn't seen before. It returns to us an ordered list of the most probable categories for that document.
TeledyN: Comment on Graham's Plan for Spam
The specific filtering of spam also reminded me of the 1994 ACM project to produce a collaborative filter to rid the dying USENET from spam attacks; http://www.si.umich.edu/~presnick/papers/cscw94/GroupLens.htm
chronicals the GroupLens project, and sure enough, there's the same Bayesian method at the root of it.The ifile Web Site
ifile is a general mail filtering system that works with a mail client to intelligently filter mail according to the way the user tends to organize mail. ifile uses the machine learning algorithm Naive Bayes to classify e-mail documents. Freely Available Filtering Systems, Information Filtering ResourcesPersonal WebWatcher, Project Page
Personal WebWatcher is a "personal" agent that accompanies you from page to page as you browse the web, highlighting hyperlinks that it believes will be of interest. Its strategy for giving advice is learned from feedback from earlier tours.
[ by Martin>]
[]
[]
similar entries (vs):
similar entries (cg):