Compression Analysis is a technique that can identify certain characteristics about text, without actually requiring a human to read the text. It uses ordinary file compression programs such as gzip, winzip, or arc.
(ps) Automated Categorization of Real-time Newswire Stories
[ by Martin>]
similar entries (vs):
similar entries (cg):