That my installation of Search::ContextGraph
is currently showing only very poor performance accuarcy wise... I had already observed
that it works better for posts with only a low word-cont and seemed to have a sharp drop-off for longer posts... but it also seems to decrease in accuracy with the size of the dataset...
Here's someone else who seems to have taken a look at it some time back: throwingbeans.org
Just for a test I staret up the old VectorSpace script I had done waaay back... it does have more 'relevant' hits... hmmm if only it was not so very slow... I can't sensibly run it on a regular basis. It takes more than 2 hours for each run to complete on my current set of 1500+ posts.
Just take this post
as example. The VectorSpace engine found all the relevant posts about my PowerBook decision making... whereas the ContextGraph found only one relation.. which is not really relevant.
[ by Martin>]
similar entries (vs):
- ContextGraph similar entries (# 14%)
- HolaryHey! (# 11%)
- similar entries VectorSpace (# 10%)
- How to re-present content in blogs (# 9%)
similar entries (cg):
no similar entries (yet?)