You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucy.apache.org by Peter Karman <pe...@peknet.com> on 2010/03/17 05:00:20 UTC
Re: [Lucy] Re: MoreLikeThisQuery
Marvin Humphrey wrote on 3/16/10 9:02 AM:
> His suggestion was to use OpenCyc to classify terms.
>
> That's similar to what we'd do with topic vectors generated by an indexing
> component, except that the Cyc topic vectors were built laboriously by hand
> rather than using automatic dimension reduction.
I've been looking at the SenseClusters package. It's very unfriendly to use, but
the ideas in it are worth some investigation.
http://www.d.umn.edu/~tpederse/senseclusters.html
It uses SVDPACKC to do the big matrix math:
http://netlib.org/svdpack/
--
Peter Karman . http://peknet.com/ . peter@peknet.com