You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucy.apache.org by Peter Karman <pe...@peknet.com> on 2010/03/17 05:00:20 UTC

Re: [Lucy] Re: MoreLikeThisQuery

Marvin Humphrey wrote on 3/16/10 9:02 AM:

> His suggestion was to use OpenCyc to classify terms.
> 
> That's similar to what we'd do with topic vectors generated by an indexing
> component, except that the Cyc topic vectors were built laboriously by hand
> rather than using automatic dimension reduction.

I've been looking at the SenseClusters package. It's very unfriendly to use, but
the ideas in it are worth some investigation.

http://www.d.umn.edu/~tpederse/senseclusters.html

It uses SVDPACKC to do the big matrix math:
http://netlib.org/svdpack/

-- 
Peter Karman  .  http://peknet.com/  .  peter@peknet.com