You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Mark Dos Santos (JIRA)" <ji...@apache.org> on 2005/12/06 22:07:08 UTC

[jira] Created: (LUCENE-477) Build an index which allows me to broswe by category.

Build an index which allows me to broswe by category.
-----------------------------------------------------

         Key: LUCENE-477
         URL: http://issues.apache.org/jira/browse/LUCENE-477
     Project: Lucene - Java
        Type: Task
  Components: Index  
    Versions: 1.4    
 Environment: JDK 1.4, Windows 2003, Tomcat 5.0.28
    Reporter: Mark Dos Santos


Hello there,

I have a collection of documents that I am using lucene to build an index for, and then I have a jsp app to search my documents. This all works great. I believe lucene is such an amazing product, but thats a whole other topic. Anyway, maybe it's my lack of experience in building indexes, but I am have trouble coming up with an index that kind of mimics verity's parametric index.  You see my documents all have a category path (I have over 50,000 docs).  A document can be at any level of the category path, and that same path can have many different documents. IE. Document x, has a category path USA//New Jersey//Trenton//09890 and Document y has a category path USA//New Jersey//Trenton//09890.  

Basically, I would like to build an index using lucene, where when I search, if my results were to bring back those two documents, I would like to retrieve the distinct category path for those two documents.  Of course I can loop through and build a vector with only the unique paths that come in the search results, but that obviously would take to long when I get lets say 10000 results from my search.

So the question I guess is, how can I build an index that would facilitate this functionality for me.  If anyone has any suggestions I would greatly appreciate it.

Thanks,
Mark

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Closed: (LUCENE-477) Build an index which allows me to broswe by category.

Posted by "Erik Hatcher (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/LUCENE-477?page=all ]
     
Erik Hatcher closed LUCENE-477:
-------------------------------

    Resolution: Invalid

Yes, please bring this topic to the user list rather than JIRA

> Build an index which allows me to broswe by category.
> -----------------------------------------------------
>
>          Key: LUCENE-477
>          URL: http://issues.apache.org/jira/browse/LUCENE-477
>      Project: Lucene - Java
>         Type: Task
>   Components: Index
>     Versions: 1.4
>  Environment: JDK 1.4, Windows 2003, Tomcat 5.0.28
>     Reporter: Mark Dos Santos

>
> Hello there,
> I have a collection of documents that I am using lucene to build an index for, and then I have a jsp app to search my documents. This all works great. I believe lucene is such an amazing product, but thats a whole other topic. Anyway, maybe it's my lack of experience in building indexes, but I am have trouble coming up with an index that kind of mimics verity's parametric index.  You see my documents all have a category path (I have over 50,000 docs).  A document can be at any level of the category path, and that same path can have many different documents. IE. Document x, has a category path USA//New Jersey//Trenton//09890 and Document y has a category path USA//New Jersey//Trenton//09890.  
> Basically, I would like to build an index using lucene, where when I search, if my results were to bring back those two documents, I would like to retrieve the distinct category path for those two documents.  Of course I can loop through and build a vector with only the unique paths that come in the search results, but that obviously would take to long when I get lets say 10000 results from my search.
> So the question I guess is, how can I build an index that would facilitate this functionality for me.  If anyone has any suggestions I would greatly appreciate it.
> Thanks,
> Mark

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Commented: (LUCENE-477) Build an index which allows me to broswe by category.

Posted by "Hoss Man (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/LUCENE-477?page=comments#action_12359495 ] 

Hoss Man commented on LUCENE-477:
---------------------------------

This isn't a "bug" or a "feature" or a "task" as much as it is a "question" about using lucene in a particular way.  Questions generally recieve more comment on the java-user at lucene dot apache dot org then they do when posted in JIRA.

In particular, you should search the mailing list archive for "facet" or "faceted" before you ask teh question, previous discussions may give you enough info to solve your problem.

> Build an index which allows me to broswe by category.
> -----------------------------------------------------
>
>          Key: LUCENE-477
>          URL: http://issues.apache.org/jira/browse/LUCENE-477
>      Project: Lucene - Java
>         Type: Task
>   Components: Index
>     Versions: 1.4
>  Environment: JDK 1.4, Windows 2003, Tomcat 5.0.28
>     Reporter: Mark Dos Santos

>
> Hello there,
> I have a collection of documents that I am using lucene to build an index for, and then I have a jsp app to search my documents. This all works great. I believe lucene is such an amazing product, but thats a whole other topic. Anyway, maybe it's my lack of experience in building indexes, but I am have trouble coming up with an index that kind of mimics verity's parametric index.  You see my documents all have a category path (I have over 50,000 docs).  A document can be at any level of the category path, and that same path can have many different documents. IE. Document x, has a category path USA//New Jersey//Trenton//09890 and Document y has a category path USA//New Jersey//Trenton//09890.  
> Basically, I would like to build an index using lucene, where when I search, if my results were to bring back those two documents, I would like to retrieve the distinct category path for those two documents.  Of course I can loop through and build a vector with only the unique paths that come in the search results, but that obviously would take to long when I get lets say 10000 results from my search.
> So the question I guess is, how can I build an index that would facilitate this functionality for me.  If anyone has any suggestions I would greatly appreciate it.
> Thanks,
> Mark

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org