You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Tomoko Uchida (JIRA)" <ji...@apache.org> on 2019/07/24 01:13:00 UTC

[jira] [Comment Edited] (LUCENE-8764) Add "export all terms" feature to Luke

    [ https://issues.apache.org/jira/browse/LUCENE-8764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16891158#comment-16891158 ] 

Tomoko Uchida edited comment on LUCENE-8764 at 7/24/19 1:12 AM:
----------------------------------------------------------------

Hi,
{quote}The way I see, this requires at least:
 - field (which right now just uses the table used for top terms)
 - destination file (to make things simpler, we could avoid this and dump to current working directory + /export/\{timestamp}_\{field}.out for example){quote}
Thanks, it looks good starting point to me.

There is little space left in the main window as you mentioned, a popup dialog would be desired. "Export all terms" button or a menu to show the popup, either one or both, look fine to me. Term and/or document frequencies might be useful for everyone. Also, some filters could be there (e.g. export terms that have specific prefix or match a regular expression, or have specified minimum term/document frequencies), but I'm not fully sure what options should be provided :)

About the document id, this is the original request from a user on the github project. I just copied it here without deep thoughts and have no strong feeling about it. We can delay and consider about that later (possibly at another issue) if it does not make sense to you. 
  


was (Author: tomoko uchida):
Hi,
{quote}The way I see, this requires at least:
 - field (which right now just uses the table used for top terms)
 - destination file (to make things simpler, we could avoid this and dump to current working directory + /export/\{timestamp}_\{field}.out for example){quote}
Thanks, it looks good starting point to me.

There is little space left in the main window as you mentioned, a popup dialog would be desired. "Export all terms" button or a menu, either one or both, look fine to me. Term and/or document frequencies might be useful for everyone. Also, some filters could be there (e.g. export terms that have specific prefix or match a regular expression, or have specified minimum term/document frequencies), but I'm not fully sure what options should be provided :)

About the document id, this is the original request from a user on the github project. I just copied it here without no deep thoughts and have no strong feeling about it. We can delay and consider about that later (possibly at another issue) if it does not make sense to you. 
  

> Add "export all terms" feature to Luke
> --------------------------------------
>
>                 Key: LUCENE-8764
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8764
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/luke
>            Reporter: Tomoko Uchida
>            Priority: Major
>              Labels: beginner
>         Attachments: LUCENE-8764.patch, Screenshot 2019-07-23 12.29.06.png
>
>
> This is a migrated issue from previous Luke project in GitHub: [https://github.com/DmitryKey/luke/issues/3] (There are users' requests so I moved this from GitHub to Jira)
> You can browse terms in arbitrary field via Luke GUI, but in some cases "exporting all terms (and optionally docids) to a file" feature would be useful for further inspection. It might be similar to Solr's terms component.
> As for the user interface, "Export terms" button should be located in Overview tab and/or Documents tab.
>  



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org