You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2008/06/10 20:23:45 UTC

[jira] Commented: (SOLR-579) Extend SimplePost with RecurseDirectories, threads, document encoding , number of docs per commit

    [ https://issues.apache.org/jira/browse/SOLR-579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12603960#action_12603960 ] 

Hoss Man commented on SOLR-579:
-------------------------------

bq. with a simple perl script this can be converted into solr.

shouldn't it be just a easy for that perl script to POST the data to Solr as it is to write it to disk and then use SimplePostTool?

> Extend SimplePost with RecurseDirectories, threads, document encoding , number of docs per commit
> -------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-579
>                 URL: https://issues.apache.org/jira/browse/SOLR-579
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 1.3
>         Environment: Applies to all platforms
>            Reporter: Patrick Debois
>            Priority: Minor
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> -When specifying a directory, simplepost should read also the contents of a  directory
> New options for the commandline (some only usefull in DATAMODE= files)
> -RECURSEDIRS
>         Recursive read of directories as an option, this is usefull for directories with a lot of files where the commandline expansion fails and xargs is too slow
> -DOCENCODING (default = system encoding or UTF-8) 
>         For non utf-8 clients , simplepost should include a way to set the encoding of the documents posted
> -THREADSIZE (default =1 ) 
>         For large volume posts, a threading pool makes sense , using JDK 1.5 Threadpool model
> -DOCSPERCOMMIT (default = 1)
>         Number of documents after which a commit is done, instead of only at the end
> Note: not to break the existing behaviour of the existing SimplePost tool (post.sh) might be used in scripts 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.