Posted to solr-dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2008/06/10 20:23:45 UTC
[jira] Commented: (SOLR-579) Extend SimplePost with RecurseDirectories, threads, document encoding, number of docs per commit
[ https://issues.apache.org/jira/browse/SOLR-579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12603960#action_12603960 ]
Hoss Man commented on SOLR-579:
-------------------------------
bq. with a simple perl script this can be converted into solr.
Shouldn't it be just as easy for that Perl script to POST the data to Solr as it is to write it to disk and then use SimplePostTool?
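
To illustrate the point, here is a minimal sketch of a script posting directly, written in Python rather than Perl. It assumes Solr's XML update handler is reachable at http://localhost:8983/solr/update (adjust for your install); the function names and the example field names are hypothetical.

```python
import urllib.request
from xml.sax.saxutils import escape, quoteattr

# Assumption: a local Solr instance with the XML update handler at this URL.
SOLR_UPDATE_URL = "http://localhost:8983/solr/update"


def to_add_xml(fields):
    """Build a Solr <add> message for one document from a dict of fields."""
    parts = ["<add><doc>"]
    for name, value in fields.items():
        # Escape both the attribute and the text so arbitrary data is safe.
        parts.append("<field name=%s>%s</field>" % (quoteattr(name), escape(str(value))))
    parts.append("</doc></add>")
    return "".join(parts).encode("utf-8")


def build_request(body, url=SOLR_UPDATE_URL):
    """Wrap an XML body in an HTTP POST request without sending it."""
    headers = {"Content-Type": "text/xml; charset=utf-8"}
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def post(body, url=SOLR_UPDATE_URL):
    """Send the request and return the HTTP status code."""
    with urllib.request.urlopen(build_request(body, url)) as resp:
        return resp.status
```

A converting script could call post(to_add_xml(...)) for each record and finish with post(b"<commit/>"), skipping the intermediate files entirely.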
> Extend SimplePost with RecurseDirectories, threads, document encoding , number of docs per commit
> -------------------------------------------------------------------------------------------------
>
> Key: SOLR-579
> URL: https://issues.apache.org/jira/browse/SOLR-579
> Project: Solr
> Issue Type: New Feature
> Affects Versions: 1.3
> Environment: Applies to all platforms
> Reporter: Patrick Debois
> Priority: Minor
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> -When a directory is specified, SimplePost should also read the contents of that directory
> New options for the command line (some only useful when DATAMODE=files)
> -RECURSEDIRS
> Read directories recursively, as an option. This is useful for directories with many files, where command-line expansion fails and xargs is too slow
> -DOCENCODING (default = system encoding or UTF-8)
> For non-UTF-8 clients, SimplePost should include a way to set the encoding of the posted documents
> -THREADSIZE (default = 1)
> For large-volume posts, a thread pool makes sense, using the JDK 1.5 thread pool model
> -DOCSPERCOMMIT (default = 1)
> Number of documents after which a commit is done, instead of committing only at the end
> Note: this must not break the existing behaviour of the SimplePost tool, since post.sh might be used in scripts
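
For reference, the directory recursion, encoding, and commit-interval options quoted above could look roughly like the following sketch. It is not the proposed patch: it is written in Python rather than Java, all names are hypothetical, it counts each file as one document, and it leaves out the thread pool. The send callback stands in for whatever actually performs the HTTP POST.

```python
import os


def iter_files(root, recurse=True):
    """Yield file paths under root, descending into subdirectories if asked."""
    if os.path.isfile(root):
        yield root
        return
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # make the traversal order deterministic
        for name in sorted(filenames):
            yield os.path.join(dirpath, name)
        if not recurse:
            break  # only the top-level directory was requested


def post_tree(root, send, docs_per_commit=1, recurse=True, doc_encoding="utf-8"):
    """Send every file under root, committing after every docs_per_commit files.

    send(body, content_type) is a caller-supplied callback (hypothetical).
    Assumption: each file holds one document, so files are counted, not <doc>s.
    """
    content_type = "text/xml; charset=" + doc_encoding
    pending = 0
    for path in iter_files(root, recurse):
        with open(path, "rb") as f:
            send(f.read(), content_type)  # bytes are passed through unchanged
        pending += 1
        if pending >= docs_per_commit:
            send(b"<commit/>", content_type)
            pending = 0
    if pending:
        send(b"<commit/>", content_type)  # commit whatever is left over
```

Walking the tree inside the tool avoids both shell glob expansion and xargs, which is the motivation given for RECURSEDIRS.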