You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Ingo Renner (JIRA)" <ji...@apache.org> on 2010/02/24 12:42:27 UTC
[jira] Commented: (TIKA-169) Tika Web Service Servlet
[ https://issues.apache.org/jira/browse/TIKA-169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837750#action_12837750 ]
Ingo Renner commented on TIKA-169:
----------------------------------
I see a servlet making quite some sense - think of Solr, but only having the extraction request handler... That way you could have a central meta data / text extracting server without needing to install java + tika on all the hosts where you might need it in a replicated CMS environment f.e.
So the scenario would be that a CMS trys to extract text, meta data from a file, but does not have a local tika at hand. It would then send the file to a Tika server and get the results back in XML or JSON like Solr does.
> Tika Web Service Servlet
> ------------------------
>
> Key: TIKA-169
> URL: https://issues.apache.org/jira/browse/TIKA-169
> Project: Tika
> Issue Type: New Feature
> Components: general
> Affects Versions: 0.2
> Reporter: Rida Benjelloun
> Priority: Minor
> Attachments: tikaServlet.war
>
>
> Tika servlet, use file or directory path to build a list of XML documents. The next version will allow file upload.
> Usage :
> //Extract document content and metadata
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10
> //Extract metadata
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10&extract=metadata
> //Extract document content
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10&extract=content
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.