Posted to dev@tika.apache.org by "Tyler Palsulich (JIRA)" <ji...@apache.org> on 2015/01/28 18:45:34 UTC

[jira] [Reopened] (TIKA-1518) Docker with Tika Server

     [ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tyler Palsulich reopened TIKA-1518:
-----------------------------------

Reopening as suggested above.

1. I'm thinking we can place the Dockerfile in trunk/tika-server. Then anyone who wants to run it can build the image from scratch rather than pull it from Dockerhub. Alternatively, the tika-server readme could point to the raw Dockerfile for building from scratch and also explain how to pull the image from Dockerhub. It might be a pain to keep the two at the same version. How does that sound, Dave?

2. An out-there suggestion, just an idea: have an Apache-hosted "Dockerhub"... Thoughts? Several projects (Tika, Nutch, Gora, OODT, etc.) are interested in Docker integration.

3. Is there a convenient way to use tika-server through a browser (without curl)? If not, I'll open an issue. It would be nice to be able to stand up a server like http://any23-vm.apache.org/ for Tika. This Dockerfile would make that extremely easy.

> Docker with Tika Server
> -----------------------
>
>                 Key: TIKA-1518
>                 URL: https://issues.apache.org/jira/browse/TIKA-1518
>             Project: Tika
>          Issue Type: New Feature
>            Reporter: Paul Ramirez
>             Fix For: 1.8
>
>
> This version should be able to demonstrate as many of Apache Tika's capabilities as possible, for instance by including GDAL, Tesseract, and FFmpeg to show parsers that require other dependencies to be installed. In addition, this should help move TIKA-1301 forward and should leverage the suggestion made by [~lewismc] of a script that can pull down the latest version of Apache Tika.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)