You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/06/25 04:46:00 UTC

[jira] [Commented] (TIKA-3454) Facilitate configuration of translation and transcription impls in tika-server/tika-docker/tika-helm

    [ https://issues.apache.org/jira/browse/TIKA-3454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369234#comment-17369234 ] 

Lewis John McGibbney commented on TIKA-3454:
--------------------------------------------

There are a couple of confusing statements 

{quote}
Configuring Language Identifiers

At this time, there is no unified way to configure language identifiers. While the work on that is ongoing, for now you will need to review the Tika Javadocs to see how individual identifiers are configured.

Configuring Translators

At this time, there is no unified way to configure Translators. While the work on that is ongoing, for now you will need to review the Tika Javadocs to see how individual Translators are configured.
{quote}

I think this is going to take some collective input. Is anyone else interested in this?

> Facilitate configuration of translation and transcription impls in tika-server/tika-docker/tika-helm
> ----------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-3454
>                 URL: https://issues.apache.org/jira/browse/TIKA-3454
>             Project: Tika
>          Issue Type: Bug
>          Components: docker, helm, server
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Major
>             Fix For: 2.0.0
>
>
> I need an easy way to configure, for example, the [AmazonTranscribe|https://github.com/apache/tika/blob/main/tika-parsers/tika-parsers-ml/tika-transcribe-aws/src/main/java/org/apache/tika/parser/transcribe/aws/AmazonTranscribe.java] implementation when I deploy tika-server (tika-docker) via the Helm chart into Kubernetes. The same goes for TIka translation implementations.
> We have [documentation for configuring tika-server to run via Docker|https://github.com/apache/tika-docker#custom-config] however currently, there is [no way to configure translators or transcribers|https://tika.apache.org/1.26/configuring.html#Configuring_Translators]
>  
> This task will determine a sensible means by which we can configure translators and transcribers for tika-server such that it can be used further downstream via Docker and Helm on K8s.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)