You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Jean-Baptiste Onofré (JIRA)" <ji...@apache.org> on 2017/07/03 14:48:01 UTC
[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input
component
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16072549#comment-16072549 ]
Jean-Baptiste Onofré commented on BEAM-2328:
--------------------------------------------
I'm still reviewing the PR. It's short to include in 2.1.0 release. Let's target on 2.2.0 release.
> Introduce Apache Tika Input component
> -------------------------------------
>
> Key: BEAM-2328
> URL: https://issues.apache.org/jira/browse/BEAM-2328
> Project: Beam
> Issue Type: New Feature
> Components: sdk-ideas, sdk-java-extensions
> Reporter: Sergey Beryozkin
> Assignee: Sergey Beryozkin
>
> Apache Tika is a popular project that offers an extensive support for parsing the variety of file formats. It is used in many projects including Lucene and Elastic Search.
> Supporting a Tika Input (Read) at the Beam level would be of major interest to many users.
> PR is to follow
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)