You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hudson (Jira)" <ji...@apache.org> on 2022/04/08 19:10:00 UTC

[jira] [Commented] (TIKA-3717) Add language detector metadata filters for optimaize and opennlp

    [ https://issues.apache.org/jira/browse/TIKA-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519765#comment-17519765 ] 

Hudson commented on TIKA-3717:
------------------------------

SUCCESS: Integrated in Jenkins build Tika ยป tika-main-jdk8 #511 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/511/])
TIKA-3717 -- add metadata filters for optimaize and opennlp lang detectors (tallison: [https://github.com/apache/tika/commit/fa6c4baac4502f0bbda7f92d71a69493d9805399])
* (add) tika-langdetect/tika-langdetect-opennlp/src/main/java/org/apache/tika/langdetect/opennlp/metadatafilter/OpenNLPMetadataFilter.java
* (add) tika-server/tika-server-standard/src/test/resources/config/tika-config-langdetect-opennlp-filter.xml
* (add) tika-server/tika-server-standard/src/test/resources/config/tika-config-langdetect-optimaize-filter.xml
* (add) tika-server/tika-server-standard/src/test/java/org/apache/tika/server/standard/OpenNLPMetadataFilterTest.java
* (add) tika-server/tika-server-standard/src/test/java/org/apache/tika/server/standard/OptimaizeMetadataFilterTest.java
* (add) tika-langdetect/tika-langdetect-optimaize/src/main/java/org/apache/tika/langdetect/optimaize/metadatafilter/OptimaizeMetadataFilter.java
* (edit) CHANGES.txt
* (edit) tika-server/tika-server-standard/pom.xml
* (edit) tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java
* (edit) tika-core/src/main/java/org/apache/tika/metadata/Metadata.java
* (edit) tika-langdetect/tika-langdetect-optimaize/src/main/java/org/apache/tika/langdetect/optimaize/OptimaizeLangDetector.java


> Add language detector metadata filters for optimaize and opennlp
> ----------------------------------------------------------------
>
>                 Key: TIKA-3717
>                 URL: https://issues.apache.org/jira/browse/TIKA-3717
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>             Fix For: 2.4.0
>
>
> This will allow easier integration of language detection+parsing in tika server and tika-app.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)