Posted to dev@tika.apache.org by "Ethan Wilansky (Jira)" <ji...@apache.org> on 2022/10/20 19:23:00 UTC
[jira] [Created] (TIKA-3894) Documentation update needed
Ethan Wilansky created TIKA-3894:
------------------------------------
Summary: Documentation update needed
Key: TIKA-3894
URL: https://issues.apache.org/jira/browse/TIKA-3894
Project: Tika
Issue Type: Improvement
Components: documentation
Affects Versions: 2.5.0
Reporter: Ethan Wilansky
In the TikaServer documentation ([https://cwiki.apache.org/confluence/display/TIKA/TikaServer]), in the sections "Filtering Metadata Keys" and "Filtering Metadata Objects", I believe the <param> and <string> elements in the configuration examples need to be changed to <include> and <field> elements, respectively. Here is a configuration I tested for filtering metadata keys:
{code:xml}
...
<metadataFilters>
  <metadataFilter class="org.apache.tika.metadata.filter.IncludeFieldMetadataFilter">
    <params>
      <include>
        <field>extended-properties:Application</field>
        <field>xmpTPg:NPages</field>
        <field>meta:page-count</field>
        <field>meta:line-count</field>
        <field>X-TIKA:content</field>
      </include>
    </params>
  </metadataFilter>
</metadataFilters>
...
{code}
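As a quick sanity check that does not involve Tika at all, a minimal sketch like the one below parses the filter fragment with Python's standard library and lists the fields it would include. The names FRAGMENT and included_fields are illustrative assumptions, and the elided parts of the configuration around the fragment are left out. It only confirms that the XML is well-formed and uses the <include>/<field> nesting; it says nothing about how Tika itself reads the file.

```python
import xml.etree.ElementTree as ET

# The <metadataFilters> element from the report above, without the elided
# surrounding configuration.
FRAGMENT = """
<metadataFilters>
  <metadataFilter class="org.apache.tika.metadata.filter.IncludeFieldMetadataFilter">
    <params>
      <include>
        <field>extended-properties:Application</field>
        <field>xmpTPg:NPages</field>
        <field>meta:page-count</field>
        <field>meta:line-count</field>
        <field>X-TIKA:content</field>
      </include>
    </params>
  </metadataFilter>
</metadataFilters>
"""


def included_fields(xml_text):
    """Return the text of every <field> under <params>/<include>, in document order."""
    root = ET.fromstring(xml_text)  # raises ParseError if the XML is malformed
    path = "./metadataFilter/params/include/field"
    return [(field.text or "").strip() for field in root.findall(path)]


fields = included_fields(FRAGMENT)
for name in fields:
    print(name)
```

If the list comes back empty, the fragment is probably still using the older <param>/<string> element names, since the path only matches the <include>/<field> nesting.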
--
This message was sent by Atlassian Jira
(v8.20.10#820010)