Posted to dev@tika.apache.org by "Ethan Wilansky (Jira)" <ji...@apache.org> on 2022/10/20 19:23:00 UTC

[jira] [Created] (TIKA-3894) Documentation update needed

Ethan Wilansky created TIKA-3894:
------------------------------------

             Summary: Documentation update needed
                 Key: TIKA-3894
                 URL: https://issues.apache.org/jira/browse/TIKA-3894
             Project: Tika
          Issue Type: Improvement
          Components: documentation
    Affects Versions: 2.5.0
            Reporter: Ethan Wilansky


In the TikaServer documentation ([https://cwiki.apache.org/confluence/display/TIKA/TikaServer]), in the sections "Filtering Metadata Keys" and "Filtering Metadata Objects", I believe the <param> and <string> elements in the configuration examples need to be changed to <include> and <field> elements, respectively. Here is an example that I tested for filtering metadata keys:

{code:xml}
  ...
  <metadataFilters>
    <metadataFilter class="org.apache.tika.metadata.filter.IncludeFieldMetadataFilter">
      <params>
        <include>
          <field>extended-properties:Application</field>
          <field>xmpTPg:NPages</field>
          <field>meta:page-count</field>
          <field>meta:line-count</field>
          <field>X-TIKA:content</field>
        </include>
      </params>
    </metadataFilter>
  </metadataFilters>
  ...
{code}
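
For anyone updating the wiki examples, the nesting above (<include> wrapping <field> elements) can be sanity-checked without starting Tika. The following is only a rough sketch using the JDK's built-in XML parser. The class name IncludeFieldCheck and the method includedFields are hypothetical helpers, not part of Tika, and the sample string reproduces only the <metadataFilters> fragment from this report rather than a full configuration file.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper (not a Tika class): lists the <field> values found
// under each <include> element of a metadataFilters fragment.
final class IncludeFieldCheck {

    // The fragment from this report, with <metadataFilters> as the root.
    // The surrounding configuration elided by "..." is deliberately left out.
    static final String SAMPLE = String.join("\n",
        "<metadataFilters>",
        "  <metadataFilter class=\"org.apache.tika.metadata.filter.IncludeFieldMetadataFilter\">",
        "    <params>",
        "      <include>",
        "        <field>extended-properties:Application</field>",
        "        <field>xmpTPg:NPages</field>",
        "        <field>meta:page-count</field>",
        "        <field>meta:line-count</field>",
        "        <field>X-TIKA:content</field>",
        "      </include>",
        "    </params>",
        "  </metadataFilter>",
        "</metadataFilters>");

    // Parses the XML and returns the text of every <field> nested in an <include>.
    static List<String> includedFields(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
        List<String> fields = new ArrayList<>();
        NodeList includes = doc.getElementsByTagName("include");
        for (int i = 0; i < includes.getLength(); i++) {
            NodeList children = ((Element) includes.item(i)).getElementsByTagName("field");
            for (int j = 0; j < children.getLength(); j++) {
                fields.add(children.item(j).getTextContent().trim());
            }
        }
        return fields;
    }

    public static void main(String[] args) throws Exception {
        // Print one metadata key per line.
        for (String field : includedFields(SAMPLE)) {
            System.out.println(field);
        }
    }
}
```

This only confirms that the fragment is well-formed and uses the <include>/<field> nesting. It does not confirm that Tika accepts the configuration, which still has to be tested against a running server as described above.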



--
This message was sent by Atlassian Jira
(v8.20.10#820010)