Posted to dev@tika.apache.org by Tim Allison <ta...@apache.org> on 2023/01/03 16:43:49 UTC

tika-python updates/thank you Chris!

All,

  Chris Mattmann spent some time over the break updating tika-python
and closing out a _bunch_ of open issues on the tika-python repo
(https://github.com/chrismattmann/tika-python).  The key updates (from
my perspective):

1) Updated to the 2.x release branch, specifically 2.6.0:
https://github.com/chrismattmann/tika-python/releases/tag/2.6.0

2) Allowed "raw" /rmeta output.  The legacy behavior for tika-python
was to append fields for embedded files into a single metadata object,
which meant, for example, that users couldn't figure out which
embedded file a given "title" belonged to
(https://github.com/chrismattmann/tika-python/issues/375).
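
To make the difference in (2) concrete, here is a rough sketch in plain Python. It does not call tika-python at all. The sample records, the file names, and the helpers merge_legacy_style and titles_by_resource are made up for illustration, and the merge logic is only an approximation of the kind of folding described above, not the library's actual code. The one thing it relies on from Tika itself is that /rmeta returns a list with one metadata object per file, container first.

```python
# Hypothetical /rmeta-style output: one metadata dict per document,
# container first, then each embedded file. All values are invented.
rmeta = [
    {"resourceName": "report.docx", "dc:title": "Annual Report"},
    {
        "resourceName": "chart.xlsx",
        "dc:title": "Q4 Figures",
        "X-TIKA:embedded_resource_path": "/chart.xlsx",
    },
]


def merge_legacy_style(records):
    """Fold every record into a single dict, turning repeated keys into lists.

    Illustrative only: an approximation of the merged output described
    above, not tika-python's implementation.
    """
    merged = {}
    for record in records:
        for key, value in record.items():
            if key not in merged:
                merged[key] = value
            elif isinstance(merged[key], list):
                merged[key].append(value)
            else:
                merged[key] = [merged[key], value]
    return merged


def titles_by_resource(records):
    """With the per-file list kept intact, each title stays tied to its file."""
    return {r.get("resourceName"): r.get("dc:title") for r in records}


merged = merge_legacy_style(rmeta)
by_resource = titles_by_resource(rmeta)

# Merged view: both titles survive, but nothing says which file owns which.
print(merged["dc:title"])
# Raw view: the title of each file can be looked up directly.
print(by_resource["chart.xlsx"])
```

In the merged dict the two titles end up side by side under one key, which is the ambiguity reported in the issue; keeping the list as returned avoids it.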

  Many thanks, Chris!

            Cheers,

                     Tim