Posted to dev@tika.apache.org by Tim Allison <ta...@apache.org> on 2023/01/03 16:43:49 UTC
tika-python updates/thank you Chris!
All,
Chris Mattmann spent some time over the break updating tika-python
and closing out a _bunch_ of open issues on the tika-python repo
(https://github.com/chrismattmann/tika-python). The key updates (from
my perspective):
1) Updated to the 2.x release branch, specifically 2.6.0:
https://github.com/chrismattmann/tika-python/releases/tag/2.6.0
2) Allowed "raw" /rmeta output. The legacy behavior for tika-python
was to append fields for embedded files into a single metadata object,
which meant, for example, that users couldn't figure out which
embedded file a given "title" belonged to
(https://github.com/chrismattmann/tika-python/issues/375).
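
To illustrate the problem described in 2), here is a minimal sketch in plain Python. It does not call tika-python or a Tika server. The sample records, the helper names (merge_legacy_style, titles_by_path), and the exact merging rule are assumptions made for illustration, not the library's actual code. The key "X-TIKA:embedded_resource_path" is, as far as I know, the one Tika uses in /rmeta output to identify embedded files, but treat that as an assumption too.

```python
# Hypothetical sample shaped like /rmeta output: a list with one metadata
# dict per file, the container first, then each embedded file.
PATH_KEY = "X-TIKA:embedded_resource_path"  # assumed key name

rmeta_records = [
    {"title": "Quarterly report"},                       # container file
    {PATH_KEY: "/chart.xlsx", "title": "Sales chart"},   # embedded file
    {PATH_KEY: "/notes.docx", "title": "Meeting notes"}, # embedded file
]


def merge_legacy_style(records):
    """Approximate the legacy behavior: fold all records into one dict,
    collecting repeated keys into a list of values."""
    merged = {}
    for record in records:
        for key, value in record.items():
            if key not in merged:
                merged[key] = value
            elif isinstance(merged[key], list):
                merged[key].append(value)
            else:
                merged[key] = [merged[key], value]
    return merged


def titles_by_path(records):
    """With the raw list, each title stays attached to its own file.
    The container has no path key here, so it maps to the empty string."""
    return {r.get(PATH_KEY, ""): r["title"] for r in records if "title" in r}


merged = merge_legacy_style(rmeta_records)
per_file = titles_by_path(rmeta_records)
```

In the merged dict, "title" is a bare list of three strings, so nothing ties a given title to the file it came from. In the per-file mapping, "/chart.xlsx" still points to its own title.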
Many thanks, Chris!
Cheers,
Tim