You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by GitBox <gi...@apache.org> on 2020/09/03 16:32:20 UTC

[GitHub] [tika] tballison commented on pull request #338: Tika-2421 : About the encoding of HTML

tballison commented on pull request #338:
URL: https://github.com/apache/tika/pull/338#issuecomment-686610377


   Wait, it turns out I did get around to doing this study...
   
   https://github.com/tballison/share/blob/main/slides/Tika_charset_detector_study_201909.docx
   
   Let me read it and remember what I found... :rofl: 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org