You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by GitBox <gi...@apache.org> on 2022/03/06 13:29:24 UTC

[GitHub] [tika] lfcnassif edited a comment on pull request #520: Fix email detection (TIKA-3687)

lfcnassif edited a comment on pull request #520:
URL: https://github.com/apache/tika/pull/520#issuecomment-1059962861


   Also, if we look for additional headers if X|DKIM|ARC matches at the beginning, why not look for other headers if they match in a wide range of positions? (ok there is the \n but it is pretty common in txt files).
   
   Maybe we could use a regex to put all definitions together, regardless if they are at the beginning of the file or at the beginning of lines in the first 1024...


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@tika.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org