Posted to commits@tika.apache.org by lf...@apache.org on 2017/04/19 22:51:56 UTC
[tika] branch master updated: update javadoc for Latin1StringsParser

This is an automated email from the ASF dual-hosted git repository.

lfcnassif pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/tika.git

The following commit(s) were added to refs/heads/master by this push:
       new  0f1034a   update javadoc for Latin1StringsParser
0f1034a is described below

commit 0f1034aef7c7b3f76eea7242052f5f89ddf94dbd
Author: Nassif <na...@dpf.gov.br>
AuthorDate: Wed Apr 19 17:31:00 2017 -0300

    update javadoc for Latin1StringsParser
---
 .../apache/tika/parser/strings/Latin1StringsParser.java   | 15 ++++++++++-----
 1 file changed, 10 insertions(+), 5 deletions(-)

diff --git a/tika-parsers/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java b/tika-parsers/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
index 5c6fb46..b2ea40b 100644
--- a/tika-parsers/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
+++ b/tika-parsers/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
@@ -28,13 +28,18 @@ import org.xml.sax.ContentHandler;
 import org.xml.sax.SAXException;
 
 /**
- * Parser to extract printable Latin1 strings from arbitrary files with pure
- * java. Useful for binary or unknown files, for files without a specific parser
- * and for corrupted ones causing a TikaException as a fallback parser.
- * 
+ * Parser to extract printable Latin1 strings from arbitrary files with pure java
+ * without running any external process. Useful for binary or unknown files, for
+ * files without a specific parser and for corrupted ones causing a TikaException
+ * as a fallback parser. To enable the parsing of unknown or files without a
+ * specific parser with AutoDetectParser:
+ * <p>
+ * AutoDetectParser parser = new AutoDetectParser();
+ * parser.setFallback(new Latin1StringsParser());
+ * </p>
  * Currently the parser does a best effort to extract Latin1 strings, used by
  * Western European languages, encoded with ISO-8859-1, UTF-8 or UTF-16 charsets
- * within the same file.
+ * mixed within the same file.
  * 
  * The implementation is optimized for fast parsing with only one pass.
  */
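
The javadoc above describes extracting printable Latin1 strings in a single pass. As a rough illustration of that idea only, here is a minimal sketch in plain Java. It is not the Latin1StringsParser implementation: the class name Latin1StringsSketch, the isPrintable rule, and the minLength parameter are all assumptions made for this example, and it handles only single-byte ISO-8859-1 input, not the UTF-8 or UTF-16 text the javadoc also mentions.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; NOT the Tika Latin1StringsParser implementation.
final class Latin1StringsSketch {

    // Assumption: a byte counts as printable if it is a tab, printable ASCII
    // (0x20-0x7E), or in the upper Latin1 range (0xA0-0xFF).
    static boolean isPrintable(int b) {
        return b == 0x09 || (b >= 0x20 && b <= 0x7E) || (b >= 0xA0 && b <= 0xFF);
    }

    // Walks the bytes once and collects every run of at least minLength
    // consecutive printable bytes as a string.
    static List<String> extract(byte[] data, int minLength) {
        List<String> result = new ArrayList<>();
        StringBuilder run = new StringBuilder();
        for (byte raw : data) {
            int b = raw & 0xFF; // treat the byte as unsigned
            if (isPrintable(b)) {
                // ISO-8859-1 maps each byte directly to U+0000..U+00FF
                run.append((char) b);
            } else {
                if (run.length() >= minLength) {
                    result.add(run.toString());
                }
                run.setLength(0);
            }
        }
        // Flush a run that reaches the end of the input
        if (run.length() >= minLength) {
            result.add(run.toString());
        }
        return result;
    }

    public static void main(String[] args) {
        byte[] data = {0x00, 'T', 'i', 'k', 'a', 0x00, 'o', 'k', 0x02,
                       (byte) 0xE9, 't', (byte) 0xE9, '!'};
        // Runs shorter than 4 bytes (here "ok") are dropped.
        System.out.println(extract(data, 4));
    }
}
```

The minimum run length is a common way to filter out noise from binary data; the value 4 used in main is arbitrary and chosen only for the demonstration.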
