You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by lf...@apache.org on 2017/04/19 23:22:01 UTC
[tika] branch 2.x updated: update javadoc for Latin1StringsParser
This is an automated email from the ASF dual-hosted git repository.
lfcnassif pushed a commit to branch 2.x
in repository https://gitbox.apache.org/repos/asf/tika.git
The following commit(s) were added to refs/heads/2.x by this push:
new 73147a2 update javadoc for Latin1StringsParser
73147a2 is described below
commit 73147a23913f777d6bc5dff62633ad41970f0a4c
Author: lfcnassif <lf...@gmail.com>
AuthorDate: Wed Apr 19 20:20:13 2017 -0300
update javadoc for Latin1StringsParser
---
.../apache/tika/parser/strings/Latin1StringsParser.java | 15 ++++++++++-----
1 file changed, 10 insertions(+), 5 deletions(-)
diff --git a/tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java b/tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
index 5c6fb46..b2ea40b 100644
--- a/tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
+++ b/tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
@@ -28,13 +28,18 @@ import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
/**
- * Parser to extract printable Latin1 strings from arbitrary files with pure
- * java. Useful for binary or unknown files, for files without a specific parser
- * and for corrupted ones causing a TikaException as a fallback parser.
- *
+ * Parser to extract printable Latin1 strings from arbitrary files with pure java
+ * without running any external process. Useful for binary or unknown files, for
+ * files without a specific parser and for corrupted ones causing a TikaException
+ * as a fallback parser. To enable the parsing of unknown or files without a
+ * specific parser with AutoDetectParser:
+ * <p>
+ * AutoDetectParser parser = new AutoDetectParser();
+ * parser.setFallback(new Latin1StringsParser());
+ * </p>
* Currently the parser does a best effort to extract Latin1 strings, used by
* Western European languages, encoded with ISO-8859-1, UTF-8 or UTF-16 charsets
- * within the same file.
+ * mixed within the same file.
*
* The implementation is optimized for fast parsing with only one pass.
*/
--
To stop receiving notification emails like this one, please contact
['"commits@tika.apache.org" <co...@tika.apache.org>'].