You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Mattmann, Chris A (3010)" <ch...@jpl.nasa.gov> on 2017/03/08 05:01:54 UTC

Re: Query Regarding Apache Tika Language Ditector

Resending this to dev@tika.apache.org<ma...@tika.apache.org> rather than dev-owner.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Principal Data Scientist, Engineering Administrative Office (3010)
Manager, NSF & Open Source Projects Formulation and Development Offices (8212)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 180-503E, Mailstop: 180-503
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


From: Supriti Dan <su...@gmail.com>
Date: Tuesday, March 7, 2017 at 5:21 PM
To: "dev-owner@tika.apache.org" <de...@tika.apache.org>
Subject: Query Regarding Apache Tika Language Ditector

Hi Team,

I want to use Apache Tika for language detection purpose, could you please suggest me how many different language are detected well by Apache Tika. From the source (https://www.tutorialspoint.com/tika/tika_language_detection.htm) I found that Apache Tika support 18 languages but I believe the latest version support more then that.

Thanking you in advance.

Regards,
Supriti Dan