You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@opennlp.apache.org by Madhawa Kasun Gunasekara <ma...@gmail.com> on 2016/03/17 06:51:56 UTC

GSOC2016 Sentiment Analysis

Hi

I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
GSOC2016 this time. Since i have been engaging with some similar projects i
think it will be a great experience for me.

I am a final year student in IESL College of Engineering, Sri lanka. I have
learned machine learning and natural language processing stuff when I'm
doing my first degree (Computer Science) in University of Sri
Jayewardhenapura.

In my internship period, I have actively contributed to a Twitter based NLP
project. and We have published an article on IEEE Conference, "Real-time
Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .

Please let me know what you think and what you suggest.

Please kindly give me further information on how I could proceed. I
couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
in Twitter: a Pattern-Based Approach"
[1] https://issues.apache.org/jira/browse/OPENNLP-840
[2] http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7377667

Thanks
Madhawa Gunasekara

Re: GSOC2016 Sentiment Analysis

Posted by Madhawa Kasun Gunasekara <ma...@gmail.com>.
Hi Chris,

Thanks for the response, I'm really do interested about this project. I
would like to know your thoughts on this project ?

Thanks,
Madhawa

Madhawa

On Thu, Mar 17, 2016 at 11:26 AM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Hi Madhawa,
>
> I have several students working on these types of analytics as
> a combination of Tika and things like OpenNLP. If you are interested
> I would be happy to help mentor. FYI my group’s page at USC and at
> JPL:
>
> http://irds.usc.edu/
> http://memex.jpl.nasa.gov/
>
> I’m not an OpenNLP PMC but use OpenNLP and would be happy to help
> mentor.
>
> Cheers,
> Chris
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Madhawa Kasun Gunasekara <ma...@gmail.com>
> Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
> Date: Wednesday, March 16, 2016 at 10:51 PM
> To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
> Subject: GSOC2016 Sentiment Analysis
>
> >Hi
> >
> >I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
> >GSOC2016 this time. Since i have been engaging with some similar projects
> >i
> >think it will be a great experience for me.
> >
> >I am a final year student in IESL College of Engineering, Sri lanka. I
> >have
> >learned machine learning and natural language processing stuff when I'm
> >doing my first degree (Computer Science) in University of Sri
> >Jayewardhenapura.
> >
> >In my internship period, I have actively contributed to a Twitter based
> >NLP
> >project. and We have published an article on IEEE Conference, "Real-time
> >Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
> >
> >Please let me know what you think and what you suggest.
> >
> >Please kindly give me further information on how I could proceed. I
> >couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
> >in Twitter: a Pattern-Based Approach"
> >[1] https://issues.apache.org/jira/browse/OPENNLP-840
> >[2] http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7377667
> >
> >Thanks
> >Madhawa Gunasekara
>
>

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Hi Madhawa,

I have several students working on these types of analytics as
a combination of Tika and things like OpenNLP. If you are interested
I would be happy to help mentor. FYI my group’s page at USC and at
JPL:

http://irds.usc.edu/
http://memex.jpl.nasa.gov/

I’m not an OpenNLP PMC but use OpenNLP and would be happy to help
mentor.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Madhawa Kasun Gunasekara <ma...@gmail.com>
Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
Date: Wednesday, March 16, 2016 at 10:51 PM
To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
Subject: GSOC2016 Sentiment Analysis

>Hi
>
>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>GSOC2016 this time. Since i have been engaging with some similar projects
>i
>think it will be a great experience for me.
>
>I am a final year student in IESL College of Engineering, Sri lanka. I
>have
>learned machine learning and natural language processing stuff when I'm
>doing my first degree (Computer Science) in University of Sri
>Jayewardhenapura.
>
>In my internship period, I have actively contributed to a Twitter based
>NLP
>project. and We have published an article on IEEE Conference, "Real-time
>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>
>Please let me know what you think and what you suggest.
>
>Please kindly give me further information on how I could proceed. I
>couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
>in Twitter: a Pattern-Based Approach"
>[1] https://issues.apache.org/jira/browse/OPENNLP-840
>[2] http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7377667
>
>Thanks
>Madhawa Gunasekara


Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Great that sound awesome Anthony. Friday at 10am PT it is. Please
add chris.mattmann@gmail.com to your GHangout buddy list.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Anthony Beylerian <an...@hotmail.com>
Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
Date: Tuesday, March 29, 2016 at 8:32 AM
To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Mondher Bouazizi
<mo...@gmail.com>, Madhawa Kasun Gunasekara
<ma...@gmail.com>
Cc: "dev@tika.apache.org" <de...@tika.apache.org>, Information and Data
Science Group USC List <ir...@mymaillists.usc.edu>
Subject: RE: GSOC2016 Sentiment Analysis

>Dear Chris,
>
>Thank you again for reviewing our proposals.
>We are looking forward to working together on this.
>
>In our previous trials we have used an annotated corpus made through
>crowdflower for testing, and would be happy to share.
>Although relatively modest and noisy (~10k training ~8k testing ~20k
>pattern extraction) we believe it was enough to demonstrate encouraging
>performances.
>From our side, we also have a Java implementation that we would like to
>shape up for production, however I'm also comfortable with Python in case
>we will need it.
>
>On the other hand, it sounds intriguing to use a cross-lingual corpus, we
>would love to discuss it.
>As for the hangout session, I have just checked with Mondher and the time
>works for us.
>
>Best,
>
>Anthony
>
>
>> From: chris.a.mattmann@jpl.nasa.gov
>> To: mondher.bouazizi@gmail.com; madhawa30@gmail.com
>> CC: anthonybeylerian@hotmail.com; dev@opennlp.apache.org;
>>dev@tika.apache.org; irds-L@mymaillists.usc.edu
>> Subject: Re: GSOC2016 Sentiment Analysis
>> Date: Tue, 29 Mar 2016 13:57:11 +0000
>> 
>> I like both of your comments Mondher and Madhawa. My team at USC
>> has been investigating the use of particular corpuses including
>> Fisher Callhome so as to support sentiment analysis. We have been
>> writing Java code outside of both OpenNLP and Tika but with the
>> goal of integrating them into both. We have a mix of Java and
>> Python code that we’d like to bring into both projects.
>> 
>> I’m reviewing the proposals you wrote now, but would it make sense
>> to have a Google hangout this Friday, ~10am PT Los Angeles/time?
>> 
>> Cheers,
>> Chris
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattmann@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Director, Information Retrieval and Data Science Group (IRDS)
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> WWW: http://irds.usc.edu/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
>> 
>> 
>> 
>> 
>> -----Original Message-----
>> From: Mondher Bouazizi <mo...@gmail.com>
>> Date: Monday, March 28, 2016 at 11:46 PM
>> To: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
>> <ch...@jpl.nasa.gov>
>> Cc: Anthony Beylerian <an...@hotmail.com>,
>> "dev@opennlp.apache.org" <de...@opennlp.apache.org>, "dev@tika.apache.org"
>> <de...@tika.apache.org>, Information and Data Science Group USC List
>> <ir...@mymaillists.usc.edu>
>> Subject: Re: GSOC2016 Sentiment Analysis
>> 
>> >Dear Madhawa,
>> >
>> >
>> >Thank you for your interest in the proposals.
>> >The current tasks we proposed refer to the classification and
>> >quantification regardless of the topic.
>> >This can be used in a larger context where the topic is not specified,
>>or
>> >not unique, in which case we will need to identify the topic(s).
>> >Therefore, a topic detector would be a good idea to implement, in order
>> >to complement this.
>> >
>> >
>> >As for the Document Categorizer, it is a general purpose component with
>> >basic features (n-gram, bag of words, etc.).
>> >
>> >It is basically used for the classification of texts into a set of
>> >classes defined by the user, whether they are sentiment classes or
>>other.
>> >
>> >However it doesn't perform well for this purpose.
>> >
>> >Furthermore, the sentiment analysis component would not just perform
>>the
>> >naive classification but also additional tasks (e.g., quantification)
>>and
>> >implement more specific and sophisticated approaches.
>> >
>> >
>> >Please share your thoughts.
>> >
>> >
>> >Mondher
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >On Tue, Mar 29, 2016 at 1:51 PM, Madhawa Kasun Gunasekara
>> ><ma...@gmail.com> wrote:
>> >
>> >Hi Chris / Antony
>> >
>> >
>> >yes I would like to work on this, This proposal address most of the
>> >things in Sentiment analysis,
>> >
>> >AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
>> >Analysis, since there isn't a proper functionality to do sentiment
>> >analysis in OpenNLP, This would be great if we can add this feature on
>> >OpenNLP project, and also I would like to suggest
>> > that we should able to detect the target object of the opinions from
>> >this feature as well.
>> >
>> >
>> >WDYT ??
>> >
>> >
>> >
>> >Thanks,
>> >
>> >Madhawa
>> >
>> >
>> >Madhawa
>> >
>> >
>> >
>> >
>> >On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980)
>> ><ch...@jpl.nasa.gov> wrote:
>> >
>> >Dear Anthony,
>> >
>> >Great! These both sound like fantastic proposals and I’m happy
>> >to be a mentor. Madhawa, would you like to join in on these
>> >efforts?
>> >
>> >Cheers,
>> >Chris
>> >
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >Chris Mattmann, Ph.D.
>> >Chief Architect
>> >Instrument Software and Science Data Systems Section (398)
>> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >Office: 168-519, Mailstop: 168-527
>> >Email: chris.a.mattmann@nasa.gov
>> >WWW:  
>> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >Director, Information Retrieval and Data Science Group (IRDS)
>> >Adjunct Associate Professor, Computer Science Department
>> >University of Southern California, Los Angeles, CA 90089 USA
>> >WWW: http://irds.usc.edu/
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >
>> >
>> >
>> >
>> >
>> >-----Original Message-----
>> >From: Anthony Beylerian <an...@hotmail.com>
>> >Date: Monday, March 28, 2016 at 11:48 AM
>> >To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
>> >"mondher.bouazizi@gmail.com" <mo...@gmail.com>
>> >Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
>> ><ch...@jpl.nasa.gov>
>> >Subject: RE: GSOC2016 Sentiment Analysis
>> >
>> >>Dear Chris,
>> >>
>> >>Thank you for starting the discussion.
>> >>We are glad there is an interest in a sentiment analysis component.
>> >>
>> >>My colleague Mondher posted the two JIRA issues related to Sentiment
>> >>Analysis [1][2] as references for our proposals [3][4] for GSoC.
>> >>In fact, we have been researching this topic at our university.
>> >>We are hoping to participate this year and work on integrating both a
>> >>sentiment classifier and a quantifier for the library.
>> >>
>> >>It would be nice to also have an interface with Tika, maybe we can
>> >>collaborate ?
>> >>We are also looking for mentors, in case someone is willing to support
>> >>our proposals.
>> >>
>> >>Best,
>> >>
>> >>Anthony
>> >>
>> >>[1] 
>> >https://issues.apache.org/jira/browse/OPENNLP-842
>> ><https://issues.apache.org/jira/browse/OPENNLP-842>
>> >>[2] 
>> >https://issues.apache.org/jira/browse/OPENNLP-840
>> ><https://issues.apache.org/jira/browse/OPENNLP-840>
>> >>[3]
>> 
>>>>https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-m
>>>>kg
>> >>W
>> >>nR8n0/edit?usp=sharing
>> >>[4]
>> 
>>>>https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIW
>>>>TC
>> >>X
>> >>EOJvo/edit?usp=sharing
>> >>
>> >>> From: chris.a.mattmann@jpl.nasa.gov
>> >>> To: nishant.k02@gmail.com
>> >>> CC: dev@opennlp.apache.org;
>> >madhawa30@gmail.com;
>> >hmanjuna@usc.edu <ma...@usc.edu>;
>> >>>kamalaku@usc.edu
>> >>> Subject: Re: GSOC2016 Sentiment Analysis
>> >>> Date: Sun, 27 Mar 2016 19:34:24 +0000
>> >>>
>> >>> No problem - I just wanted to encourage discussion thank you for
>> >>> your prompt and courteous replies.
>> >>>
>> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >>> Chris Mattmann, Ph.D.
>> >>> Chief Architect
>> >>> Instrument Software and Science Data Systems Section (398)
>> >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >>> Office: 168-519, Mailstop: 168-527
>> >>> Email: chris.a.mattmann@nasa.gov
>> >>> WWW: 
>> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >>> Director, Information Retrieval and Data Science Group (IRDS)
>> >>> Adjunct Associate Professor, Computer Science Department
>> >>> University of Southern California, Los Angeles, CA 90089 USA
>> >>> WWW: http://irds.usc.edu/
>> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >>
>> >>
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> 
> 		 	   		  


Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Great that sound awesome Anthony. Friday at 10am PT it is. Please
add chris.mattmann@gmail.com to your GHangout buddy list.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Anthony Beylerian <an...@hotmail.com>
Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
Date: Tuesday, March 29, 2016 at 8:32 AM
To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Mondher Bouazizi
<mo...@gmail.com>, Madhawa Kasun Gunasekara
<ma...@gmail.com>
Cc: "dev@tika.apache.org" <de...@tika.apache.org>, Information and Data
Science Group USC List <ir...@mymaillists.usc.edu>
Subject: RE: GSOC2016 Sentiment Analysis

>Dear Chris,
>
>Thank you again for reviewing our proposals.
>We are looking forward to working together on this.
>
>In our previous trials we have used an annotated corpus made through
>crowdflower for testing, and would be happy to share.
>Although relatively modest and noisy (~10k training ~8k testing ~20k
>pattern extraction) we believe it was enough to demonstrate encouraging
>performances.
>From our side, we also have a Java implementation that we would like to
>shape up for production, however I'm also comfortable with Python in case
>we will need it.
>
>On the other hand, it sounds intriguing to use a cross-lingual corpus, we
>would love to discuss it.
>As for the hangout session, I have just checked with Mondher and the time
>works for us.
>
>Best,
>
>Anthony
>
>
>> From: chris.a.mattmann@jpl.nasa.gov
>> To: mondher.bouazizi@gmail.com; madhawa30@gmail.com
>> CC: anthonybeylerian@hotmail.com; dev@opennlp.apache.org;
>>dev@tika.apache.org; irds-L@mymaillists.usc.edu
>> Subject: Re: GSOC2016 Sentiment Analysis
>> Date: Tue, 29 Mar 2016 13:57:11 +0000
>> 
>> I like both of your comments Mondher and Madhawa. My team at USC
>> has been investigating the use of particular corpuses including
>> Fisher Callhome so as to support sentiment analysis. We have been
>> writing Java code outside of both OpenNLP and Tika but with the
>> goal of integrating them into both. We have a mix of Java and
>> Python code that we’d like to bring into both projects.
>> 
>> I’m reviewing the proposals you wrote now, but would it make sense
>> to have a Google hangout this Friday, ~10am PT Los Angeles/time?
>> 
>> Cheers,
>> Chris
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattmann@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Director, Information Retrieval and Data Science Group (IRDS)
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> WWW: http://irds.usc.edu/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
>> 
>> 
>> 
>> 
>> -----Original Message-----
>> From: Mondher Bouazizi <mo...@gmail.com>
>> Date: Monday, March 28, 2016 at 11:46 PM
>> To: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
>> <ch...@jpl.nasa.gov>
>> Cc: Anthony Beylerian <an...@hotmail.com>,
>> "dev@opennlp.apache.org" <de...@opennlp.apache.org>, "dev@tika.apache.org"
>> <de...@tika.apache.org>, Information and Data Science Group USC List
>> <ir...@mymaillists.usc.edu>
>> Subject: Re: GSOC2016 Sentiment Analysis
>> 
>> >Dear Madhawa,
>> >
>> >
>> >Thank you for your interest in the proposals.
>> >The current tasks we proposed refer to the classification and
>> >quantification regardless of the topic.
>> >This can be used in a larger context where the topic is not specified,
>>or
>> >not unique, in which case we will need to identify the topic(s).
>> >Therefore, a topic detector would be a good idea to implement, in order
>> >to complement this.
>> >
>> >
>> >As for the Document Categorizer, it is a general purpose component with
>> >basic features (n-gram, bag of words, etc.).
>> >
>> >It is basically used for the classification of texts into a set of
>> >classes defined by the user, whether they are sentiment classes or
>>other.
>> >
>> >However it doesn't perform well for this purpose.
>> >
>> >Furthermore, the sentiment analysis component would not just perform
>>the
>> >naive classification but also additional tasks (e.g., quantification)
>>and
>> >implement more specific and sophisticated approaches.
>> >
>> >
>> >Please share your thoughts.
>> >
>> >
>> >Mondher
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >On Tue, Mar 29, 2016 at 1:51 PM, Madhawa Kasun Gunasekara
>> ><ma...@gmail.com> wrote:
>> >
>> >Hi Chris / Antony
>> >
>> >
>> >yes I would like to work on this, This proposal address most of the
>> >things in Sentiment analysis,
>> >
>> >AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
>> >Analysis, since there isn't a proper functionality to do sentiment
>> >analysis in OpenNLP, This would be great if we can add this feature on
>> >OpenNLP project, and also I would like to suggest
>> > that we should able to detect the target object of the opinions from
>> >this feature as well.
>> >
>> >
>> >WDYT ??
>> >
>> >
>> >
>> >Thanks,
>> >
>> >Madhawa
>> >
>> >
>> >Madhawa
>> >
>> >
>> >
>> >
>> >On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980)
>> ><ch...@jpl.nasa.gov> wrote:
>> >
>> >Dear Anthony,
>> >
>> >Great! These both sound like fantastic proposals and I’m happy
>> >to be a mentor. Madhawa, would you like to join in on these
>> >efforts?
>> >
>> >Cheers,
>> >Chris
>> >
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >Chris Mattmann, Ph.D.
>> >Chief Architect
>> >Instrument Software and Science Data Systems Section (398)
>> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >Office: 168-519, Mailstop: 168-527
>> >Email: chris.a.mattmann@nasa.gov
>> >WWW:  
>> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >Director, Information Retrieval and Data Science Group (IRDS)
>> >Adjunct Associate Professor, Computer Science Department
>> >University of Southern California, Los Angeles, CA 90089 USA
>> >WWW: http://irds.usc.edu/
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >
>> >
>> >
>> >
>> >
>> >-----Original Message-----
>> >From: Anthony Beylerian <an...@hotmail.com>
>> >Date: Monday, March 28, 2016 at 11:48 AM
>> >To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
>> >"mondher.bouazizi@gmail.com" <mo...@gmail.com>
>> >Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
>> ><ch...@jpl.nasa.gov>
>> >Subject: RE: GSOC2016 Sentiment Analysis
>> >
>> >>Dear Chris,
>> >>
>> >>Thank you for starting the discussion.
>> >>We are glad there is an interest in a sentiment analysis component.
>> >>
>> >>My colleague Mondher posted the two JIRA issues related to Sentiment
>> >>Analysis [1][2] as references for our proposals [3][4] for GSoC.
>> >>In fact, we have been researching this topic at our university.
>> >>We are hoping to participate this year and work on integrating both a
>> >>sentiment classifier and a quantifier for the library.
>> >>
>> >>It would be nice to also have an interface with Tika, maybe we can
>> >>collaborate ?
>> >>We are also looking for mentors, in case someone is willing to support
>> >>our proposals.
>> >>
>> >>Best,
>> >>
>> >>Anthony
>> >>
>> >>[1] 
>> >https://issues.apache.org/jira/browse/OPENNLP-842
>> ><https://issues.apache.org/jira/browse/OPENNLP-842>
>> >>[2] 
>> >https://issues.apache.org/jira/browse/OPENNLP-840
>> ><https://issues.apache.org/jira/browse/OPENNLP-840>
>> >>[3]
>> 
>>>>https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-m
>>>>kg
>> >>W
>> >>nR8n0/edit?usp=sharing
>> >>[4]
>> 
>>>>https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIW
>>>>TC
>> >>X
>> >>EOJvo/edit?usp=sharing
>> >>
>> >>> From: chris.a.mattmann@jpl.nasa.gov
>> >>> To: nishant.k02@gmail.com
>> >>> CC: dev@opennlp.apache.org;
>> >madhawa30@gmail.com;
>> >hmanjuna@usc.edu <ma...@usc.edu>;
>> >>>kamalaku@usc.edu
>> >>> Subject: Re: GSOC2016 Sentiment Analysis
>> >>> Date: Sun, 27 Mar 2016 19:34:24 +0000
>> >>>
>> >>> No problem - I just wanted to encourage discussion thank you for
>> >>> your prompt and courteous replies.
>> >>>
>> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >>> Chris Mattmann, Ph.D.
>> >>> Chief Architect
>> >>> Instrument Software and Science Data Systems Section (398)
>> >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >>> Office: 168-519, Mailstop: 168-527
>> >>> Email: chris.a.mattmann@nasa.gov
>> >>> WWW: 
>> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >>> Director, Information Retrieval and Data Science Group (IRDS)
>> >>> Adjunct Associate Professor, Computer Science Department
>> >>> University of Southern California, Los Angeles, CA 90089 USA
>> >>> WWW: http://irds.usc.edu/
>> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >>
>> >>
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> 
> 		 	   		  


RE: GSOC2016 Sentiment Analysis

Posted by Anthony Beylerian <an...@hotmail.com>.
Dear Chris,

Thank you again for reviewing our proposals. 
We are looking forward to working together on this.

In our previous trials we have used an annotated corpus made through crowdflower for testing, and would be happy to share.
Although relatively modest and noisy (~10k training ~8k testing ~20k pattern extraction) we believe it was enough to demonstrate encouraging performances.
From our side, we also have a Java implementation that we would like to shape up for production, however I'm also comfortable with Python in case we will need it.

On the other hand, it sounds intriguing to use a cross-lingual corpus, we would love to discuss it.
As for the hangout session, I have just checked with Mondher and the time works for us.

Best,

Anthony


> From: chris.a.mattmann@jpl.nasa.gov
> To: mondher.bouazizi@gmail.com; madhawa30@gmail.com
> CC: anthonybeylerian@hotmail.com; dev@opennlp.apache.org; dev@tika.apache.org; irds-L@mymaillists.usc.edu
> Subject: Re: GSOC2016 Sentiment Analysis
> Date: Tue, 29 Mar 2016 13:57:11 +0000
> 
> I like both of your comments Mondher and Madhawa. My team at USC
> has been investigating the use of particular corpuses including
> Fisher Callhome so as to support sentiment analysis. We have been
> writing Java code outside of both OpenNLP and Tika but with the
> goal of integrating them into both. We have a mix of Java and
> Python code that we’d like to bring into both projects.
> 
> I’m reviewing the proposals you wrote now, but would it make sense
> to have a Google hangout this Friday, ~10am PT Los Angeles/time?
> 
> Cheers,
> Chris
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 
> 
> 
> 
> 
> -----Original Message-----
> From: Mondher Bouazizi <mo...@gmail.com>
> Date: Monday, March 28, 2016 at 11:46 PM
> To: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
> <ch...@jpl.nasa.gov>
> Cc: Anthony Beylerian <an...@hotmail.com>,
> "dev@opennlp.apache.org" <de...@opennlp.apache.org>, "dev@tika.apache.org"
> <de...@tika.apache.org>, Information and Data Science Group USC List
> <ir...@mymaillists.usc.edu>
> Subject: Re: GSOC2016 Sentiment Analysis
> 
> >Dear Madhawa,
> >
> >
> >Thank you for your interest in the proposals.
> >The current tasks we proposed refer to the classification and
> >quantification regardless of the topic.
> >This can be used in a larger context where the topic is not specified, or
> >not unique, in which case we will need to identify the topic(s).
> >Therefore, a topic detector would be a good idea to implement, in order
> >to complement this.
> >
> >
> >As for the Document Categorizer, it is a general purpose component with
> >basic features (n-gram, bag of words, etc.).
> >
> >It is basically used for the classification of texts into a set of
> >classes defined by the user, whether they are sentiment classes or other.
> >
> >However it doesn't perform well for this purpose.
> >
> >Furthermore, the sentiment analysis component would not just perform the
> >naive classification but also additional tasks (e.g., quantification) and
> >implement more specific and sophisticated approaches.
> >
> >
> >Please share your thoughts.
> >
> >
> >Mondher
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >On Tue, Mar 29, 2016 at 1:51 PM, Madhawa Kasun Gunasekara
> ><ma...@gmail.com> wrote:
> >
> >Hi Chris / Antony
> >
> >
> >yes I would like to work on this, This proposal address most of the
> >things in Sentiment analysis,
> >
> >AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
> >Analysis, since there isn't a proper functionality to do sentiment
> >analysis in OpenNLP, This would be great if we can add this feature on
> >OpenNLP project, and also I would like to suggest
> > that we should able to detect the target object of the opinions from
> >this feature as well.
> >
> >
> >WDYT ??
> >
> >
> >
> >Thanks,
> >
> >Madhawa
> >
> >
> >Madhawa
> >
> >
> >
> >
> >On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov> wrote:
> >
> >Dear Anthony,
> >
> >Great! These both sound like fantastic proposals and I’m happy
> >to be a mentor. Madhawa, would you like to join in on these
> >efforts?
> >
> >Cheers,
> >Chris
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Chris Mattmann, Ph.D.
> >Chief Architect
> >Instrument Software and Science Data Systems Section (398)
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >Office: 168-519, Mailstop: 168-527
> >Email: chris.a.mattmann@nasa.gov
> >WWW:  
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Director, Information Retrieval and Data Science Group (IRDS)
> >Adjunct Associate Professor, Computer Science Department
> >University of Southern California, Los Angeles, CA 90089 USA
> >WWW: http://irds.usc.edu/
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> >
> >
> >-----Original Message-----
> >From: Anthony Beylerian <an...@hotmail.com>
> >Date: Monday, March 28, 2016 at 11:48 AM
> >To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
> >"mondher.bouazizi@gmail.com" <mo...@gmail.com>
> >Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
> ><ch...@jpl.nasa.gov>
> >Subject: RE: GSOC2016 Sentiment Analysis
> >
> >>Dear Chris,
> >>
> >>Thank you for starting the discussion.
> >>We are glad there is an interest in a sentiment analysis component.
> >>
> >>My colleague Mondher posted the two JIRA issues related to Sentiment
> >>Analysis [1][2] as references for our proposals [3][4] for GSoC.
> >>In fact, we have been researching this topic at our university.
> >>We are hoping to participate this year and work on integrating both a
> >>sentiment classifier and a quantifier for the library.
> >>
> >>It would be nice to also have an interface with Tika, maybe we can
> >>collaborate ?
> >>We are also looking for mentors, in case someone is willing to support
> >>our proposals.
> >>
> >>Best,
> >>
> >>Anthony
> >>
> >>[1] 
> >https://issues.apache.org/jira/browse/OPENNLP-842
> ><https://issues.apache.org/jira/browse/OPENNLP-842>
> >>[2] 
> >https://issues.apache.org/jira/browse/OPENNLP-840
> ><https://issues.apache.org/jira/browse/OPENNLP-840>
> >>[3]
> >>https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkg
> >>W
> >>nR8n0/edit?usp=sharing
> >>[4]
> >>https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTC
> >>X
> >>EOJvo/edit?usp=sharing
> >>
> >>> From: chris.a.mattmann@jpl.nasa.gov
> >>> To: nishant.k02@gmail.com
> >>> CC: dev@opennlp.apache.org;
> >madhawa30@gmail.com;
> >hmanjuna@usc.edu <ma...@usc.edu>;
> >>>kamalaku@usc.edu
> >>> Subject: Re: GSOC2016 Sentiment Analysis
> >>> Date: Sun, 27 Mar 2016 19:34:24 +0000
> >>>
> >>> No problem - I just wanted to encourage discussion thank you for
> >>> your prompt and courteous replies.
> >>>
> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>> Chris Mattmann, Ph.D.
> >>> Chief Architect
> >>> Instrument Software and Science Data Systems Section (398)
> >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>> Office: 168-519, Mailstop: 168-527
> >>> Email: chris.a.mattmann@nasa.gov
> >>> WWW: 
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>> Director, Information Retrieval and Data Science Group (IRDS)
> >>> Adjunct Associate Professor, Computer Science Department
> >>> University of Southern California, Los Angeles, CA 90089 USA
> >>> WWW: http://irds.usc.edu/
> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> 
 		 	   		  

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
I like both of your comments Mondher and Madhawa. My team at USC
has been investigating the use of particular corpuses including
Fisher Callhome so as to support sentiment analysis. We have been
writing Java code outside of both OpenNLP and Tika but with the
goal of integrating them into both. We have a mix of Java and
Python code that we’d like to bring into both projects.

I’m reviewing the proposals you wrote now, but would it make sense
to have a Google hangout this Friday, ~10am PT Los Angeles/time?

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Mondher Bouazizi <mo...@gmail.com>
Date: Monday, March 28, 2016 at 11:46 PM
To: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
<ch...@jpl.nasa.gov>
Cc: Anthony Beylerian <an...@hotmail.com>,
"dev@opennlp.apache.org" <de...@opennlp.apache.org>, "dev@tika.apache.org"
<de...@tika.apache.org>, Information and Data Science Group USC List
<ir...@mymaillists.usc.edu>
Subject: Re: GSOC2016 Sentiment Analysis

>Dear Madhawa,
>
>
>Thank you for your interest in the proposals.
>The current tasks we proposed refer to the classification and
>quantification regardless of the topic.
>This can be used in a larger context where the topic is not specified, or
>not unique, in which case we will need to identify the topic(s).
>Therefore, a topic detector would be a good idea to implement, in order
>to complement this.
>
>
>As for the Document Categorizer, it is a general purpose component with
>basic features (n-gram, bag of words, etc.).
>
>It is basically used for the classification of texts into a set of
>classes defined by the user, whether they are sentiment classes or other.
>
>However it doesn't perform well for this purpose.
>
>Furthermore, the sentiment analysis component would not just perform the
>naive classification but also additional tasks (e.g., quantification) and
>implement more specific and sophisticated approaches.
>
>
>Please share your thoughts.
>
>
>Mondher
>
>
>
>
>
>
>
>
>
>On Tue, Mar 29, 2016 at 1:51 PM, Madhawa Kasun Gunasekara
><ma...@gmail.com> wrote:
>
>Hi Chris / Antony
>
>
>yes I would like to work on this, This proposal address most of the
>things in Sentiment analysis,
>
>AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
>Analysis, since there isn't a proper functionality to do sentiment
>analysis in OpenNLP, This would be great if we can add this feature on
>OpenNLP project, and also I would like to suggest
> that we should able to detect the target object of the opinions from
>this feature as well.
>
>
>WDYT ??
>
>
>
>Thanks,
>
>Madhawa
>
>
>Madhawa
>
>
>
>
>On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Dear Anthony,
>
>Great! These both sound like fantastic proposals and I’m happy
>to be a mentor. Madhawa, would you like to join in on these
>efforts?
>
>Cheers,
>Chris
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov
>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Anthony Beylerian <an...@hotmail.com>
>Date: Monday, March 28, 2016 at 11:48 AM
>To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
>"mondher.bouazizi@gmail.com" <mo...@gmail.com>
>Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
><ch...@jpl.nasa.gov>
>Subject: RE: GSOC2016 Sentiment Analysis
>
>>Dear Chris,
>>
>>Thank you for starting the discussion.
>>We are glad there is an interest in a sentiment analysis component.
>>
>>My colleague Mondher posted the two JIRA issues related to Sentiment
>>Analysis [1][2] as references for our proposals [3][4] for GSoC.
>>In fact, we have been researching this topic at our university.
>>We are hoping to participate this year and work on integrating both a
>>sentiment classifier and a quantifier for the library.
>>
>>It would be nice to also have an interface with Tika, maybe we can
>>collaborate ?
>>We are also looking for mentors, in case someone is willing to support
>>our proposals.
>>
>>Best,
>>
>>Anthony
>>
>>[1] 
>https://issues.apache.org/jira/browse/OPENNLP-842
><https://issues.apache.org/jira/browse/OPENNLP-842>
>>[2] 
>https://issues.apache.org/jira/browse/OPENNLP-840
><https://issues.apache.org/jira/browse/OPENNLP-840>
>>[3]
>>https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkg
>>W
>>nR8n0/edit?usp=sharing
>>[4]
>>https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTC
>>X
>>EOJvo/edit?usp=sharing
>>
>>> From: chris.a.mattmann@jpl.nasa.gov
>>> To: nishant.k02@gmail.com
>>> CC: dev@opennlp.apache.org;
>madhawa30@gmail.com;
>hmanjuna@usc.edu <ma...@usc.edu>;
>>>kamalaku@usc.edu
>>> Subject: Re: GSOC2016 Sentiment Analysis
>>> Date: Sun, 27 Mar 2016 19:34:24 +0000
>>>
>>> No problem - I just wanted to encourage discussion thank you for
>>> your prompt and courteous replies.
>>>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Chief Architect
>>> Instrument Software and Science Data Systems Section (398)
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 168-519, Mailstop: 168-527
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW: 
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Director, Information Retrieval and Data Science Group (IRDS)
>>> Adjunct Associate Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> WWW: http://irds.usc.edu/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>
>
>
>
>
>
>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
I like both of your comments Mondher and Madhawa. My team at USC
has been investigating the use of particular corpuses including
Fisher Callhome so as to support sentiment analysis. We have been
writing Java code outside of both OpenNLP and Tika but with the
goal of integrating them into both. We have a mix of Java and
Python code that we’d like to bring into both projects.

I’m reviewing the proposals you wrote now, but would it make sense
to have a Google hangout this Friday, ~10am PT Los Angeles/time?

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Mondher Bouazizi <mo...@gmail.com>
Date: Monday, March 28, 2016 at 11:46 PM
To: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
<ch...@jpl.nasa.gov>
Cc: Anthony Beylerian <an...@hotmail.com>,
"dev@opennlp.apache.org" <de...@opennlp.apache.org>, "dev@tika.apache.org"
<de...@tika.apache.org>, Information and Data Science Group USC List
<ir...@mymaillists.usc.edu>
Subject: Re: GSOC2016 Sentiment Analysis

>Dear Madhawa,
>
>
>Thank you for your interest in the proposals.
>The current tasks we proposed refer to the classification and
>quantification regardless of the topic.
>This can be used in a larger context where the topic is not specified, or
>not unique, in which case we will need to identify the topic(s).
>Therefore, a topic detector would be a good idea to implement, in order
>to complement this.
>
>
>As for the Document Categorizer, it is a general purpose component with
>basic features (n-gram, bag of words, etc.).
>
>It is basically used for the classification of texts into a set of
>classes defined by the user, whether they are sentiment classes or other.
>
>However it doesn't perform well for this purpose.
>
>Furthermore, the sentiment analysis component would not just perform the
>naive classification but also additional tasks (e.g., quantification) and
>implement more specific and sophisticated approaches.
>
>
>Please share your thoughts.
>
>
>Mondher
>
>
>
>
>
>
>
>
>
>On Tue, Mar 29, 2016 at 1:51 PM, Madhawa Kasun Gunasekara
><ma...@gmail.com> wrote:
>
>Hi Chris / Antony
>
>
>yes I would like to work on this, This proposal address most of the
>things in Sentiment analysis,
>
>AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
>Analysis, since there isn't a proper functionality to do sentiment
>analysis in OpenNLP, This would be great if we can add this feature on
>OpenNLP project, and also I would like to suggest
> that we should able to detect the target object of the opinions from
>this feature as well.
>
>
>WDYT ??
>
>
>
>Thanks,
>
>Madhawa
>
>
>Madhawa
>
>
>
>
>On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Dear Anthony,
>
>Great! These both sound like fantastic proposals and I’m happy
>to be a mentor. Madhawa, would you like to join in on these
>efforts?
>
>Cheers,
>Chris
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov
>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Anthony Beylerian <an...@hotmail.com>
>Date: Monday, March 28, 2016 at 11:48 AM
>To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
>"mondher.bouazizi@gmail.com" <mo...@gmail.com>
>Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
><ch...@jpl.nasa.gov>
>Subject: RE: GSOC2016 Sentiment Analysis
>
>>Dear Chris,
>>
>>Thank you for starting the discussion.
>>We are glad there is an interest in a sentiment analysis component.
>>
>>My colleague Mondher posted the two JIRA issues related to Sentiment
>>Analysis [1][2] as references for our proposals [3][4] for GSoC.
>>In fact, we have been researching this topic at our university.
>>We are hoping to participate this year and work on integrating both a
>>sentiment classifier and a quantifier for the library.
>>
>>It would be nice to also have an interface with Tika, maybe we can
>>collaborate ?
>>We are also looking for mentors, in case someone is willing to support
>>our proposals.
>>
>>Best,
>>
>>Anthony
>>
>>[1] 
>https://issues.apache.org/jira/browse/OPENNLP-842
><https://issues.apache.org/jira/browse/OPENNLP-842>
>>[2] 
>https://issues.apache.org/jira/browse/OPENNLP-840
><https://issues.apache.org/jira/browse/OPENNLP-840>
>>[3]
>>https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkg
>>W
>>nR8n0/edit?usp=sharing
>>[4]
>>https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTC
>>X
>>EOJvo/edit?usp=sharing
>>
>>> From: chris.a.mattmann@jpl.nasa.gov
>>> To: nishant.k02@gmail.com
>>> CC: dev@opennlp.apache.org;
>madhawa30@gmail.com;
>hmanjuna@usc.edu <ma...@usc.edu>;
>>>kamalaku@usc.edu
>>> Subject: Re: GSOC2016 Sentiment Analysis
>>> Date: Sun, 27 Mar 2016 19:34:24 +0000
>>>
>>> No problem - I just wanted to encourage discussion thank you for
>>> your prompt and courteous replies.
>>>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Chief Architect
>>> Instrument Software and Science Data Systems Section (398)
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 168-519, Mailstop: 168-527
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW: 
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Director, Information Retrieval and Data Science Group (IRDS)
>>> Adjunct Associate Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> WWW: http://irds.usc.edu/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>
>
>
>
>
>
>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by Mondher Bouazizi <mo...@gmail.com>.
Dear Madhawa,

Thank you for your interest in the proposals.
The current tasks we proposed refer to the classification and
quantification regardless of the topic.
This can be used in a larger context where the topic is not specified, or
not unique, in which case we will need to identify the topic(s).
Therefore, a topic detector would be a good idea to implement, in order to
complement this.

As for the Document Categorizer, it is a general purpose component with
basic features (n-gram, bag of words, etc.).
It is basically used for the classification of texts into a set of classes
defined by the user, whether they are sentiment classes or other.
However it doesn't perform well for this purpose.

Furthermore, the sentiment analysis component would not just perform the
naive classification but also additional tasks (e.g., quantification) and
implement more specific and sophisticated approaches.

Please share your thoughts.

Mondher



On Tue, Mar 29, 2016 at 1:51 PM, Madhawa Kasun Gunasekara <
madhawa30@gmail.com> wrote:

> Hi Chris / Antony
>
> yes I would like to work on this, This proposal address most of the things
> in Sentiment analysis,
> AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
> Analysis, since there isn't a proper functionality to do sentiment analysis
> in OpenNLP, This would be great if we can add this feature on OpenNLP
> project, and also I would like to suggest that we should able to detect the
> target object of the opinions from this feature as well.
>
> WDYT ??
>
> Thanks,
> Madhawa
>
> Madhawa
>
> On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980) <
> chris.a.mattmann@jpl.nasa.gov> wrote:
>
>> Dear Anthony,
>>
>> Great! These both sound like fantastic proposals and I’m happy
>> to be a mentor. Madhawa, would you like to join in on these
>> efforts?
>>
>> Cheers,
>> Chris
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattmann@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Director, Information Retrieval and Data Science Group (IRDS)
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> WWW: http://irds.usc.edu/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>> -----Original Message-----
>> From: Anthony Beylerian <an...@hotmail.com>
>> Date: Monday, March 28, 2016 at 11:48 AM
>> To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
>> "mondher.bouazizi@gmail.com" <mo...@gmail.com>
>> Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
>> <ch...@jpl.nasa.gov>
>> Subject: RE: GSOC2016 Sentiment Analysis
>>
>> >Dear Chris,
>> >
>> >Thank you for starting the discussion.
>> >We are glad there is an interest in a sentiment analysis component.
>> >
>> >My colleague Mondher posted the two JIRA issues related to Sentiment
>> >Analysis [1][2] as references for our proposals [3][4] for GSoC.
>> >In fact, we have been researching this topic at our university.
>> >We are hoping to participate this year and work on integrating both a
>> >sentiment classifier and a quantifier for the library.
>> >
>> >It would be nice to also have an interface with Tika, maybe we can
>> >collaborate ?
>> >We are also looking for mentors, in case someone is willing to support
>> >our proposals.
>> >
>> >Best,
>> >
>> >Anthony
>> >
>> >[1] https://issues.apache.org/jira/browse/OPENNLP-842
>> >[2] https://issues.apache.org/jira/browse/OPENNLP-840
>> >[3]
>> >
>> https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkgW
>> >nR8n0/edit?usp=sharing
>> >[4]
>> >
>> https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTCX
>> >EOJvo/edit?usp=sharing
>> >
>> >> From: chris.a.mattmann@jpl.nasa.gov
>> >> To: nishant.k02@gmail.com
>> >> CC: dev@opennlp.apache.org; madhawa30@gmail.com; hmanjuna@usc.edu;
>> >>kamalaku@usc.edu
>> >> Subject: Re: GSOC2016 Sentiment Analysis
>> >> Date: Sun, 27 Mar 2016 19:34:24 +0000
>> >>
>> >> No problem - I just wanted to encourage discussion thank you for
>> >> your prompt and courteous replies.
>> >>
>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >> Chris Mattmann, Ph.D.
>> >> Chief Architect
>> >> Instrument Software and Science Data Systems Section (398)
>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >> Office: 168-519, Mailstop: 168-527
>> >> Email: chris.a.mattmann@nasa.gov
>> >> WWW: http://sunset.usc.edu/~mattmann/
>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >> Director, Information Retrieval and Data Science Group (IRDS)
>> >> Adjunct Associate Professor, Computer Science Department
>> >> University of Southern California, Los Angeles, CA 90089 USA
>> >> WWW: http://irds.usc.edu/
>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >
>> >
>>
>>
>

Re: GSOC2016 Sentiment Analysis

Posted by Mondher Bouazizi <mo...@gmail.com>.
Dear Madhawa,

Thank you for your interest in the proposals.
The current tasks we proposed refer to the classification and
quantification regardless of the topic.
This can be used in a larger context where the topic is not specified, or
not unique, in which case we will need to identify the topic(s).
Therefore, a topic detector would be a good idea to implement, in order to
complement this.

As for the Document Categorizer, it is a general purpose component with
basic features (n-gram, bag of words, etc.).
It is basically used for the classification of texts into a set of classes
defined by the user, whether they are sentiment classes or other.
However it doesn't perform well for this purpose.

Furthermore, the sentiment analysis component would not just perform the
naive classification but also additional tasks (e.g., quantification) and
implement more specific and sophisticated approaches.

Please share your thoughts.

Mondher



On Tue, Mar 29, 2016 at 1:51 PM, Madhawa Kasun Gunasekara <
madhawa30@gmail.com> wrote:

> Hi Chris / Antony
>
> yes I would like to work on this, This proposal address most of the things
> in Sentiment analysis,
> AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
> Analysis, since there isn't a proper functionality to do sentiment analysis
> in OpenNLP, This would be great if we can add this feature on OpenNLP
> project, and also I would like to suggest that we should able to detect the
> target object of the opinions from this feature as well.
>
> WDYT ??
>
> Thanks,
> Madhawa
>
> Madhawa
>
> On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980) <
> chris.a.mattmann@jpl.nasa.gov> wrote:
>
>> Dear Anthony,
>>
>> Great! These both sound like fantastic proposals and I’m happy
>> to be a mentor. Madhawa, would you like to join in on these
>> efforts?
>>
>> Cheers,
>> Chris
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattmann@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Director, Information Retrieval and Data Science Group (IRDS)
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> WWW: http://irds.usc.edu/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>> -----Original Message-----
>> From: Anthony Beylerian <an...@hotmail.com>
>> Date: Monday, March 28, 2016 at 11:48 AM
>> To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
>> "mondher.bouazizi@gmail.com" <mo...@gmail.com>
>> Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
>> <ch...@jpl.nasa.gov>
>> Subject: RE: GSOC2016 Sentiment Analysis
>>
>> >Dear Chris,
>> >
>> >Thank you for starting the discussion.
>> >We are glad there is an interest in a sentiment analysis component.
>> >
>> >My colleague Mondher posted the two JIRA issues related to Sentiment
>> >Analysis [1][2] as references for our proposals [3][4] for GSoC.
>> >In fact, we have been researching this topic at our university.
>> >We are hoping to participate this year and work on integrating both a
>> >sentiment classifier and a quantifier for the library.
>> >
>> >It would be nice to also have an interface with Tika, maybe we can
>> >collaborate ?
>> >We are also looking for mentors, in case someone is willing to support
>> >our proposals.
>> >
>> >Best,
>> >
>> >Anthony
>> >
>> >[1] https://issues.apache.org/jira/browse/OPENNLP-842
>> >[2] https://issues.apache.org/jira/browse/OPENNLP-840
>> >[3]
>> >
>> https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkgW
>> >nR8n0/edit?usp=sharing
>> >[4]
>> >
>> https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTCX
>> >EOJvo/edit?usp=sharing
>> >
>> >> From: chris.a.mattmann@jpl.nasa.gov
>> >> To: nishant.k02@gmail.com
>> >> CC: dev@opennlp.apache.org; madhawa30@gmail.com; hmanjuna@usc.edu;
>> >>kamalaku@usc.edu
>> >> Subject: Re: GSOC2016 Sentiment Analysis
>> >> Date: Sun, 27 Mar 2016 19:34:24 +0000
>> >>
>> >> No problem - I just wanted to encourage discussion thank you for
>> >> your prompt and courteous replies.
>> >>
>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >> Chris Mattmann, Ph.D.
>> >> Chief Architect
>> >> Instrument Software and Science Data Systems Section (398)
>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >> Office: 168-519, Mailstop: 168-527
>> >> Email: chris.a.mattmann@nasa.gov
>> >> WWW: http://sunset.usc.edu/~mattmann/
>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >> Director, Information Retrieval and Data Science Group (IRDS)
>> >> Adjunct Associate Professor, Computer Science Department
>> >> University of Southern California, Los Angeles, CA 90089 USA
>> >> WWW: http://irds.usc.edu/
>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >
>> >
>>
>>
>

Re: GSOC2016 Sentiment Analysis

Posted by Madhawa Kasun Gunasekara <ma...@gmail.com>.
Hi Chris / Antony

yes I would like to work on this, This proposal address most of the things
in Sentiment analysis,
AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
Analysis, since there isn't a proper functionality to do sentiment analysis
in OpenNLP, This would be great if we can add this feature on OpenNLP
project, and also I would like to suggest that we should able to detect the
target object of the opinions from this feature as well.

WDYT ??

Thanks,
Madhawa

Madhawa

On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Dear Anthony,
>
> Great! These both sound like fantastic proposals and I’m happy
> to be a mentor. Madhawa, would you like to join in on these
> efforts?
>
> Cheers,
> Chris
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Anthony Beylerian <an...@hotmail.com>
> Date: Monday, March 28, 2016 at 11:48 AM
> To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
> "mondher.bouazizi@gmail.com" <mo...@gmail.com>
> Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
> <ch...@jpl.nasa.gov>
> Subject: RE: GSOC2016 Sentiment Analysis
>
> >Dear Chris,
> >
> >Thank you for starting the discussion.
> >We are glad there is an interest in a sentiment analysis component.
> >
> >My colleague Mondher posted the two JIRA issues related to Sentiment
> >Analysis [1][2] as references for our proposals [3][4] for GSoC.
> >In fact, we have been researching this topic at our university.
> >We are hoping to participate this year and work on integrating both a
> >sentiment classifier and a quantifier for the library.
> >
> >It would be nice to also have an interface with Tika, maybe we can
> >collaborate ?
> >We are also looking for mentors, in case someone is willing to support
> >our proposals.
> >
> >Best,
> >
> >Anthony
> >
> >[1] https://issues.apache.org/jira/browse/OPENNLP-842
> >[2] https://issues.apache.org/jira/browse/OPENNLP-840
> >[3]
> >
> https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkgW
> >nR8n0/edit?usp=sharing
> >[4]
> >
> https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTCX
> >EOJvo/edit?usp=sharing
> >
> >> From: chris.a.mattmann@jpl.nasa.gov
> >> To: nishant.k02@gmail.com
> >> CC: dev@opennlp.apache.org; madhawa30@gmail.com; hmanjuna@usc.edu;
> >>kamalaku@usc.edu
> >> Subject: Re: GSOC2016 Sentiment Analysis
> >> Date: Sun, 27 Mar 2016 19:34:24 +0000
> >>
> >> No problem - I just wanted to encourage discussion thank you for
> >> your prompt and courteous replies.
> >>
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >> Chris Mattmann, Ph.D.
> >> Chief Architect
> >> Instrument Software and Science Data Systems Section (398)
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 168-519, Mailstop: 168-527
> >> Email: chris.a.mattmann@nasa.gov
> >> WWW: http://sunset.usc.edu/~mattmann/
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >> Director, Information Retrieval and Data Science Group (IRDS)
> >> Adjunct Associate Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> WWW: http://irds.usc.edu/
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
>
>

Re: GSOC2016 Sentiment Analysis

Posted by Madhawa Kasun Gunasekara <ma...@gmail.com>.
Hi Chris / Antony

yes I would like to work on this, This proposal address most of the things
in Sentiment analysis,
AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
Analysis, since there isn't a proper functionality to do sentiment analysis
in OpenNLP, This would be great if we can add this feature on OpenNLP
project, and also I would like to suggest that we should able to detect the
target object of the opinions from this feature as well.

WDYT ??

Thanks,
Madhawa

Madhawa

On Tue, Mar 29, 2016 at 2:11 AM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Dear Anthony,
>
> Great! These both sound like fantastic proposals and I’m happy
> to be a mentor. Madhawa, would you like to join in on these
> efforts?
>
> Cheers,
> Chris
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Anthony Beylerian <an...@hotmail.com>
> Date: Monday, March 28, 2016 at 11:48 AM
> To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
> "mondher.bouazizi@gmail.com" <mo...@gmail.com>
> Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
> <ch...@jpl.nasa.gov>
> Subject: RE: GSOC2016 Sentiment Analysis
>
> >Dear Chris,
> >
> >Thank you for starting the discussion.
> >We are glad there is an interest in a sentiment analysis component.
> >
> >My colleague Mondher posted the two JIRA issues related to Sentiment
> >Analysis [1][2] as references for our proposals [3][4] for GSoC.
> >In fact, we have been researching this topic at our university.
> >We are hoping to participate this year and work on integrating both a
> >sentiment classifier and a quantifier for the library.
> >
> >It would be nice to also have an interface with Tika, maybe we can
> >collaborate ?
> >We are also looking for mentors, in case someone is willing to support
> >our proposals.
> >
> >Best,
> >
> >Anthony
> >
> >[1] https://issues.apache.org/jira/browse/OPENNLP-842
> >[2] https://issues.apache.org/jira/browse/OPENNLP-840
> >[3]
> >
> https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkgW
> >nR8n0/edit?usp=sharing
> >[4]
> >
> https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTCX
> >EOJvo/edit?usp=sharing
> >
> >> From: chris.a.mattmann@jpl.nasa.gov
> >> To: nishant.k02@gmail.com
> >> CC: dev@opennlp.apache.org; madhawa30@gmail.com; hmanjuna@usc.edu;
> >>kamalaku@usc.edu
> >> Subject: Re: GSOC2016 Sentiment Analysis
> >> Date: Sun, 27 Mar 2016 19:34:24 +0000
> >>
> >> No problem - I just wanted to encourage discussion thank you for
> >> your prompt and courteous replies.
> >>
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >> Chris Mattmann, Ph.D.
> >> Chief Architect
> >> Instrument Software and Science Data Systems Section (398)
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 168-519, Mailstop: 168-527
> >> Email: chris.a.mattmann@nasa.gov
> >> WWW: http://sunset.usc.edu/~mattmann/
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >> Director, Information Retrieval and Data Science Group (IRDS)
> >> Adjunct Associate Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> WWW: http://irds.usc.edu/
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
>
>

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Dear Anthony,

Great! These both sound like fantastic proposals and I’m happy
to be a mentor. Madhawa, would you like to join in on these
efforts?

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Anthony Beylerian <an...@hotmail.com>
Date: Monday, March 28, 2016 at 11:48 AM
To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
"mondher.bouazizi@gmail.com" <mo...@gmail.com>
Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
<ch...@jpl.nasa.gov>
Subject: RE: GSOC2016 Sentiment Analysis

>Dear Chris,
>
>Thank you for starting the discussion.
>We are glad there is an interest in a sentiment analysis component.
>
>My colleague Mondher posted the two JIRA issues related to Sentiment
>Analysis [1][2] as references for our proposals [3][4] for GSoC.
>In fact, we have been researching this topic at our university.
>We are hoping to participate this year and work on integrating both a
>sentiment classifier and a quantifier for the library.
>
>It would be nice to also have an interface with Tika, maybe we can
>collaborate ?
>We are also looking for mentors, in case someone is willing to support
>our proposals.
>
>Best,
>
>Anthony
>
>[1] https://issues.apache.org/jira/browse/OPENNLP-842
>[2] https://issues.apache.org/jira/browse/OPENNLP-840
>[3] 
>https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkgW
>nR8n0/edit?usp=sharing
>[4] 
>https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTCX
>EOJvo/edit?usp=sharing
>
>> From: chris.a.mattmann@jpl.nasa.gov
>> To: nishant.k02@gmail.com
>> CC: dev@opennlp.apache.org; madhawa30@gmail.com; hmanjuna@usc.edu;
>>kamalaku@usc.edu
>> Subject: Re: GSOC2016 Sentiment Analysis
>> Date: Sun, 27 Mar 2016 19:34:24 +0000
>> 
>> No problem - I just wanted to encourage discussion thank you for
>> your prompt and courteous replies.
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattmann@nasa.gov
>> WWW: http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Director, Information Retrieval and Data Science Group (IRDS)
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> WWW: http://irds.usc.edu/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>


Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Dear Anthony,

Great! These both sound like fantastic proposals and I’m happy
to be a mentor. Madhawa, would you like to join in on these
efforts?

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Anthony Beylerian <an...@hotmail.com>
Date: Monday, March 28, 2016 at 11:48 AM
To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>,
"mondher.bouazizi@gmail.com" <mo...@gmail.com>
Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, jpluser
<ch...@jpl.nasa.gov>
Subject: RE: GSOC2016 Sentiment Analysis

>Dear Chris,
>
>Thank you for starting the discussion.
>We are glad there is an interest in a sentiment analysis component.
>
>My colleague Mondher posted the two JIRA issues related to Sentiment
>Analysis [1][2] as references for our proposals [3][4] for GSoC.
>In fact, we have been researching this topic at our university.
>We are hoping to participate this year and work on integrating both a
>sentiment classifier and a quantifier for the library.
>
>It would be nice to also have an interface with Tika, maybe we can
>collaborate ?
>We are also looking for mentors, in case someone is willing to support
>our proposals.
>
>Best,
>
>Anthony
>
>[1] https://issues.apache.org/jira/browse/OPENNLP-842
>[2] https://issues.apache.org/jira/browse/OPENNLP-840
>[3] 
>https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkgW
>nR8n0/edit?usp=sharing
>[4] 
>https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTCX
>EOJvo/edit?usp=sharing
>
>> From: chris.a.mattmann@jpl.nasa.gov
>> To: nishant.k02@gmail.com
>> CC: dev@opennlp.apache.org; madhawa30@gmail.com; hmanjuna@usc.edu;
>>kamalaku@usc.edu
>> Subject: Re: GSOC2016 Sentiment Analysis
>> Date: Sun, 27 Mar 2016 19:34:24 +0000
>> 
>> No problem - I just wanted to encourage discussion thank you for
>> your prompt and courteous replies.
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattmann@nasa.gov
>> WWW: http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Director, Information Retrieval and Data Science Group (IRDS)
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> WWW: http://irds.usc.edu/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>


RE: GSOC2016 Sentiment Analysis

Posted by Anthony Beylerian <an...@hotmail.com>.
Dear Chris,

Thank you for starting the discussion.
We are glad there is an interest in a sentiment analysis component.

My colleague Mondher posted the two JIRA issues related to Sentiment Analysis [1][2] as references for our proposals [3][4] for GSoC.
In fact, we have been researching this topic at our university.
We are hoping to participate this year and work on integrating both a sentiment classifier and a quantifier for the library.

It would be nice to also have an interface with Tika, maybe we can collaborate ?
We are also looking for mentors, in case someone is willing to support our proposals.

Best,

Anthony

[1] https://issues.apache.org/jira/browse/OPENNLP-842[2] https://issues.apache.org/jira/browse/OPENNLP-840
[3] https://docs.google.com/document/d/1nVnwpmGaOnwHERXr55IClE4V87jUX2sva-mkgWnR8n0/edit?usp=sharing
[4] https://docs.google.com/document/d/1x02II9W3rirtuSbx_sY8kOQZSgOp0SIKeIWTCXEOJvo/edit?usp=sharing
> From: chris.a.mattmann@jpl.nasa.gov
> To: nishant.k02@gmail.com
> CC: dev@opennlp.apache.org; madhawa30@gmail.com; hmanjuna@usc.edu; kamalaku@usc.edu
> Subject: Re: GSOC2016 Sentiment Analysis
> Date: Sun, 27 Mar 2016 19:34:24 +0000
> 
> No problem - I just wanted to encourage discussion thank you for
> your prompt and courteous replies.
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
 		 	   		  

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
No problem - I just wanted to encourage discussion thank you for
your prompt and courteous replies.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Nishant Kelkar <ni...@gmail.com>
Date: Sunday, March 27, 2016 at 12:19 PM
To: jpluser <ch...@jpl.nasa.gov>
Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Madhawa Kasun
Gunasekara <ma...@gmail.com>, Harshavardhan Manjunatha
<hm...@usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>
Subject: Re: GSOC2016 Sentiment Analysis

>Sorry about that, yes, I did mean Chris.
>
>
>Best Regards,
>Nishant
>
>
>On Sun, Mar 27, 2016 at 12:18 PM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>I think you mean “Chris”, not “Matt”, right?
>
>Additionally, discussion, even related to GSoC should happen
>on the dev list. As a dev list, OpenNLP traffic is fairly
>non-existent. So, having some discussion about Google Summer
>of Code and a project that will use OpenNLP in that effort
>is a good thing and discussion should happen on the dev list for
>the community. In this case, the conversation involves several
>communities (Tika, OpenNLP, etc.)
>
>Chris
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov
>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Nishant Kelkar <ni...@gmail.com>
>Date: Sunday, March 27, 2016 at 12:07 PM
>To: jpluser <ch...@jpl.nasa.gov>
>Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Madhawa Kasun
>Gunasekara <ma...@gmail.com>, Harshavardhan Manjunatha
><hm...@usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>
>Subject: Re: GSOC2016 Sentiment Analysis
>
>>Matt,
>>
>>
>>What I thought was that the dev. mailing list (per my limited knowledge)
>>was for initiation of code-related tickets, bug-filing, feature
>>discussions, etc.
>>
>>
>>From the discussion in the thread above, it seems like you've reached a
>>consensus on what project you'd like to work on, what team members will
>>be participating, etc. All I meant was to then take the discussion into a
>>more closed group (e.g. you and the
>> participants of this GSoC project). If you come across an OpenNLP-wide
>>code-related issue, then I'd love to see such an email in my email.
>>However, questions about logging in, for example, can happen in a more
>>private group of people (or maybe on a user@ mailing
>> list level), not on a dev@ level.
>>
>>
>>But maybe I am wrong, sorry if that is the case.
>>
>>
>>Best Regards,
>>Nishant
>>
>>
>>On Sun, Mar 27, 2016 at 11:58 AM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov> wrote:
>>
>>Nishant,
>>
>>I’m not sure what you are talking about, at all. It’s part of the
>>engagement process in GSoC to *engage the community*. At Apache
>>this is done on list.
>>
>>I’ve been on this list for months and there is about 0..traffic.
>>Which is not good. Traffic, like this, *is good*. It shows there
>>is a healthy community that actually discusses things.
>>
>>Madhawa, you don’t need to take this conversation off list, and
>>precisely the opposite. The conversation must be kept on list.
>>
>>Chris
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>Chris Mattmann, Ph.D.
>>Chief Architect
>>Instrument Software and Science Data Systems Section (398)
>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>Office: 168-519, Mailstop: 168-527
>>Email: chris.a.mattmann@nasa.gov
>>WWW:
>
>
>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>Director, Information Retrieval and Data Science Group (IRDS)
>>Adjunct Associate Professor, Computer Science Department
>>University of Southern California, Los Angeles, CA 90089 USA
>>WWW: http://irds.usc.edu/
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>-----Original Message-----
>>From: Nishant Kelkar <ni...@gmail.com>
>>Date: Sunday, March 27, 2016 at 11:44 AM
>>To: <de...@opennlp.apache.org>
>>Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, Harshavardhan
>>Manjunatha <hm...@usc.edu>, Information and Data Science Group USC
>>List
>><ir...@mymaillists.usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>,
>>"dev@tika.apache.org" <de...@tika.apache.org>
>>Subject: Re: GSOC2016 Sentiment Analysis
>>
>>>Hi Madhawa,
>>>Could you take this discussion off the dev openNLP list for other
>>>problems concerning logging in, participation, etc. now that you have a
>>>positive response? In my humble opinion, that would prevent others not
>>>involved in your discussion from getting email about the topic.
>>>
>>>Good luck!
>>>
>>>Best Regards,
>>>Nishant
>>>
>>>
>>>On Sun, Mar 27, 2016 at 6:37 AM, Mattmann, Chris A (3980)
>>><ch...@jpl.nasa.gov> wrote:
>>>
>>>Thanks please can you create a username with no spaces?
>>>
>>>Sent from my iPhone
>>>
>>>On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara
>>><ma...@gmail.com>> wrote:
>>>
>>>Hi Chris,
>>>
>>>Thanks for the reply, I tried to logging to [1], but I couldn't able to
>>>login into that my username is "Madhawa Gunasekara"
>>>[1]
>>https://wiki.apache.org/tika/GSoC2016
>><https://wiki.apache.org/tika/GSoC2016>
>>>
>>>I have created a jira issue on
>>>https://issues.apache.org/jira/browse/TIKA-1911
>>>
>>>Thanks,
>>>Madhawa
>>>
>>>Madhawa
>>>
>>>On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980)
>>><ch...@jpl.nasa.gov>>
>>>wrote:
>>>Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
>>>is data related in there that can be used for sentiment analysis :)
>>>It can be adapted and is being used for that.
>>>
>>>Anyways, yes looking forward to the task. Please send in your proposal
>>>Madhawa.
>>>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>Chris Mattmann, Ph.D.
>>>Chief Architect
>>>Instrument Software and Science Data Systems Section (398)
>>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>Office: 168-519, Mailstop: 168-527
>>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>>WWW:
>
>
>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>Director, Information Retrieval and Data Science Group (IRDS)
>>>Adjunct Associate Professor, Computer Science Department
>>>University of Southern California, Los Angeles, CA 90089 USA
>>>WWW: http://irds.usc.edu/
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>
>>>
>>>
>>>
>>>-----Original Message-----
>>>From: Harshavardhan Manjunatha
>>><hm...@usc.edu>>
>>>Date: Friday, March 25, 2016 at 2:45 PM
>>>To: jpluser
>>><ch...@jpl.nasa.gov>>
>>>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>><de...@opennlp.apache.org>>, Information and
>>>Data Science Group USC List
>>><ir...@mymaillists.usc.edu>>,
>>>"kamalaku@usc.edu<ma...@usc.edu>"
>>><ka...@usc.edu>>,
>>>"dev@tika.apache.org<ma...@tika.apache.org>"
>>><de...@tika.apache.org>>
>>>Subject: Re: GSOC2016 Sentiment Analysis
>>>
>>>>Dear Prof. Mattmann,
>>>>
>>>>
>>>>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
>>>>Translation b/w Spanish & Englosh.
>>>>
>>>>
>>>>I dont think it can be adapted to Sentiment Analysis.
>>>>
>>>>
>>>>Developing a generic training model/corpus for Sentiment Analysis that
>>>>encapsulates social media, movie reviews, etc, etc will be a
>>>>Challenging
>>>>& Exciting Task !!
>>>>
>>>>
>>>>Regards,
>>>>Harsha
>>>>
>>>>
>>>>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
>>>><ch...@jpl.nasa.gov>>
>>>>wrote:
>>>>
>>>>Sounds great Harsha. This is for Google Summer of Code, so
>>>>collaborating
>>>>would be great, and in this case, we would be working with Madhawa,
>>>>should
>>>>he choose to accept.
>>>>
>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>Chris Mattmann, Ph.D.
>>>>Chief Architect
>>>>Instrument Software and Science Data Systems Section (398)
>>>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>>Office: 168-519, Mailstop: 168-527
>>>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>>>WWW:
>>>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>Director, Information Retrieval and Data Science Group (IRDS)
>>>>Adjunct Associate Professor, Computer Science Department
>>>>University of Southern California, Los Angeles, CA 90089 USA
>>>>WWW: http://irds.usc.edu/
>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>-----Original Message-----
>>>>From: Harshavardhan Manjunatha
>>>><hm...@usc.edu>>
>>>>Date: Friday, March 25, 2016 at 2:38 PM
>>>>To: jpluser
>>>><ch...@jpl.nasa.gov>>
>>>>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>>><de...@opennlp.apache.org>>, Information
>>>>and
>>>>Data Science Group USC List
>>>><ir...@mymaillists.usc.edu>>,
>>>>"kamalaku@usc.edu<ma...@usc.edu>"
>>>><ka...@usc.edu>>,
>>>>"dev@tika.apache.org<ma...@tika.apache.org>"
>>>><de...@tika.apache.org>>
>>>>Subject: Re: GSOC2016 Sentiment Analysis
>>>>
>>>>>Dear Prof. Mattmann,
>>>>>
>>>>>
>>>>>I would love to collaborate on this & am interested in developing
>>>>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>>>>>
>>>>>
>>>>>I have completed an Applied NLP course @ USC.
>>>>>
>>>>>
>>>>>I have done a Literature Review of Papers & Open Source Tools on the
>>>>>same
>>>>>recently.
>>>>>
>>>>>
>>>>>Regards,
>>>>>Harsha
>>>>>
>>>>>
>>>>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
>>>>><ch...@jpl.nasa.gov>>
>>>>>wrote:
>>>>>
>>>>>Hi Madhawa,
>>>>>
>>>>>
>>>>>
>>>>>So, how about a project that develops and contributes an Apache
>>>>>
>>>>>Tika and OpenNLP based SentimentAnalysisParser?
>>>>>
>>>>>
>>>>>
>>>>>I have some students currently doing work using the Fisher Callhome
>>>>>
>>>>>Corpus and you can build off that. I am CC’ing my USC IRDS team
>>>>>
>>>>>and my student Indhu who is working on this.
>>>>>
>>>>>
>>>>>
>>>>>Can you start working on your proposal by:
>>>>>
>>>>>
>>>>>
>>>>>1. Creating a JIRA issue here:
>>>>>
>>>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_
>>>>>j
>>>>>i
>>>>>r
>>>>>a
>>>>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=
>>>>>8
>>>>>l
>>>>>5
>>>>>6
>>>>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPB
>>>>>K
>>>>>1
>>>>>m
>>>>>s
>>>>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>>>>>
>>>>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>>>>>
>>>>>
>>>>>
>>>>>2. Develop a proposal on the Tika wiki here:
>>>>>
>>>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_ti
>>>>>k
>>>>>a
>>>>>_
>>>>>G
>>>>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W
>>>>>6
>>>>>E
>>>>>U
>>>>>8
>>>>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogP
>>>>>S
>>>>>o
>>>>>N
>>>>>h
>>>>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
>>>>> (you will need permission, first
>>>>>
>>>>>sign up for your account on the wiki then tell me your username so I
>>>>>
>>>>>can add permissions for you)
>>>>>
>>>>>
>>>>>
>>>>>3. Apply through the Google Summer of Code 2016 program.
>>>>>
>>>>>
>>>>>
>>>
>>>
>>>>>4. Get in touch with me, and Indhu, and keep
>>>>>dev@tika.a.o<ma...@tika.a.o> and
>>>>>
>>>>>dev@openlp.a.o<ma...@openlp.a.o> and
>>>>>irds-L@usc.edu<ma...@usc.edu> in the loop so we can discuss
>>>>>together
>>>>>
>>>>>as a community.
>>>>>
>>>>>
>>>>>
>>>>>Cool?
>>>>>
>>>>>
>>>>>
>>>>>Cheers,
>>>>>
>>>>>Chris
>>>>>
>>>>>
>>>>>
>>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>>
>>>>>Chris Mattmann, Ph.D.
>>>>>
>>>>>Chief Architect
>>>>>
>>>>>Instrument Software and Science Data Systems Section (398)
>>>>>
>>>>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>>>
>>>>>Office: 168-519, Mailstop: 168-527
>>>>>
>>>>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>>>>
>>>>>WWW:
>>>>
>>>>
>>>>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>>>>
>>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>>
>>>>>Director, Information Retrieval and Data Science Group (IRDS)
>>>>>
>>>>>Adjunct Associate Professor, Computer Science Department
>>>>>
>>>>>University of Southern California, Los Angeles, CA 90089 USA
>>>>>
>>>>>WWW: http://irds.usc.edu/
>>>>>
>>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>-----Original Message-----
>>>>>
>>>>>From: Madhawa Kasun Gunasekara
>>>>><ma...@gmail.com>>
>>>>>
>>>>>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>>>><de...@opennlp.apache.org>>
>>>>>
>>>>>Date: Wednesday, March 16, 2016 at 10:51 PM
>>>>>
>>>>>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>>>><de...@opennlp.apache.org>>
>>>>>
>>>>>Subject: GSOC2016 Sentiment Analysis
>>>>>
>>>>>
>>>>>
>>>>>>Hi
>>>>>
>>>>>>
>>>>>
>>>>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis"
>>>>>>for
>>>>>
>>>>>>GSOC2016 this time. Since i have been engaging with some similar
>>>>>>projects
>>>>>
>>>>>>i
>>>>>
>>>>>>think it will be a great experience for me.
>>>>>
>>>>>>
>>>>>
>>>>>>I am a final year student in IESL College of Engineering, Sri lanka.
>>>>>>I
>>>>>
>>>>>>have
>>>>>
>>>>>>learned machine learning and natural language processing stuff when
>>>>>>I'm
>>>>>
>>>>>>doing my first degree (Computer Science) in University of Sri
>>>>>
>>>>>>Jayewardhenapura.
>>>>>
>>>>>>
>>>>>
>>>>>>In my internship period, I have actively contributed to a Twitter
>>>>>>based
>>>>>
>>>>>>NLP
>>>>>
>>>>>>project. and We have published an article on IEEE Conference,
>>>>>>"Real-time
>>>>>
>>>>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2]
>>>>>>.
>>>>>
>>>>>>
>>>>>
>>>>>>Please let me know what you think and what you suggest.
>>>>>
>>>>>>
>>>>>
>>>>>>Please kindly give me further information on how I could proceed. I
>>>>>
>>>>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
>>>>>>Analysis
>>>>>
>>>>>>in Twitter: a Pattern-Based Approach"
>>>>>
>>>>>>[1]
>>>>>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org
>>>>>_
>>>>>j
>>>>>i
>>>>>r
>>>>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7C
>>>>>S
>>>>>f
>>>>>n
>>>>>c
>>>>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwp
>>>>>R
>>>>>8
>>>>>t
>>>>>0
>>>>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>>>>
>>>>
>>>>><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.or
>>>>>g
>>>>>_
>>>>>j
>>>>>i
>>>>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7
>>>>>C
>>>>>S
>>>>>f
>>>>>n
>>>>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOw
>>>>>p
>>>>>R
>>>>>8
>>>>>t
>
>
>>>>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>>>>>
>>>>>>[2]
>>>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.or
>>>>>g
>>>>>_
>>>>>x
>>>>>p
>>>>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIg
>>>>>v
>>>>>i
>>>>>0
>>>>>N
>>>>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_r
>>>>>L
>>>>>N
>>>>>i
>>>>>Y
>>>>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>>>>><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.o
>>>>>r
>>>>>g
>>>>>_
>>>>>x
>>>>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVI
>>>>>g
>>>>>v
>>>>>i
>>>>>0
>>>>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_
>>>>>r
>>>>>L
>>>>>N
>>>>>i
>>>>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>>>>>
>>>>>>
>>>>>
>>>>>>Thanks
>>>>>
>>>>>>Madhawa Gunasekara
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>
>>
>>
>>
>>
>>
>>
>>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by Nishant Kelkar <ni...@gmail.com>.
Sorry about that, yes, I did mean Chris.

Best Regards,
Nishant

On Sun, Mar 27, 2016 at 12:18 PM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> I think you mean “Chris”, not “Matt”, right?
>
> Additionally, discussion, even related to GSoC should happen
> on the dev list. As a dev list, OpenNLP traffic is fairly
> non-existent. So, having some discussion about Google Summer
> of Code and a project that will use OpenNLP in that effort
> is a good thing and discussion should happen on the dev list for
> the community. In this case, the conversation involves several
> communities (Tika, OpenNLP, etc.)
>
> Chris
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Nishant Kelkar <ni...@gmail.com>
> Date: Sunday, March 27, 2016 at 12:07 PM
> To: jpluser <ch...@jpl.nasa.gov>
> Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Madhawa Kasun
> Gunasekara <ma...@gmail.com>, Harshavardhan Manjunatha
> <hm...@usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>
> Subject: Re: GSOC2016 Sentiment Analysis
>
> >Matt,
> >
> >
> >What I thought was that the dev. mailing list (per my limited knowledge)
> >was for initiation of code-related tickets, bug-filing, feature
> >discussions, etc.
> >
> >
> >From the discussion in the thread above, it seems like you've reached a
> >consensus on what project you'd like to work on, what team members will
> >be participating, etc. All I meant was to then take the discussion into a
> >more closed group (e.g. you and the
> > participants of this GSoC project). If you come across an OpenNLP-wide
> >code-related issue, then I'd love to see such an email in my email.
> >However, questions about logging in, for example, can happen in a more
> >private group of people (or maybe on a user@ mailing
> > list level), not on a dev@ level.
> >
> >
> >But maybe I am wrong, sorry if that is the case.
> >
> >
> >Best Regards,
> >Nishant
> >
> >
> >On Sun, Mar 27, 2016 at 11:58 AM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov> wrote:
> >
> >Nishant,
> >
> >I’m not sure what you are talking about, at all. It’s part of the
> >engagement process in GSoC to *engage the community*. At Apache
> >this is done on list.
> >
> >I’ve been on this list for months and there is about 0..traffic.
> >Which is not good. Traffic, like this, *is good*. It shows there
> >is a healthy community that actually discusses things.
> >
> >Madhawa, you don’t need to take this conversation off list, and
> >precisely the opposite. The conversation must be kept on list.
> >
> >Chris
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Chris Mattmann, Ph.D.
> >Chief Architect
> >Instrument Software and Science Data Systems Section (398)
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >Office: 168-519, Mailstop: 168-527
> >Email: chris.a.mattmann@nasa.gov
> >WWW:
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Director, Information Retrieval and Data Science Group (IRDS)
> >Adjunct Associate Professor, Computer Science Department
> >University of Southern California, Los Angeles, CA 90089 USA
> >WWW: http://irds.usc.edu/
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> >
> >
> >-----Original Message-----
> >From: Nishant Kelkar <ni...@gmail.com>
> >Date: Sunday, March 27, 2016 at 11:44 AM
> >To: <de...@opennlp.apache.org>
> >Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, Harshavardhan
> >Manjunatha <hm...@usc.edu>, Information and Data Science Group USC
> List
> ><ir...@mymaillists.usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>,
> >"dev@tika.apache.org" <de...@tika.apache.org>
> >Subject: Re: GSOC2016 Sentiment Analysis
> >
> >>Hi Madhawa,
> >>Could you take this discussion off the dev openNLP list for other
> >>problems concerning logging in, participation, etc. now that you have a
> >>positive response? In my humble opinion, that would prevent others not
> >>involved in your discussion from getting email about the topic.
> >>
> >>Good luck!
> >>
> >>Best Regards,
> >>Nishant
> >>
> >>
> >>On Sun, Mar 27, 2016 at 6:37 AM, Mattmann, Chris A (3980)
> >><ch...@jpl.nasa.gov> wrote:
> >>
> >>Thanks please can you create a username with no spaces?
> >>
> >>Sent from my iPhone
> >>
> >>On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara
> >><ma...@gmail.com>> wrote:
> >>
> >>Hi Chris,
> >>
> >>Thanks for the reply, I tried to logging to [1], but I couldn't able to
> >>login into that my username is "Madhawa Gunasekara"
> >>[1]
> >https://wiki.apache.org/tika/GSoC2016
> ><https://wiki.apache.org/tika/GSoC2016>
> >>
> >>I have created a jira issue on
> >>https://issues.apache.org/jira/browse/TIKA-1911
> >>
> >>Thanks,
> >>Madhawa
> >>
> >>Madhawa
> >>
> >>On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980)
> >><ch...@jpl.nasa.gov>>
> >>wrote:
> >>Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
> >>is data related in there that can be used for sentiment analysis :)
> >>It can be adapted and is being used for that.
> >>
> >>Anyways, yes looking forward to the task. Please send in your proposal
> >>Madhawa.
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>Chris Mattmann, Ph.D.
> >>Chief Architect
> >>Instrument Software and Science Data Systems Section (398)
> >>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>Office: 168-519, Mailstop: 168-527
> >>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >>WWW:
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>Director, Information Retrieval and Data Science Group (IRDS)
> >>Adjunct Associate Professor, Computer Science Department
> >>University of Southern California, Los Angeles, CA 90089 USA
> >>WWW: http://irds.usc.edu/
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>
> >>
> >>
> >>
> >>-----Original Message-----
> >>From: Harshavardhan Manjunatha
> >><hm...@usc.edu>>
> >>Date: Friday, March 25, 2016 at 2:45 PM
> >>To: jpluser
> >><ch...@jpl.nasa.gov>>
> >>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
> >><de...@opennlp.apache.org>>, Information and
> >>Data Science Group USC List
> >><ir...@mymaillists.usc.edu>>,
> >>"kamalaku@usc.edu<ma...@usc.edu>"
> >><ka...@usc.edu>>,
> >>"dev@tika.apache.org<ma...@tika.apache.org>"
> >><de...@tika.apache.org>>
> >>Subject: Re: GSOC2016 Sentiment Analysis
> >>
> >>>Dear Prof. Mattmann,
> >>>
> >>>
> >>>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
> >>>Translation b/w Spanish & Englosh.
> >>>
> >>>
> >>>I dont think it can be adapted to Sentiment Analysis.
> >>>
> >>>
> >>>Developing a generic training model/corpus for Sentiment Analysis that
> >>>encapsulates social media, movie reviews, etc, etc will be a Challenging
> >>>& Exciting Task !!
> >>>
> >>>
> >>>Regards,
> >>>Harsha
> >>>
> >>>
> >>>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
> >>><ch...@jpl.nasa.gov>>
> >>>wrote:
> >>>
> >>>Sounds great Harsha. This is for Google Summer of Code, so collaborating
> >>>would be great, and in this case, we would be working with Madhawa,
> >>>should
> >>>he choose to accept.
> >>>
> >>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>Chris Mattmann, Ph.D.
> >>>Chief Architect
> >>>Instrument Software and Science Data Systems Section (398)
> >>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>>Office: 168-519, Mailstop: 168-527
> >>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >>>WWW:
> >>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>Director, Information Retrieval and Data Science Group (IRDS)
> >>>Adjunct Associate Professor, Computer Science Department
> >>>University of Southern California, Los Angeles, CA 90089 USA
> >>>WWW: http://irds.usc.edu/
> >>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>-----Original Message-----
> >>>From: Harshavardhan Manjunatha
> >>><hm...@usc.edu>>
> >>>Date: Friday, March 25, 2016 at 2:38 PM
> >>>To: jpluser
> >>><ch...@jpl.nasa.gov>>
> >>>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
> >>><de...@opennlp.apache.org>>, Information
> and
> >>>Data Science Group USC List
> >>><ir...@mymaillists.usc.edu>>,
> >>>"kamalaku@usc.edu<ma...@usc.edu>"
> >>><ka...@usc.edu>>,
> >>>"dev@tika.apache.org<ma...@tika.apache.org>"
> >>><de...@tika.apache.org>>
> >>>Subject: Re: GSOC2016 Sentiment Analysis
> >>>
> >>>>Dear Prof. Mattmann,
> >>>>
> >>>>
> >>>>I would love to collaborate on this & am interested in developing
> >>>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
> >>>>
> >>>>
> >>>>I have completed an Applied NLP course @ USC.
> >>>>
> >>>>
> >>>>I have done a Literature Review of Papers & Open Source Tools on the
> >>>>same
> >>>>recently.
> >>>>
> >>>>
> >>>>Regards,
> >>>>Harsha
> >>>>
> >>>>
> >>>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
> >>>><ch...@jpl.nasa.gov>>
> >>>>wrote:
> >>>>
> >>>>Hi Madhawa,
> >>>>
> >>>>
> >>>>
> >>>>So, how about a project that develops and contributes an Apache
> >>>>
> >>>>Tika and OpenNLP based SentimentAnalysisParser?
> >>>>
> >>>>
> >>>>
> >>>>I have some students currently doing work using the Fisher Callhome
> >>>>
> >>>>Corpus and you can build off that. I am CC’ing my USC IRDS team
> >>>>
> >>>>and my student Indhu who is working on this.
> >>>>
> >>>>
> >>>>
> >>>>Can you start working on your proposal by:
> >>>>
> >>>>
> >>>>
> >>>>1. Creating a JIRA issue here:
> >>>>
> >>>>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_j
> >>>>i
> >>>>r
> >>>>a
> >>>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8
> >>>>l
> >>>>5
> >>>>6
> >>>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK
> >>>>1
> >>>>m
> >>>>s
> >>>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
> >>>>
> >>>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
> >>>>
> >>>>
> >>>>
> >>>>2. Develop a proposal on the Tika wiki here:
> >>>>
> >>>>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tik
> >>>>a
> >>>>_
> >>>>G
> >>>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6
> >>>>E
> >>>>U
> >>>>8
> >>>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPS
> >>>>o
> >>>>N
> >>>>h
> >>>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> >>>> (you will need permission, first
> >>>>
> >>>>sign up for your account on the wiki then tell me your username so I
> >>>>
> >>>>can add permissions for you)
> >>>>
> >>>>
> >>>>
> >>>>3. Apply through the Google Summer of Code 2016 program.
> >>>>
> >>>>
> >>>>
> >>
> >>
> >>>>4. Get in touch with me, and Indhu, and keep
> >>>>dev@tika.a.o<ma...@tika.a.o> and
> >>>>
> >>>>dev@openlp.a.o<ma...@openlp.a.o> and
> >>>>irds-L@usc.edu<ma...@usc.edu> in the loop so we can discuss
> >>>>together
> >>>>
> >>>>as a community.
> >>>>
> >>>>
> >>>>
> >>>>Cool?
> >>>>
> >>>>
> >>>>
> >>>>Cheers,
> >>>>
> >>>>Chris
> >>>>
> >>>>
> >>>>
> >>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>>
> >>>>Chris Mattmann, Ph.D.
> >>>>
> >>>>Chief Architect
> >>>>
> >>>>Instrument Software and Science Data Systems Section (398)
> >>>>
> >>>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>>>
> >>>>Office: 168-519, Mailstop: 168-527
> >>>>
> >>>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >>>>
> >>>>WWW:
> >>>
> >>>
> >>>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>>>
> >>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>>
> >>>>Director, Information Retrieval and Data Science Group (IRDS)
> >>>>
> >>>>Adjunct Associate Professor, Computer Science Department
> >>>>
> >>>>University of Southern California, Los Angeles, CA 90089 USA
> >>>>
> >>>>WWW: http://irds.usc.edu/
> >>>>
> >>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>-----Original Message-----
> >>>>
> >>>>From: Madhawa Kasun Gunasekara
> >>>><ma...@gmail.com>>
> >>>>
> >>>>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
> >>>><de...@opennlp.apache.org>>
> >>>>
> >>>>Date: Wednesday, March 16, 2016 at 10:51 PM
> >>>>
> >>>>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
> >>>><de...@opennlp.apache.org>>
> >>>>
> >>>>Subject: GSOC2016 Sentiment Analysis
> >>>>
> >>>>
> >>>>
> >>>>>Hi
> >>>>
> >>>>>
> >>>>
> >>>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis"
> >>>>>for
> >>>>
> >>>>>GSOC2016 this time. Since i have been engaging with some similar
> >>>>>projects
> >>>>
> >>>>>i
> >>>>
> >>>>>think it will be a great experience for me.
> >>>>
> >>>>>
> >>>>
> >>>>>I am a final year student in IESL College of Engineering, Sri lanka. I
> >>>>
> >>>>>have
> >>>>
> >>>>>learned machine learning and natural language processing stuff when
> >>>>>I'm
> >>>>
> >>>>>doing my first degree (Computer Science) in University of Sri
> >>>>
> >>>>>Jayewardhenapura.
> >>>>
> >>>>>
> >>>>
> >>>>>In my internship period, I have actively contributed to a Twitter
> >>>>>based
> >>>>
> >>>>>NLP
> >>>>
> >>>>>project. and We have published an article on IEEE Conference,
> >>>>>"Real-time
> >>>>
> >>>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2]
> >>>>>.
> >>>>
> >>>>>
> >>>>
> >>>>>Please let me know what you think and what you suggest.
> >>>>
> >>>>>
> >>>>
> >>>>>Please kindly give me further information on how I could proceed. I
> >>>>
> >>>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
> >>>>>Analysis
> >>>>
> >>>>>in Twitter: a Pattern-Based Approach"
> >>>>
> >>>>>[1]
> >>>>
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_
> >>>>j
> >>>>i
> >>>>r
> >>>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CS
> >>>>f
> >>>>n
> >>>>c
> >>>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR
> >>>>8
> >>>>t
> >>>>0
> >>>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
> >>>
> >>>
> >>>><
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org
> >>>>_
> >>>>j
> >>>>i
> >>>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7C
> >>>>S
> >>>>f
> >>>>n
> >>>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwp
> >>>>R
> >>>>8
> >>>>t
> >>>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
> >>>>
> >>>>>[2]
> >>>>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org
> >>>>_
> >>>>x
> >>>>p
> >>>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgv
> >>>>i
> >>>>0
> >>>>N
> >>>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rL
> >>>>N
> >>>>i
> >>>>Y
> >>>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
> >>>><
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.or
> >>>>g
> >>>>_
> >>>>x
> >>>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIg
> >>>>v
> >>>>i
> >>>>0
> >>>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_r
> >>>>L
> >>>>N
> >>>>i
> >>>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
> >>>>
> >>>>>
> >>>>
> >>>>>Thanks
> >>>>
> >>>>>Madhawa Gunasekara
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >
> >
> >
> >
> >
> >
> >
> >
>
>

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
I think you mean “Chris”, not “Matt”, right?

Additionally, discussion, even related to GSoC should happen
on the dev list. As a dev list, OpenNLP traffic is fairly
non-existent. So, having some discussion about Google Summer
of Code and a project that will use OpenNLP in that effort
is a good thing and discussion should happen on the dev list for
the community. In this case, the conversation involves several
communities (Tika, OpenNLP, etc.)

Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Nishant Kelkar <ni...@gmail.com>
Date: Sunday, March 27, 2016 at 12:07 PM
To: jpluser <ch...@jpl.nasa.gov>
Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Madhawa Kasun
Gunasekara <ma...@gmail.com>, Harshavardhan Manjunatha
<hm...@usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>
Subject: Re: GSOC2016 Sentiment Analysis

>Matt,
>
>
>What I thought was that the dev. mailing list (per my limited knowledge)
>was for initiation of code-related tickets, bug-filing, feature
>discussions, etc.
>
>
>From the discussion in the thread above, it seems like you've reached a
>consensus on what project you'd like to work on, what team members will
>be participating, etc. All I meant was to then take the discussion into a
>more closed group (e.g. you and the
> participants of this GSoC project). If you come across an OpenNLP-wide
>code-related issue, then I'd love to see such an email in my email.
>However, questions about logging in, for example, can happen in a more
>private group of people (or maybe on a user@ mailing
> list level), not on a dev@ level.
>
>
>But maybe I am wrong, sorry if that is the case.
>
>
>Best Regards,
>Nishant
>
>
>On Sun, Mar 27, 2016 at 11:58 AM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Nishant,
>
>I’m not sure what you are talking about, at all. It’s part of the
>engagement process in GSoC to *engage the community*. At Apache
>this is done on list.
>
>I’ve been on this list for months and there is about 0..traffic.
>Which is not good. Traffic, like this, *is good*. It shows there
>is a healthy community that actually discusses things.
>
>Madhawa, you don’t need to take this conversation off list, and
>precisely the opposite. The conversation must be kept on list.
>
>Chris
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov
>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Nishant Kelkar <ni...@gmail.com>
>Date: Sunday, March 27, 2016 at 11:44 AM
>To: <de...@opennlp.apache.org>
>Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, Harshavardhan
>Manjunatha <hm...@usc.edu>, Information and Data Science Group USC List
><ir...@mymaillists.usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>,
>"dev@tika.apache.org" <de...@tika.apache.org>
>Subject: Re: GSOC2016 Sentiment Analysis
>
>>Hi Madhawa,
>>Could you take this discussion off the dev openNLP list for other
>>problems concerning logging in, participation, etc. now that you have a
>>positive response? In my humble opinion, that would prevent others not
>>involved in your discussion from getting email about the topic.
>>
>>Good luck!
>>
>>Best Regards,
>>Nishant
>>
>>
>>On Sun, Mar 27, 2016 at 6:37 AM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov> wrote:
>>
>>Thanks please can you create a username with no spaces?
>>
>>Sent from my iPhone
>>
>>On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara
>><ma...@gmail.com>> wrote:
>>
>>Hi Chris,
>>
>>Thanks for the reply, I tried to logging to [1], but I couldn't able to
>>login into that my username is "Madhawa Gunasekara"
>>[1] 
>https://wiki.apache.org/tika/GSoC2016
><https://wiki.apache.org/tika/GSoC2016>
>>
>>I have created a jira issue on
>>https://issues.apache.org/jira/browse/TIKA-1911
>>
>>Thanks,
>>Madhawa
>>
>>Madhawa
>>
>>On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov>>
>>wrote:
>>Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
>>is data related in there that can be used for sentiment analysis :)
>>It can be adapted and is being used for that.
>>
>>Anyways, yes looking forward to the task. Please send in your proposal
>>Madhawa.
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>Chris Mattmann, Ph.D.
>>Chief Architect
>>Instrument Software and Science Data Systems Section (398)
>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>Office: 168-519, Mailstop: 168-527
>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>Director, Information Retrieval and Data Science Group (IRDS)
>>Adjunct Associate Professor, Computer Science Department
>>University of Southern California, Los Angeles, CA 90089 USA
>>WWW: http://irds.usc.edu/
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>-----Original Message-----
>>From: Harshavardhan Manjunatha
>><hm...@usc.edu>>
>>Date: Friday, March 25, 2016 at 2:45 PM
>>To: jpluser
>><ch...@jpl.nasa.gov>>
>>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>><de...@opennlp.apache.org>>, Information and
>>Data Science Group USC List
>><ir...@mymaillists.usc.edu>>,
>>"kamalaku@usc.edu<ma...@usc.edu>"
>><ka...@usc.edu>>,
>>"dev@tika.apache.org<ma...@tika.apache.org>"
>><de...@tika.apache.org>>
>>Subject: Re: GSOC2016 Sentiment Analysis
>>
>>>Dear Prof. Mattmann,
>>>
>>>
>>>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
>>>Translation b/w Spanish & Englosh.
>>>
>>>
>>>I dont think it can be adapted to Sentiment Analysis.
>>>
>>>
>>>Developing a generic training model/corpus for Sentiment Analysis that
>>>encapsulates social media, movie reviews, etc, etc will be a Challenging
>>>& Exciting Task !!
>>>
>>>
>>>Regards,
>>>Harsha
>>>
>>>
>>>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
>>><ch...@jpl.nasa.gov>>
>>>wrote:
>>>
>>>Sounds great Harsha. This is for Google Summer of Code, so collaborating
>>>would be great, and in this case, we would be working with Madhawa,
>>>should
>>>he choose to accept.
>>>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>Chris Mattmann, Ph.D.
>>>Chief Architect
>>>Instrument Software and Science Data Systems Section (398)
>>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>Office: 168-519, Mailstop: 168-527
>>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>>WWW:
>>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>Director, Information Retrieval and Data Science Group (IRDS)
>>>Adjunct Associate Professor, Computer Science Department
>>>University of Southern California, Los Angeles, CA 90089 USA
>>>WWW: http://irds.usc.edu/
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>
>>>
>>>
>>>
>>>-----Original Message-----
>>>From: Harshavardhan Manjunatha
>>><hm...@usc.edu>>
>>>Date: Friday, March 25, 2016 at 2:38 PM
>>>To: jpluser
>>><ch...@jpl.nasa.gov>>
>>>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>><de...@opennlp.apache.org>>, Information and
>>>Data Science Group USC List
>>><ir...@mymaillists.usc.edu>>,
>>>"kamalaku@usc.edu<ma...@usc.edu>"
>>><ka...@usc.edu>>,
>>>"dev@tika.apache.org<ma...@tika.apache.org>"
>>><de...@tika.apache.org>>
>>>Subject: Re: GSOC2016 Sentiment Analysis
>>>
>>>>Dear Prof. Mattmann,
>>>>
>>>>
>>>>I would love to collaborate on this & am interested in developing
>>>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>>>>
>>>>
>>>>I have completed an Applied NLP course @ USC.
>>>>
>>>>
>>>>I have done a Literature Review of Papers & Open Source Tools on the
>>>>same
>>>>recently.
>>>>
>>>>
>>>>Regards,
>>>>Harsha
>>>>
>>>>
>>>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
>>>><ch...@jpl.nasa.gov>>
>>>>wrote:
>>>>
>>>>Hi Madhawa,
>>>>
>>>>
>>>>
>>>>So, how about a project that develops and contributes an Apache
>>>>
>>>>Tika and OpenNLP based SentimentAnalysisParser?
>>>>
>>>>
>>>>
>>>>I have some students currently doing work using the Fisher Callhome
>>>>
>>>>Corpus and you can build off that. I am CC’ing my USC IRDS team
>>>>
>>>>and my student Indhu who is working on this.
>>>>
>>>>
>>>>
>>>>Can you start working on your proposal by:
>>>>
>>>>
>>>>
>>>>1. Creating a JIRA issue here:
>>>>
>>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_j
>>>>i
>>>>r
>>>>a
>>>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8
>>>>l
>>>>5
>>>>6
>>>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK
>>>>1
>>>>m
>>>>s
>>>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>>>>
>>>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>>>>
>>>>
>>>>
>>>>2. Develop a proposal on the Tika wiki here:
>>>>
>>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tik
>>>>a
>>>>_
>>>>G
>>>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6
>>>>E
>>>>U
>>>>8
>>>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPS
>>>>o
>>>>N
>>>>h
>>>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
>>>> (you will need permission, first
>>>>
>>>>sign up for your account on the wiki then tell me your username so I
>>>>
>>>>can add permissions for you)
>>>>
>>>>
>>>>
>>>>3. Apply through the Google Summer of Code 2016 program.
>>>>
>>>>
>>>>
>>
>>
>>>>4. Get in touch with me, and Indhu, and keep
>>>>dev@tika.a.o<ma...@tika.a.o> and
>>>>
>>>>dev@openlp.a.o<ma...@openlp.a.o> and
>>>>irds-L@usc.edu<ma...@usc.edu> in the loop so we can discuss
>>>>together
>>>>
>>>>as a community.
>>>>
>>>>
>>>>
>>>>Cool?
>>>>
>>>>
>>>>
>>>>Cheers,
>>>>
>>>>Chris
>>>>
>>>>
>>>>
>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>
>>>>Chris Mattmann, Ph.D.
>>>>
>>>>Chief Architect
>>>>
>>>>Instrument Software and Science Data Systems Section (398)
>>>>
>>>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>>
>>>>Office: 168-519, Mailstop: 168-527
>>>>
>>>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>>>
>>>>WWW:
>>>
>>>
>>>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>>>
>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>
>>>>Director, Information Retrieval and Data Science Group (IRDS)
>>>>
>>>>Adjunct Associate Professor, Computer Science Department
>>>>
>>>>University of Southern California, Los Angeles, CA 90089 USA
>>>>
>>>>WWW: http://irds.usc.edu/
>>>>
>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>-----Original Message-----
>>>>
>>>>From: Madhawa Kasun Gunasekara
>>>><ma...@gmail.com>>
>>>>
>>>>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>>><de...@opennlp.apache.org>>
>>>>
>>>>Date: Wednesday, March 16, 2016 at 10:51 PM
>>>>
>>>>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>>><de...@opennlp.apache.org>>
>>>>
>>>>Subject: GSOC2016 Sentiment Analysis
>>>>
>>>>
>>>>
>>>>>Hi
>>>>
>>>>>
>>>>
>>>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis"
>>>>>for
>>>>
>>>>>GSOC2016 this time. Since i have been engaging with some similar
>>>>>projects
>>>>
>>>>>i
>>>>
>>>>>think it will be a great experience for me.
>>>>
>>>>>
>>>>
>>>>>I am a final year student in IESL College of Engineering, Sri lanka. I
>>>>
>>>>>have
>>>>
>>>>>learned machine learning and natural language processing stuff when
>>>>>I'm
>>>>
>>>>>doing my first degree (Computer Science) in University of Sri
>>>>
>>>>>Jayewardhenapura.
>>>>
>>>>>
>>>>
>>>>>In my internship period, I have actively contributed to a Twitter
>>>>>based
>>>>
>>>>>NLP
>>>>
>>>>>project. and We have published an article on IEEE Conference,
>>>>>"Real-time
>>>>
>>>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2]
>>>>>.
>>>>
>>>>>
>>>>
>>>>>Please let me know what you think and what you suggest.
>>>>
>>>>>
>>>>
>>>>>Please kindly give me further information on how I could proceed. I
>>>>
>>>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
>>>>>Analysis
>>>>
>>>>>in Twitter: a Pattern-Based Approach"
>>>>
>>>>>[1]
>>>>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_
>>>>j
>>>>i
>>>>r
>>>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CS
>>>>f
>>>>n
>>>>c
>>>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR
>>>>8
>>>>t
>>>>0
>>>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>>>
>>>
>>>><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org
>>>>_
>>>>j
>>>>i
>>>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7C
>>>>S
>>>>f
>>>>n
>>>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwp
>>>>R
>>>>8
>>>>t
>>>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>>>>
>>>>>[2]
>>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org
>>>>_
>>>>x
>>>>p
>>>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgv
>>>>i
>>>>0
>>>>N
>>>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rL
>>>>N
>>>>i
>>>>Y
>>>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>>>><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.or
>>>>g
>>>>_
>>>>x
>>>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIg
>>>>v
>>>>i
>>>>0
>>>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_r
>>>>L
>>>>N
>>>>i
>>>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>>>>
>>>>>
>>>>
>>>>>Thanks
>>>>
>>>>>Madhawa Gunasekara
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>
>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by Nishant Kelkar <ni...@gmail.com>.
Matt,

What I thought was that the dev. mailing list (per my limited knowledge)
was for initiation of code-related tickets, bug-filing, feature
discussions, etc.

>From the discussion in the thread above, it seems like you've reached a
consensus on what project you'd like to work on, what team members will be
participating, etc. All I meant was to then take the discussion into a more
closed group (e.g. you and the participants of this GSoC project). If you
come across an OpenNLP-wide code-related issue, then I'd love to see such
an email in my email. However, questions about logging in, for example, can
happen in a more private group of people (or maybe on a user@ mailing list
level), not on a dev@ level.

But maybe I am wrong, sorry if that is the case.

Best Regards,
Nishant

On Sun, Mar 27, 2016 at 11:58 AM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Nishant,
>
> I’m not sure what you are talking about, at all. It’s part of the
> engagement process in GSoC to *engage the community*. At Apache
> this is done on list.
>
> I’ve been on this list for months and there is about 0..traffic.
> Which is not good. Traffic, like this, *is good*. It shows there
> is a healthy community that actually discusses things.
>
> Madhawa, you don’t need to take this conversation off list, and
> precisely the opposite. The conversation must be kept on list.
>
> Chris
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Nishant Kelkar <ni...@gmail.com>
> Date: Sunday, March 27, 2016 at 11:44 AM
> To: <de...@opennlp.apache.org>
> Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, Harshavardhan
> Manjunatha <hm...@usc.edu>, Information and Data Science Group USC List
> <ir...@mymaillists.usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>,
> "dev@tika.apache.org" <de...@tika.apache.org>
> Subject: Re: GSOC2016 Sentiment Analysis
>
> >Hi Madhawa,
> >Could you take this discussion off the dev openNLP list for other
> >problems concerning logging in, participation, etc. now that you have a
> >positive response? In my humble opinion, that would prevent others not
> >involved in your discussion from getting email about the topic.
> >
> >Good luck!
> >
> >Best Regards,
> >Nishant
> >
> >
> >On Sun, Mar 27, 2016 at 6:37 AM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov> wrote:
> >
> >Thanks please can you create a username with no spaces?
> >
> >Sent from my iPhone
> >
> >On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara
> ><ma...@gmail.com>> wrote:
> >
> >Hi Chris,
> >
> >Thanks for the reply, I tried to logging to [1], but I couldn't able to
> >login into that my username is "Madhawa Gunasekara"
> >[1] https://wiki.apache.org/tika/GSoC2016
> >
> >I have created a jira issue on
> >https://issues.apache.org/jira/browse/TIKA-1911
> >
> >Thanks,
> >Madhawa
> >
> >Madhawa
> >
> >On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov>>
> >wrote:
> >Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
> >is data related in there that can be used for sentiment analysis :)
> >It can be adapted and is being used for that.
> >
> >Anyways, yes looking forward to the task. Please send in your proposal
> >Madhawa.
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Chris Mattmann, Ph.D.
> >Chief Architect
> >Instrument Software and Science Data Systems Section (398)
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >Office: 168-519, Mailstop: 168-527
> >Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >WWW:  http://sunset.usc.edu/~mattmann/
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Director, Information Retrieval and Data Science Group (IRDS)
> >Adjunct Associate Professor, Computer Science Department
> >University of Southern California, Los Angeles, CA 90089 USA
> >WWW: http://irds.usc.edu/
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> >
> >
> >-----Original Message-----
> >From: Harshavardhan Manjunatha <hmanjuna@usc.edu<mailto:hmanjuna@usc.edu
> >>
> >Date: Friday, March 25, 2016 at 2:45 PM
> >To: jpluser
> ><ch...@jpl.nasa.gov>>
> >Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
> ><de...@opennlp.apache.org>>, Information and
> >Data Science Group USC List
> ><ir...@mymaillists.usc.edu>>,
> >"kamalaku@usc.edu<ma...@usc.edu>"
> ><ka...@usc.edu>>,
> >"dev@tika.apache.org<ma...@tika.apache.org>"
> ><de...@tika.apache.org>>
> >Subject: Re: GSOC2016 Sentiment Analysis
> >
> >>Dear Prof. Mattmann,
> >>
> >>
> >>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
> >>Translation b/w Spanish & Englosh.
> >>
> >>
> >>I dont think it can be adapted to Sentiment Analysis.
> >>
> >>
> >>Developing a generic training model/corpus for Sentiment Analysis that
> >>encapsulates social media, movie reviews, etc, etc will be a Challenging
> >>& Exciting Task !!
> >>
> >>
> >>Regards,
> >>Harsha
> >>
> >>
> >>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
> >><ch...@jpl.nasa.gov>>
> >>wrote:
> >>
> >>Sounds great Harsha. This is for Google Summer of Code, so collaborating
> >>would be great, and in this case, we would be working with Madhawa,
> >>should
> >>he choose to accept.
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>Chris Mattmann, Ph.D.
> >>Chief Architect
> >>Instrument Software and Science Data Systems Section (398)
> >>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>Office: 168-519, Mailstop: 168-527
> >>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >>WWW:
> >>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>Director, Information Retrieval and Data Science Group (IRDS)
> >>Adjunct Associate Professor, Computer Science Department
> >>University of Southern California, Los Angeles, CA 90089 USA
> >>WWW: http://irds.usc.edu/
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>
> >>
> >>
> >>
> >>-----Original Message-----
> >>From: Harshavardhan Manjunatha
> >><hm...@usc.edu>>
> >>Date: Friday, March 25, 2016 at 2:38 PM
> >>To: jpluser
> >><ch...@jpl.nasa.gov>>
> >>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
> >><de...@opennlp.apache.org>>, Information and
> >>Data Science Group USC List
> >><ir...@mymaillists.usc.edu>>,
> >>"kamalaku@usc.edu<ma...@usc.edu>"
> >><ka...@usc.edu>>,
> >>"dev@tika.apache.org<ma...@tika.apache.org>"
> >><de...@tika.apache.org>>
> >>Subject: Re: GSOC2016 Sentiment Analysis
> >>
> >>>Dear Prof. Mattmann,
> >>>
> >>>
> >>>I would love to collaborate on this & am interested in developing
> >>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
> >>>
> >>>
> >>>I have completed an Applied NLP course @ USC.
> >>>
> >>>
> >>>I have done a Literature Review of Papers & Open Source Tools on the
> >>>same
> >>>recently.
> >>>
> >>>
> >>>Regards,
> >>>Harsha
> >>>
> >>>
> >>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
> >>><ch...@jpl.nasa.gov>>
> >>>wrote:
> >>>
> >>>Hi Madhawa,
> >>>
> >>>
> >>>
> >>>So, how about a project that develops and contributes an Apache
> >>>
> >>>Tika and OpenNLP based SentimentAnalysisParser?
> >>>
> >>>
> >>>
> >>>I have some students currently doing work using the Fisher Callhome
> >>>
> >>>Corpus and you can build off that. I am CC’ing my USC IRDS team
> >>>
> >>>and my student Indhu who is working on this.
> >>>
> >>>
> >>>
> >>>Can you start working on your proposal by:
> >>>
> >>>
> >>>
> >>>1. Creating a JIRA issue here:
> >>>
> >>>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_ji
> >>>r
> >>>a
> >>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l
> >>>5
> >>>6
> >>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1
> >>>m
> >>>s
> >>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
> >>>
> >>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
> >>>
> >>>
> >>>
> >>>2. Develop a proposal on the Tika wiki here:
> >>>
> >>>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika
> >>>_
> >>>G
> >>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6E
> >>>U
> >>>8
> >>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSo
> >>>N
> >>>h
> >>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> >>> (you will need permission, first
> >>>
> >>>sign up for your account on the wiki then tell me your username so I
> >>>
> >>>can add permissions for you)
> >>>
> >>>
> >>>
> >>>3. Apply through the Google Summer of Code 2016 program.
> >>>
> >>>
> >>>
> >
> >
> >>>4. Get in touch with me, and Indhu, and keep
> >>>dev@tika.a.o<ma...@tika.a.o> and
> >>>
> >>>dev@openlp.a.o<ma...@openlp.a.o> and
> >>>irds-L@usc.edu<ma...@usc.edu> in the loop so we can discuss
> >>>together
> >>>
> >>>as a community.
> >>>
> >>>
> >>>
> >>>Cool?
> >>>
> >>>
> >>>
> >>>Cheers,
> >>>
> >>>Chris
> >>>
> >>>
> >>>
> >>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>
> >>>Chris Mattmann, Ph.D.
> >>>
> >>>Chief Architect
> >>>
> >>>Instrument Software and Science Data Systems Section (398)
> >>>
> >>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>>
> >>>Office: 168-519, Mailstop: 168-527
> >>>
> >>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >>>
> >>>WWW:
> >>
> >>
> >>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>>
> >>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>
> >>>Director, Information Retrieval and Data Science Group (IRDS)
> >>>
> >>>Adjunct Associate Professor, Computer Science Department
> >>>
> >>>University of Southern California, Los Angeles, CA 90089 USA
> >>>
> >>>WWW: http://irds.usc.edu/
> >>>
> >>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>-----Original Message-----
> >>>
> >>>From: Madhawa Kasun Gunasekara
> >>><ma...@gmail.com>>
> >>>
> >>>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
> >>><de...@opennlp.apache.org>>
> >>>
> >>>Date: Wednesday, March 16, 2016 at 10:51 PM
> >>>
> >>>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
> >>><de...@opennlp.apache.org>>
> >>>
> >>>Subject: GSOC2016 Sentiment Analysis
> >>>
> >>>
> >>>
> >>>>Hi
> >>>
> >>>>
> >>>
> >>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
> >>>
> >>>>GSOC2016 this time. Since i have been engaging with some similar
> >>>>projects
> >>>
> >>>>i
> >>>
> >>>>think it will be a great experience for me.
> >>>
> >>>>
> >>>
> >>>>I am a final year student in IESL College of Engineering, Sri lanka. I
> >>>
> >>>>have
> >>>
> >>>>learned machine learning and natural language processing stuff when I'm
> >>>
> >>>>doing my first degree (Computer Science) in University of Sri
> >>>
> >>>>Jayewardhenapura.
> >>>
> >>>>
> >>>
> >>>>In my internship period, I have actively contributed to a Twitter based
> >>>
> >>>>NLP
> >>>
> >>>>project. and We have published an article on IEEE Conference,
> >>>>"Real-time
> >>>
> >>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
> >>>
> >>>>
> >>>
> >>>>Please let me know what you think and what you suggest.
> >>>
> >>>>
> >>>
> >>>>Please kindly give me further information on how I could proceed. I
> >>>
> >>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
> >>>>Analysis
> >>>
> >>>>in Twitter: a Pattern-Based Approach"
> >>>
> >>>>[1]
> >>>
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
> >>>i
> >>>r
> >>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
> >>>n
> >>>c
> >>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
> >>>t
> >>>0
> >>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
> >>
> >>
> >>><
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_
> >>>j
> >>>i
> >>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CS
> >>>f
> >>>n
> >>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR
> >>>8
> >>>t
> >>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
> >>>
> >>>>[2]
> >>>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
> >>>x
> >>>p
> >>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
> >>>0
> >>>N
> >>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
> >>>i
> >>>Y
> >>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
> >>><
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org
> >>>_
> >>>x
> >>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgv
> >>>i
> >>>0
> >>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rL
> >>>N
> >>>i
> >>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
> >>>
> >>>>
> >>>
> >>>>Thanks
> >>>
> >>>>Madhawa Gunasekara
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>
> >>
> >>
> >>
> >>
> >>
> >
> >
> >
> >
> >
> >
> >
> >
> >
>
>

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Nishant,

I’m not sure what you are talking about, at all. It’s part of the
engagement process in GSoC to *engage the community*. At Apache
this is done on list.

I’ve been on this list for months and there is about 0..traffic.
Which is not good. Traffic, like this, *is good*. It shows there
is a healthy community that actually discusses things.

Madhawa, you don’t need to take this conversation off list, and
precisely the opposite. The conversation must be kept on list.

Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Nishant Kelkar <ni...@gmail.com>
Date: Sunday, March 27, 2016 at 11:44 AM
To: <de...@opennlp.apache.org>
Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, Harshavardhan
Manjunatha <hm...@usc.edu>, Information and Data Science Group USC List
<ir...@mymaillists.usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>,
"dev@tika.apache.org" <de...@tika.apache.org>
Subject: Re: GSOC2016 Sentiment Analysis

>Hi Madhawa,
>Could you take this discussion off the dev openNLP list for other
>problems concerning logging in, participation, etc. now that you have a
>positive response? In my humble opinion, that would prevent others not
>involved in your discussion from getting email about the topic.
>
>Good luck!
>
>Best Regards,
>Nishant
>
>
>On Sun, Mar 27, 2016 at 6:37 AM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Thanks please can you create a username with no spaces?
>
>Sent from my iPhone
>
>On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara
><ma...@gmail.com>> wrote:
>
>Hi Chris,
>
>Thanks for the reply, I tried to logging to [1], but I couldn't able to
>login into that my username is "Madhawa Gunasekara"
>[1] https://wiki.apache.org/tika/GSoC2016
>
>I have created a jira issue on
>https://issues.apache.org/jira/browse/TIKA-1911
>
>Thanks,
>Madhawa
>
>Madhawa
>
>On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov>>
>wrote:
>Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
>is data related in there that can be used for sentiment analysis :)
>It can be adapted and is being used for that.
>
>Anyways, yes looking forward to the task. Please send in your proposal
>Madhawa.
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>WWW:  http://sunset.usc.edu/~mattmann/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Harshavardhan Manjunatha <hm...@usc.edu>>
>Date: Friday, March 25, 2016 at 2:45 PM
>To: jpluser 
><ch...@jpl.nasa.gov>>
>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
><de...@opennlp.apache.org>>, Information and
>Data Science Group USC List
><ir...@mymaillists.usc.edu>>,
>"kamalaku@usc.edu<ma...@usc.edu>"
><ka...@usc.edu>>,
>"dev@tika.apache.org<ma...@tika.apache.org>"
><de...@tika.apache.org>>
>Subject: Re: GSOC2016 Sentiment Analysis
>
>>Dear Prof. Mattmann,
>>
>>
>>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
>>Translation b/w Spanish & Englosh.
>>
>>
>>I dont think it can be adapted to Sentiment Analysis.
>>
>>
>>Developing a generic training model/corpus for Sentiment Analysis that
>>encapsulates social media, movie reviews, etc, etc will be a Challenging
>>& Exciting Task !!
>>
>>
>>Regards,
>>Harsha
>>
>>
>>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov>>
>>wrote:
>>
>>Sounds great Harsha. This is for Google Summer of Code, so collaborating
>>would be great, and in this case, we would be working with Madhawa,
>>should
>>he choose to accept.
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>Chris Mattmann, Ph.D.
>>Chief Architect
>>Instrument Software and Science Data Systems Section (398)
>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>Office: 168-519, Mailstop: 168-527
>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>WWW:
>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>Director, Information Retrieval and Data Science Group (IRDS)
>>Adjunct Associate Professor, Computer Science Department
>>University of Southern California, Los Angeles, CA 90089 USA
>>WWW: http://irds.usc.edu/
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>-----Original Message-----
>>From: Harshavardhan Manjunatha
>><hm...@usc.edu>>
>>Date: Friday, March 25, 2016 at 2:38 PM
>>To: jpluser 
>><ch...@jpl.nasa.gov>>
>>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>><de...@opennlp.apache.org>>, Information and
>>Data Science Group USC List
>><ir...@mymaillists.usc.edu>>,
>>"kamalaku@usc.edu<ma...@usc.edu>"
>><ka...@usc.edu>>,
>>"dev@tika.apache.org<ma...@tika.apache.org>"
>><de...@tika.apache.org>>
>>Subject: Re: GSOC2016 Sentiment Analysis
>>
>>>Dear Prof. Mattmann,
>>>
>>>
>>>I would love to collaborate on this & am interested in developing
>>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>>>
>>>
>>>I have completed an Applied NLP course @ USC.
>>>
>>>
>>>I have done a Literature Review of Papers & Open Source Tools on the
>>>same
>>>recently.
>>>
>>>
>>>Regards,
>>>Harsha
>>>
>>>
>>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
>>><ch...@jpl.nasa.gov>>
>>>wrote:
>>>
>>>Hi Madhawa,
>>>
>>>
>>>
>>>So, how about a project that develops and contributes an Apache
>>>
>>>Tika and OpenNLP based SentimentAnalysisParser?
>>>
>>>
>>>
>>>I have some students currently doing work using the Fisher Callhome
>>>
>>>Corpus and you can build off that. I am CC’ing my USC IRDS team
>>>
>>>and my student Indhu who is working on this.
>>>
>>>
>>>
>>>Can you start working on your proposal by:
>>>
>>>
>>>
>>>1. Creating a JIRA issue here:
>>>
>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_ji
>>>r
>>>a
>>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l
>>>5
>>>6
>>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1
>>>m
>>>s
>>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>>>
>>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>>>
>>>
>>>
>>>2. Develop a proposal on the Tika wiki here:
>>>
>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika
>>>_
>>>G
>>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6E
>>>U
>>>8
>>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSo
>>>N
>>>h
>>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
>>> (you will need permission, first
>>>
>>>sign up for your account on the wiki then tell me your username so I
>>>
>>>can add permissions for you)
>>>
>>>
>>>
>>>3. Apply through the Google Summer of Code 2016 program.
>>>
>>>
>>>
>
>
>>>4. Get in touch with me, and Indhu, and keep
>>>dev@tika.a.o<ma...@tika.a.o> and
>>>
>>>dev@openlp.a.o<ma...@openlp.a.o> and
>>>irds-L@usc.edu<ma...@usc.edu> in the loop so we can discuss
>>>together
>>>
>>>as a community.
>>>
>>>
>>>
>>>Cool?
>>>
>>>
>>>
>>>Cheers,
>>>
>>>Chris
>>>
>>>
>>>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>Chris Mattmann, Ph.D.
>>>
>>>Chief Architect
>>>
>>>Instrument Software and Science Data Systems Section (398)
>>>
>>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>
>>>Office: 168-519, Mailstop: 168-527
>>>
>>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>>
>>>WWW:
>>
>>
>>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>Director, Information Retrieval and Data Science Group (IRDS)
>>>
>>>Adjunct Associate Professor, Computer Science Department
>>>
>>>University of Southern California, Los Angeles, CA 90089 USA
>>>
>>>WWW: http://irds.usc.edu/
>>>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>-----Original Message-----
>>>
>>>From: Madhawa Kasun Gunasekara
>>><ma...@gmail.com>>
>>>
>>>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>><de...@opennlp.apache.org>>
>>>
>>>Date: Wednesday, March 16, 2016 at 10:51 PM
>>>
>>>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>><de...@opennlp.apache.org>>
>>>
>>>Subject: GSOC2016 Sentiment Analysis
>>>
>>>
>>>
>>>>Hi
>>>
>>>>
>>>
>>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>>>
>>>>GSOC2016 this time. Since i have been engaging with some similar
>>>>projects
>>>
>>>>i
>>>
>>>>think it will be a great experience for me.
>>>
>>>>
>>>
>>>>I am a final year student in IESL College of Engineering, Sri lanka. I
>>>
>>>>have
>>>
>>>>learned machine learning and natural language processing stuff when I'm
>>>
>>>>doing my first degree (Computer Science) in University of Sri
>>>
>>>>Jayewardhenapura.
>>>
>>>>
>>>
>>>>In my internship period, I have actively contributed to a Twitter based
>>>
>>>>NLP
>>>
>>>>project. and We have published an article on IEEE Conference,
>>>>"Real-time
>>>
>>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>>>
>>>>
>>>
>>>>Please let me know what you think and what you suggest.
>>>
>>>>
>>>
>>>>Please kindly give me further information on how I could proceed. I
>>>
>>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
>>>>Analysis
>>>
>>>>in Twitter: a Pattern-Based Approach"
>>>
>>>>[1]
>>>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
>>>i
>>>r
>>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
>>>n
>>>c
>>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
>>>t
>>>0
>>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>>
>>
>>><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_
>>>j
>>>i
>>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CS
>>>f
>>>n
>>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR
>>>8
>>>t
>>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>>>
>>>>[2]
>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
>>>x
>>>p
>>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
>>>0
>>>N
>>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
>>>i
>>>Y
>>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>>><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org
>>>_
>>>x
>>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgv
>>>i
>>>0
>>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rL
>>>N
>>>i
>>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>>>
>>>>
>>>
>>>>Thanks
>>>
>>>>Madhawa Gunasekara
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>
>>
>>
>>
>>
>>
>
>
>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Nishant,

I’m not sure what you are talking about, at all. It’s part of the
engagement process in GSoC to *engage the community*. At Apache
this is done on list.

I’ve been on this list for months and there is about 0..traffic.
Which is not good. Traffic, like this, *is good*. It shows there
is a healthy community that actually discusses things.

Madhawa, you don’t need to take this conversation off list, and
precisely the opposite. The conversation must be kept on list.

Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Nishant Kelkar <ni...@gmail.com>
Date: Sunday, March 27, 2016 at 11:44 AM
To: <de...@opennlp.apache.org>
Cc: Madhawa Kasun Gunasekara <ma...@gmail.com>, Harshavardhan
Manjunatha <hm...@usc.edu>, Information and Data Science Group USC List
<ir...@mymaillists.usc.edu>, "kamalaku@usc.edu" <ka...@usc.edu>,
"dev@tika.apache.org" <de...@tika.apache.org>
Subject: Re: GSOC2016 Sentiment Analysis

>Hi Madhawa,
>Could you take this discussion off the dev openNLP list for other
>problems concerning logging in, participation, etc. now that you have a
>positive response? In my humble opinion, that would prevent others not
>involved in your discussion from getting email about the topic.
>
>Good luck!
>
>Best Regards,
>Nishant
>
>
>On Sun, Mar 27, 2016 at 6:37 AM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Thanks please can you create a username with no spaces?
>
>Sent from my iPhone
>
>On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara
><ma...@gmail.com>> wrote:
>
>Hi Chris,
>
>Thanks for the reply, I tried to logging to [1], but I couldn't able to
>login into that my username is "Madhawa Gunasekara"
>[1] https://wiki.apache.org/tika/GSoC2016
>
>I have created a jira issue on
>https://issues.apache.org/jira/browse/TIKA-1911
>
>Thanks,
>Madhawa
>
>Madhawa
>
>On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov>>
>wrote:
>Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
>is data related in there that can be used for sentiment analysis :)
>It can be adapted and is being used for that.
>
>Anyways, yes looking forward to the task. Please send in your proposal
>Madhawa.
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>WWW:  http://sunset.usc.edu/~mattmann/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Harshavardhan Manjunatha <hm...@usc.edu>>
>Date: Friday, March 25, 2016 at 2:45 PM
>To: jpluser 
><ch...@jpl.nasa.gov>>
>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
><de...@opennlp.apache.org>>, Information and
>Data Science Group USC List
><ir...@mymaillists.usc.edu>>,
>"kamalaku@usc.edu<ma...@usc.edu>"
><ka...@usc.edu>>,
>"dev@tika.apache.org<ma...@tika.apache.org>"
><de...@tika.apache.org>>
>Subject: Re: GSOC2016 Sentiment Analysis
>
>>Dear Prof. Mattmann,
>>
>>
>>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
>>Translation b/w Spanish & Englosh.
>>
>>
>>I dont think it can be adapted to Sentiment Analysis.
>>
>>
>>Developing a generic training model/corpus for Sentiment Analysis that
>>encapsulates social media, movie reviews, etc, etc will be a Challenging
>>& Exciting Task !!
>>
>>
>>Regards,
>>Harsha
>>
>>
>>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov>>
>>wrote:
>>
>>Sounds great Harsha. This is for Google Summer of Code, so collaborating
>>would be great, and in this case, we would be working with Madhawa,
>>should
>>he choose to accept.
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>Chris Mattmann, Ph.D.
>>Chief Architect
>>Instrument Software and Science Data Systems Section (398)
>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>Office: 168-519, Mailstop: 168-527
>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>WWW:
>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>Director, Information Retrieval and Data Science Group (IRDS)
>>Adjunct Associate Professor, Computer Science Department
>>University of Southern California, Los Angeles, CA 90089 USA
>>WWW: http://irds.usc.edu/
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>-----Original Message-----
>>From: Harshavardhan Manjunatha
>><hm...@usc.edu>>
>>Date: Friday, March 25, 2016 at 2:38 PM
>>To: jpluser 
>><ch...@jpl.nasa.gov>>
>>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>><de...@opennlp.apache.org>>, Information and
>>Data Science Group USC List
>><ir...@mymaillists.usc.edu>>,
>>"kamalaku@usc.edu<ma...@usc.edu>"
>><ka...@usc.edu>>,
>>"dev@tika.apache.org<ma...@tika.apache.org>"
>><de...@tika.apache.org>>
>>Subject: Re: GSOC2016 Sentiment Analysis
>>
>>>Dear Prof. Mattmann,
>>>
>>>
>>>I would love to collaborate on this & am interested in developing
>>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>>>
>>>
>>>I have completed an Applied NLP course @ USC.
>>>
>>>
>>>I have done a Literature Review of Papers & Open Source Tools on the
>>>same
>>>recently.
>>>
>>>
>>>Regards,
>>>Harsha
>>>
>>>
>>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
>>><ch...@jpl.nasa.gov>>
>>>wrote:
>>>
>>>Hi Madhawa,
>>>
>>>
>>>
>>>So, how about a project that develops and contributes an Apache
>>>
>>>Tika and OpenNLP based SentimentAnalysisParser?
>>>
>>>
>>>
>>>I have some students currently doing work using the Fisher Callhome
>>>
>>>Corpus and you can build off that. I am CC’ing my USC IRDS team
>>>
>>>and my student Indhu who is working on this.
>>>
>>>
>>>
>>>Can you start working on your proposal by:
>>>
>>>
>>>
>>>1. Creating a JIRA issue here:
>>>
>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_ji
>>>r
>>>a
>>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l
>>>5
>>>6
>>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1
>>>m
>>>s
>>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>>>
>>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>>>
>>>
>>>
>>>2. Develop a proposal on the Tika wiki here:
>>>
>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika
>>>_
>>>G
>>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6E
>>>U
>>>8
>>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSo
>>>N
>>>h
>>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
>>> (you will need permission, first
>>>
>>>sign up for your account on the wiki then tell me your username so I
>>>
>>>can add permissions for you)
>>>
>>>
>>>
>>>3. Apply through the Google Summer of Code 2016 program.
>>>
>>>
>>>
>
>
>>>4. Get in touch with me, and Indhu, and keep
>>>dev@tika.a.o<ma...@tika.a.o> and
>>>
>>>dev@openlp.a.o<ma...@openlp.a.o> and
>>>irds-L@usc.edu<ma...@usc.edu> in the loop so we can discuss
>>>together
>>>
>>>as a community.
>>>
>>>
>>>
>>>Cool?
>>>
>>>
>>>
>>>Cheers,
>>>
>>>Chris
>>>
>>>
>>>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>Chris Mattmann, Ph.D.
>>>
>>>Chief Architect
>>>
>>>Instrument Software and Science Data Systems Section (398)
>>>
>>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>
>>>Office: 168-519, Mailstop: 168-527
>>>
>>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>>
>>>WWW:
>>
>>
>>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>Director, Information Retrieval and Data Science Group (IRDS)
>>>
>>>Adjunct Associate Professor, Computer Science Department
>>>
>>>University of Southern California, Los Angeles, CA 90089 USA
>>>
>>>WWW: http://irds.usc.edu/
>>>
>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>-----Original Message-----
>>>
>>>From: Madhawa Kasun Gunasekara
>>><ma...@gmail.com>>
>>>
>>>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>><de...@opennlp.apache.org>>
>>>
>>>Date: Wednesday, March 16, 2016 at 10:51 PM
>>>
>>>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>"
>>><de...@opennlp.apache.org>>
>>>
>>>Subject: GSOC2016 Sentiment Analysis
>>>
>>>
>>>
>>>>Hi
>>>
>>>>
>>>
>>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>>>
>>>>GSOC2016 this time. Since i have been engaging with some similar
>>>>projects
>>>
>>>>i
>>>
>>>>think it will be a great experience for me.
>>>
>>>>
>>>
>>>>I am a final year student in IESL College of Engineering, Sri lanka. I
>>>
>>>>have
>>>
>>>>learned machine learning and natural language processing stuff when I'm
>>>
>>>>doing my first degree (Computer Science) in University of Sri
>>>
>>>>Jayewardhenapura.
>>>
>>>>
>>>
>>>>In my internship period, I have actively contributed to a Twitter based
>>>
>>>>NLP
>>>
>>>>project. and We have published an article on IEEE Conference,
>>>>"Real-time
>>>
>>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>>>
>>>>
>>>
>>>>Please let me know what you think and what you suggest.
>>>
>>>>
>>>
>>>>Please kindly give me further information on how I could proceed. I
>>>
>>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
>>>>Analysis
>>>
>>>>in Twitter: a Pattern-Based Approach"
>>>
>>>>[1]
>>>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
>>>i
>>>r
>>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
>>>n
>>>c
>>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
>>>t
>>>0
>>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>>
>>
>>><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_
>>>j
>>>i
>>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CS
>>>f
>>>n
>>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR
>>>8
>>>t
>>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>>>
>>>>[2]
>>>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
>>>x
>>>p
>>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
>>>0
>>>N
>>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
>>>i
>>>Y
>>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>>><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org
>>>_
>>>x
>>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgv
>>>i
>>>0
>>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rL
>>>N
>>>i
>>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>>>
>>>>
>>>
>>>>Thanks
>>>
>>>>Madhawa Gunasekara
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>
>>
>>
>>
>>
>>
>
>
>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by Nishant Kelkar <ni...@gmail.com>.
Hi Madhawa,

Could you take this discussion off the dev openNLP list for other problems
concerning logging in, participation, etc. now that you have a positive
response? In my humble opinion, that would prevent others not involved in
your discussion from getting email about the topic.

Good luck!

Best Regards,
Nishant

On Sun, Mar 27, 2016 at 6:37 AM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Thanks please can you create a username with no spaces?
>
> Sent from my iPhone
>
> On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara <madhawa30@gmail.com
> <ma...@gmail.com>> wrote:
>
> Hi Chris,
>
> Thanks for the reply, I tried to logging to [1], but I couldn't able to
> login into that my username is "Madhawa Gunasekara"
> [1] https://wiki.apache.org/tika/GSoC2016
>
> I have created a jira issue on
> https://issues.apache.org/jira/browse/TIKA-1911
>
> Thanks,
> Madhawa
>
> Madhawa
>
> On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980) <
> chris.a.mattmann@jpl.nasa.gov<ma...@jpl.nasa.gov>>
> wrote:
> Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
> is data related in there that can be used for sentiment analysis :)
> It can be adapted and is being used for that.
>
> Anyways, yes looking forward to the task. Please send in your proposal
> Madhawa.
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Harshavardhan Manjunatha <hm...@usc.edu>>
> Date: Friday, March 25, 2016 at 2:45 PM
> To: jpluser <chris.a.mattmann@jpl.nasa.gov<mailto:
> chris.a.mattmann@jpl.nasa.gov>>
> Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <
> dev@opennlp.apache.org<ma...@opennlp.apache.org>>, Information and
> Data Science Group USC List <irds-L@mymaillists.usc.edu<mailto:
> irds-L@mymaillists.usc.edu>>,
> "kamalaku@usc.edu<ma...@usc.edu>" <kamalaku@usc.edu<mailto:
> kamalaku@usc.edu>>, "dev@tika.apache.org<ma...@tika.apache.org>"
> <de...@tika.apache.org>>
> Subject: Re: GSOC2016 Sentiment Analysis
>
> >Dear Prof. Mattmann,
> >
> >
> >Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
> >Translation b/w Spanish & Englosh.
> >
> >
> >I dont think it can be adapted to Sentiment Analysis.
> >
> >
> >Developing a generic training model/corpus for Sentiment Analysis that
> >encapsulates social media, movie reviews, etc, etc will be a Challenging
> >& Exciting Task !!
> >
> >
> >Regards,
> >Harsha
> >
> >
> >On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov>>
> wrote:
> >
> >Sounds great Harsha. This is for Google Summer of Code, so collaborating
> >would be great, and in this case, we would be working with Madhawa, should
> >he choose to accept.
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Chris Mattmann, Ph.D.
> >Chief Architect
> >Instrument Software and Science Data Systems Section (398)
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >Office: 168-519, Mailstop: 168-527
> >Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >WWW:
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Director, Information Retrieval and Data Science Group (IRDS)
> >Adjunct Associate Professor, Computer Science Department
> >University of Southern California, Los Angeles, CA 90089 USA
> >WWW: http://irds.usc.edu/
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> >
> >
> >-----Original Message-----
> >From: Harshavardhan Manjunatha <hmanjuna@usc.edu<mailto:hmanjuna@usc.edu
> >>
> >Date: Friday, March 25, 2016 at 2:38 PM
> >To: jpluser <chris.a.mattmann@jpl.nasa.gov<mailto:
> chris.a.mattmann@jpl.nasa.gov>>
> >Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <
> dev@opennlp.apache.org<ma...@opennlp.apache.org>>, Information and
> >Data Science Group USC List <irds-L@mymaillists.usc.edu<mailto:
> irds-L@mymaillists.usc.edu>>,
> >"kamalaku@usc.edu<ma...@usc.edu>" <kamalaku@usc.edu<mailto:
> kamalaku@usc.edu>>, "dev@tika.apache.org<ma...@tika.apache.org>"
> ><de...@tika.apache.org>>
> >Subject: Re: GSOC2016 Sentiment Analysis
> >
> >>Dear Prof. Mattmann,
> >>
> >>
> >>I would love to collaborate on this & am interested in developing
> >>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
> >>
> >>
> >>I have completed an Applied NLP course @ USC.
> >>
> >>
> >>I have done a Literature Review of Papers & Open Source Tools on the same
> >>recently.
> >>
> >>
> >>Regards,
> >>Harsha
> >>
> >>
> >>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
> >><ch...@jpl.nasa.gov>>
> wrote:
> >>
> >>Hi Madhawa,
> >>
> >>
> >>
> >>So, how about a project that develops and contributes an Apache
> >>
> >>Tika and OpenNLP based SentimentAnalysisParser?
> >>
> >>
> >>
> >>I have some students currently doing work using the Fisher Callhome
> >>
> >>Corpus and you can build off that. I am CC’ing my USC IRDS team
> >>
> >>and my student Indhu who is working on this.
> >>
> >>
> >>
> >>Can you start working on your proposal by:
> >>
> >>
> >>
> >>1. Creating a JIRA issue here:
> >>
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jir
> >>a
> >>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l5
> >>6
> >>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1m
> >>s
> >>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
> >>
> >> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
> >>
> >>
> >>
> >>2. Develop a proposal on the Tika wiki here:
> >>
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_
> >>G
> >>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU
> >>8
> >>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoN
> >>h
> >>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> >> (you will need permission, first
> >>
> >>sign up for your account on the wiki then tell me your username so I
> >>
> >>can add permissions for you)
> >>
> >>
> >>
> >>3. Apply through the Google Summer of Code 2016 program.
> >>
> >>
> >>
> >>4. Get in touch with me, and Indhu, and keep dev@tika.a.o<mailto:
> dev@tika.a.o> and
> >>
> >>dev@openlp.a.o<ma...@openlp.a.o> and irds-L@usc.edu<mailto:
> irds-L@usc.edu> in the loop so we can discuss together
> >>
> >>as a community.
> >>
> >>
> >>
> >>Cool?
> >>
> >>
> >>
> >>Cheers,
> >>
> >>Chris
> >>
> >>
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>Chris Mattmann, Ph.D.
> >>
> >>Chief Architect
> >>
> >>Instrument Software and Science Data Systems Section (398)
> >>
> >>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>
> >>Office: 168-519, Mailstop: 168-527
> >>
> >>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >>
> >>WWW:
> >
> >
> >>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>Director, Information Retrieval and Data Science Group (IRDS)
> >>
> >>Adjunct Associate Professor, Computer Science Department
> >>
> >>University of Southern California, Los Angeles, CA 90089 USA
> >>
> >>WWW: http://irds.usc.edu/
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>-----Original Message-----
> >>
> >>From: Madhawa Kasun Gunasekara <madhawa30@gmail.com<mailto:
> madhawa30@gmail.com>>
> >>
> >>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <
> dev@opennlp.apache.org<ma...@opennlp.apache.org>>
> >>
> >>Date: Wednesday, March 16, 2016 at 10:51 PM
> >>
> >>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <
> dev@opennlp.apache.org<ma...@opennlp.apache.org>>
> >>
> >>Subject: GSOC2016 Sentiment Analysis
> >>
> >>
> >>
> >>>Hi
> >>
> >>>
> >>
> >>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
> >>
> >>>GSOC2016 this time. Since i have been engaging with some similar
> >>>projects
> >>
> >>>i
> >>
> >>>think it will be a great experience for me.
> >>
> >>>
> >>
> >>>I am a final year student in IESL College of Engineering, Sri lanka. I
> >>
> >>>have
> >>
> >>>learned machine learning and natural language processing stuff when I'm
> >>
> >>>doing my first degree (Computer Science) in University of Sri
> >>
> >>>Jayewardhenapura.
> >>
> >>>
> >>
> >>>In my internship period, I have actively contributed to a Twitter based
> >>
> >>>NLP
> >>
> >>>project. and We have published an article on IEEE Conference, "Real-time
> >>
> >>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
> >>
> >>>
> >>
> >>>Please let me know what you think and what you suggest.
> >>
> >>>
> >>
> >>>Please kindly give me further information on how I could proceed. I
> >>
> >>>couldn't able to find the mentioned paper "Multi-Class Sentiment
> >>>Analysis
> >>
> >>>in Twitter: a Pattern-Based Approach"
> >>
> >>>[1]
> >>
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
> >>r
> >>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
> >>c
> >>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
> >>0
> >>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
> >
> >
> >><
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
> >>i
> >>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
> >>n
> >>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
> >>t
> >>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
> >>
> >>>[2]
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
> >>p
> >>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
> >>N
> >>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
> >>Y
> >>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
> >><
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
> >>x
> >>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
> >>0
> >>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
> >>i
> >>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
> >>
> >>>
> >>
> >>>Thanks
> >>
> >>>Madhawa Gunasekara
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >
> >
> >
> >
> >
> >
>
>
>

Re: GSOC2016 Sentiment Analysis

Posted by Nishant Kelkar <ni...@gmail.com>.
Hi Madhawa,

Could you take this discussion off the dev openNLP list for other problems
concerning logging in, participation, etc. now that you have a positive
response? In my humble opinion, that would prevent others not involved in
your discussion from getting email about the topic.

Good luck!

Best Regards,
Nishant

On Sun, Mar 27, 2016 at 6:37 AM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Thanks please can you create a username with no spaces?
>
> Sent from my iPhone
>
> On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara <madhawa30@gmail.com
> <ma...@gmail.com>> wrote:
>
> Hi Chris,
>
> Thanks for the reply, I tried to logging to [1], but I couldn't able to
> login into that my username is "Madhawa Gunasekara"
> [1] https://wiki.apache.org/tika/GSoC2016
>
> I have created a jira issue on
> https://issues.apache.org/jira/browse/TIKA-1911
>
> Thanks,
> Madhawa
>
> Madhawa
>
> On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980) <
> chris.a.mattmann@jpl.nasa.gov<ma...@jpl.nasa.gov>>
> wrote:
> Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
> is data related in there that can be used for sentiment analysis :)
> It can be adapted and is being used for that.
>
> Anyways, yes looking forward to the task. Please send in your proposal
> Madhawa.
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Harshavardhan Manjunatha <hm...@usc.edu>>
> Date: Friday, March 25, 2016 at 2:45 PM
> To: jpluser <chris.a.mattmann@jpl.nasa.gov<mailto:
> chris.a.mattmann@jpl.nasa.gov>>
> Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <
> dev@opennlp.apache.org<ma...@opennlp.apache.org>>, Information and
> Data Science Group USC List <irds-L@mymaillists.usc.edu<mailto:
> irds-L@mymaillists.usc.edu>>,
> "kamalaku@usc.edu<ma...@usc.edu>" <kamalaku@usc.edu<mailto:
> kamalaku@usc.edu>>, "dev@tika.apache.org<ma...@tika.apache.org>"
> <de...@tika.apache.org>>
> Subject: Re: GSOC2016 Sentiment Analysis
>
> >Dear Prof. Mattmann,
> >
> >
> >Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
> >Translation b/w Spanish & Englosh.
> >
> >
> >I dont think it can be adapted to Sentiment Analysis.
> >
> >
> >Developing a generic training model/corpus for Sentiment Analysis that
> >encapsulates social media, movie reviews, etc, etc will be a Challenging
> >& Exciting Task !!
> >
> >
> >Regards,
> >Harsha
> >
> >
> >On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov>>
> wrote:
> >
> >Sounds great Harsha. This is for Google Summer of Code, so collaborating
> >would be great, and in this case, we would be working with Madhawa, should
> >he choose to accept.
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Chris Mattmann, Ph.D.
> >Chief Architect
> >Instrument Software and Science Data Systems Section (398)
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >Office: 168-519, Mailstop: 168-527
> >Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >WWW:
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Director, Information Retrieval and Data Science Group (IRDS)
> >Adjunct Associate Professor, Computer Science Department
> >University of Southern California, Los Angeles, CA 90089 USA
> >WWW: http://irds.usc.edu/
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> >
> >
> >-----Original Message-----
> >From: Harshavardhan Manjunatha <hmanjuna@usc.edu<mailto:hmanjuna@usc.edu
> >>
> >Date: Friday, March 25, 2016 at 2:38 PM
> >To: jpluser <chris.a.mattmann@jpl.nasa.gov<mailto:
> chris.a.mattmann@jpl.nasa.gov>>
> >Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <
> dev@opennlp.apache.org<ma...@opennlp.apache.org>>, Information and
> >Data Science Group USC List <irds-L@mymaillists.usc.edu<mailto:
> irds-L@mymaillists.usc.edu>>,
> >"kamalaku@usc.edu<ma...@usc.edu>" <kamalaku@usc.edu<mailto:
> kamalaku@usc.edu>>, "dev@tika.apache.org<ma...@tika.apache.org>"
> ><de...@tika.apache.org>>
> >Subject: Re: GSOC2016 Sentiment Analysis
> >
> >>Dear Prof. Mattmann,
> >>
> >>
> >>I would love to collaborate on this & am interested in developing
> >>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
> >>
> >>
> >>I have completed an Applied NLP course @ USC.
> >>
> >>
> >>I have done a Literature Review of Papers & Open Source Tools on the same
> >>recently.
> >>
> >>
> >>Regards,
> >>Harsha
> >>
> >>
> >>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
> >><ch...@jpl.nasa.gov>>
> wrote:
> >>
> >>Hi Madhawa,
> >>
> >>
> >>
> >>So, how about a project that develops and contributes an Apache
> >>
> >>Tika and OpenNLP based SentimentAnalysisParser?
> >>
> >>
> >>
> >>I have some students currently doing work using the Fisher Callhome
> >>
> >>Corpus and you can build off that. I am CC’ing my USC IRDS team
> >>
> >>and my student Indhu who is working on this.
> >>
> >>
> >>
> >>Can you start working on your proposal by:
> >>
> >>
> >>
> >>1. Creating a JIRA issue here:
> >>
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jir
> >>a
> >>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l5
> >>6
> >>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1m
> >>s
> >>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
> >>
> >> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
> >>
> >>
> >>
> >>2. Develop a proposal on the Tika wiki here:
> >>
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_
> >>G
> >>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU
> >>8
> >>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoN
> >>h
> >>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> >> (you will need permission, first
> >>
> >>sign up for your account on the wiki then tell me your username so I
> >>
> >>can add permissions for you)
> >>
> >>
> >>
> >>3. Apply through the Google Summer of Code 2016 program.
> >>
> >>
> >>
> >>4. Get in touch with me, and Indhu, and keep dev@tika.a.o<mailto:
> dev@tika.a.o> and
> >>
> >>dev@openlp.a.o<ma...@openlp.a.o> and irds-L@usc.edu<mailto:
> irds-L@usc.edu> in the loop so we can discuss together
> >>
> >>as a community.
> >>
> >>
> >>
> >>Cool?
> >>
> >>
> >>
> >>Cheers,
> >>
> >>Chris
> >>
> >>
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>Chris Mattmann, Ph.D.
> >>
> >>Chief Architect
> >>
> >>Instrument Software and Science Data Systems Section (398)
> >>
> >>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>
> >>Office: 168-519, Mailstop: 168-527
> >>
> >>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
> >>
> >>WWW:
> >
> >
> >>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>Director, Information Retrieval and Data Science Group (IRDS)
> >>
> >>Adjunct Associate Professor, Computer Science Department
> >>
> >>University of Southern California, Los Angeles, CA 90089 USA
> >>
> >>WWW: http://irds.usc.edu/
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>-----Original Message-----
> >>
> >>From: Madhawa Kasun Gunasekara <madhawa30@gmail.com<mailto:
> madhawa30@gmail.com>>
> >>
> >>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <
> dev@opennlp.apache.org<ma...@opennlp.apache.org>>
> >>
> >>Date: Wednesday, March 16, 2016 at 10:51 PM
> >>
> >>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <
> dev@opennlp.apache.org<ma...@opennlp.apache.org>>
> >>
> >>Subject: GSOC2016 Sentiment Analysis
> >>
> >>
> >>
> >>>Hi
> >>
> >>>
> >>
> >>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
> >>
> >>>GSOC2016 this time. Since i have been engaging with some similar
> >>>projects
> >>
> >>>i
> >>
> >>>think it will be a great experience for me.
> >>
> >>>
> >>
> >>>I am a final year student in IESL College of Engineering, Sri lanka. I
> >>
> >>>have
> >>
> >>>learned machine learning and natural language processing stuff when I'm
> >>
> >>>doing my first degree (Computer Science) in University of Sri
> >>
> >>>Jayewardhenapura.
> >>
> >>>
> >>
> >>>In my internship period, I have actively contributed to a Twitter based
> >>
> >>>NLP
> >>
> >>>project. and We have published an article on IEEE Conference, "Real-time
> >>
> >>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
> >>
> >>>
> >>
> >>>Please let me know what you think and what you suggest.
> >>
> >>>
> >>
> >>>Please kindly give me further information on how I could proceed. I
> >>
> >>>couldn't able to find the mentioned paper "Multi-Class Sentiment
> >>>Analysis
> >>
> >>>in Twitter: a Pattern-Based Approach"
> >>
> >>>[1]
> >>
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
> >>r
> >>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
> >>c
> >>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
> >>0
> >>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
> >
> >
> >><
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
> >>i
> >>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
> >>n
> >>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
> >>t
> >>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
> >>
> >>>[2]
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
> >>p
> >>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
> >>N
> >>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
> >>Y
> >>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
> >><
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
> >>x
> >>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
> >>0
> >>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
> >>i
> >>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
> >>
> >>>
> >>
> >>>Thanks
> >>
> >>>Madhawa Gunasekara
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >
> >
> >
> >
> >
> >
>
>
>

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Thanks please can you create a username with no spaces?

Sent from my iPhone

On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara <ma...@gmail.com>> wrote:

Hi Chris,

Thanks for the reply, I tried to logging to [1], but I couldn't able to login into that my username is "Madhawa Gunasekara"
[1] https://wiki.apache.org/tika/GSoC2016

I have created a jira issue on https://issues.apache.org/jira/browse/TIKA-1911

Thanks,
Madhawa

Madhawa

On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980) <ch...@jpl.nasa.gov>> wrote:
Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
is data related in there that can be used for sentiment analysis :)
It can be adapted and is being used for that.

Anyways, yes looking forward to the task. Please send in your proposal
Madhawa.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Harshavardhan Manjunatha <hm...@usc.edu>>
Date: Friday, March 25, 2016 at 2:45 PM
To: jpluser <ch...@jpl.nasa.gov>>
Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <de...@opennlp.apache.org>>, Information and
Data Science Group USC List <ir...@mymaillists.usc.edu>>,
"kamalaku@usc.edu<ma...@usc.edu>" <ka...@usc.edu>>, "dev@tika.apache.org<ma...@tika.apache.org>"
<de...@tika.apache.org>>
Subject: Re: GSOC2016 Sentiment Analysis

>Dear Prof. Mattmann,
>
>
>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
>Translation b/w Spanish & Englosh.
>
>
>I dont think it can be adapted to Sentiment Analysis.
>
>
>Developing a generic training model/corpus for Sentiment Analysis that
>encapsulates social media, movie reviews, etc, etc will be a Challenging
>& Exciting Task !!
>
>
>Regards,
>Harsha
>
>
>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov>> wrote:
>
>Sounds great Harsha. This is for Google Summer of Code, so collaborating
>would be great, and in this case, we would be working with Madhawa, should
>he choose to accept.
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>WWW:
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Harshavardhan Manjunatha <hm...@usc.edu>>
>Date: Friday, March 25, 2016 at 2:38 PM
>To: jpluser <ch...@jpl.nasa.gov>>
>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <de...@opennlp.apache.org>>, Information and
>Data Science Group USC List <ir...@mymaillists.usc.edu>>,
>"kamalaku@usc.edu<ma...@usc.edu>" <ka...@usc.edu>>, "dev@tika.apache.org<ma...@tika.apache.org>"
><de...@tika.apache.org>>
>Subject: Re: GSOC2016 Sentiment Analysis
>
>>Dear Prof. Mattmann,
>>
>>
>>I would love to collaborate on this & am interested in developing
>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>>
>>
>>I have completed an Applied NLP course @ USC.
>>
>>
>>I have done a Literature Review of Papers & Open Source Tools on the same
>>recently.
>>
>>
>>Regards,
>>Harsha
>>
>>
>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov>> wrote:
>>
>>Hi Madhawa,
>>
>>
>>
>>So, how about a project that develops and contributes an Apache
>>
>>Tika and OpenNLP based SentimentAnalysisParser?
>>
>>
>>
>>I have some students currently doing work using the Fisher Callhome
>>
>>Corpus and you can build off that. I am CC’ing my USC IRDS team
>>
>>and my student Indhu who is working on this.
>>
>>
>>
>>Can you start working on your proposal by:
>>
>>
>>
>>1. Creating a JIRA issue here:
>>
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jir
>>a
>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l5
>>6
>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1m
>>s
>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>>
>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>>
>>
>>
>>2. Develop a proposal on the Tika wiki here:
>>
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_
>>G
>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU
>>8
>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoN
>>h
>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
>> (you will need permission, first
>>
>>sign up for your account on the wiki then tell me your username so I
>>
>>can add permissions for you)
>>
>>
>>
>>3. Apply through the Google Summer of Code 2016 program.
>>
>>
>>
>>4. Get in touch with me, and Indhu, and keep dev@tika.a.o<ma...@tika.a.o> and
>>
>>dev@openlp.a.o<ma...@openlp.a.o> and irds-L@usc.edu<ma...@usc.edu> in the loop so we can discuss together
>>
>>as a community.
>>
>>
>>
>>Cool?
>>
>>
>>
>>Cheers,
>>
>>Chris
>>
>>
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>Chris Mattmann, Ph.D.
>>
>>Chief Architect
>>
>>Instrument Software and Science Data Systems Section (398)
>>
>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>
>>Office: 168-519, Mailstop: 168-527
>>
>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>
>>WWW:
>
>
>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>Director, Information Retrieval and Data Science Group (IRDS)
>>
>>Adjunct Associate Professor, Computer Science Department
>>
>>University of Southern California, Los Angeles, CA 90089 USA
>>
>>WWW: http://irds.usc.edu/
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>-----Original Message-----
>>
>>From: Madhawa Kasun Gunasekara <ma...@gmail.com>>
>>
>>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <de...@opennlp.apache.org>>
>>
>>Date: Wednesday, March 16, 2016 at 10:51 PM
>>
>>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <de...@opennlp.apache.org>>
>>
>>Subject: GSOC2016 Sentiment Analysis
>>
>>
>>
>>>Hi
>>
>>>
>>
>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>>
>>>GSOC2016 this time. Since i have been engaging with some similar
>>>projects
>>
>>>i
>>
>>>think it will be a great experience for me.
>>
>>>
>>
>>>I am a final year student in IESL College of Engineering, Sri lanka. I
>>
>>>have
>>
>>>learned machine learning and natural language processing stuff when I'm
>>
>>>doing my first degree (Computer Science) in University of Sri
>>
>>>Jayewardhenapura.
>>
>>>
>>
>>>In my internship period, I have actively contributed to a Twitter based
>>
>>>NLP
>>
>>>project. and We have published an article on IEEE Conference, "Real-time
>>
>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>>
>>>
>>
>>>Please let me know what you think and what you suggest.
>>
>>>
>>
>>>Please kindly give me further information on how I could proceed. I
>>
>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
>>>Analysis
>>
>>>in Twitter: a Pattern-Based Approach"
>>
>>>[1]
>>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
>>r
>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
>>c
>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
>>0
>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>
>
>><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
>>i
>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
>>n
>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
>>t
>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>>
>>>[2]
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
>>p
>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
>>N
>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
>>Y
>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
>>x
>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
>>0
>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
>>i
>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>>
>>>
>>
>>>Thanks
>>
>>>Madhawa Gunasekara
>>
>>
>>
>>
>>
>>
>>
>>
>
>
>
>
>
>



Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Thanks please can you create a username with no spaces?

Sent from my iPhone

On Mar 27, 2016, at 2:20 AM, Madhawa Kasun Gunasekara <ma...@gmail.com>> wrote:

Hi Chris,

Thanks for the reply, I tried to logging to [1], but I couldn't able to login into that my username is "Madhawa Gunasekara"
[1] https://wiki.apache.org/tika/GSoC2016

I have created a jira issue on https://issues.apache.org/jira/browse/TIKA-1911

Thanks,
Madhawa

Madhawa

On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980) <ch...@jpl.nasa.gov>> wrote:
Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
is data related in there that can be used for sentiment analysis :)
It can be adapted and is being used for that.

Anyways, yes looking forward to the task. Please send in your proposal
Madhawa.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Harshavardhan Manjunatha <hm...@usc.edu>>
Date: Friday, March 25, 2016 at 2:45 PM
To: jpluser <ch...@jpl.nasa.gov>>
Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <de...@opennlp.apache.org>>, Information and
Data Science Group USC List <ir...@mymaillists.usc.edu>>,
"kamalaku@usc.edu<ma...@usc.edu>" <ka...@usc.edu>>, "dev@tika.apache.org<ma...@tika.apache.org>"
<de...@tika.apache.org>>
Subject: Re: GSOC2016 Sentiment Analysis

>Dear Prof. Mattmann,
>
>
>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
>Translation b/w Spanish & Englosh.
>
>
>I dont think it can be adapted to Sentiment Analysis.
>
>
>Developing a generic training model/corpus for Sentiment Analysis that
>encapsulates social media, movie reviews, etc, etc will be a Challenging
>& Exciting Task !!
>
>
>Regards,
>Harsha
>
>
>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov>> wrote:
>
>Sounds great Harsha. This is for Google Summer of Code, so collaborating
>would be great, and in this case, we would be working with Madhawa, should
>he choose to accept.
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>WWW:
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Harshavardhan Manjunatha <hm...@usc.edu>>
>Date: Friday, March 25, 2016 at 2:38 PM
>To: jpluser <ch...@jpl.nasa.gov>>
>Cc: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <de...@opennlp.apache.org>>, Information and
>Data Science Group USC List <ir...@mymaillists.usc.edu>>,
>"kamalaku@usc.edu<ma...@usc.edu>" <ka...@usc.edu>>, "dev@tika.apache.org<ma...@tika.apache.org>"
><de...@tika.apache.org>>
>Subject: Re: GSOC2016 Sentiment Analysis
>
>>Dear Prof. Mattmann,
>>
>>
>>I would love to collaborate on this & am interested in developing
>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>>
>>
>>I have completed an Applied NLP course @ USC.
>>
>>
>>I have done a Literature Review of Papers & Open Source Tools on the same
>>recently.
>>
>>
>>Regards,
>>Harsha
>>
>>
>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov>> wrote:
>>
>>Hi Madhawa,
>>
>>
>>
>>So, how about a project that develops and contributes an Apache
>>
>>Tika and OpenNLP based SentimentAnalysisParser?
>>
>>
>>
>>I have some students currently doing work using the Fisher Callhome
>>
>>Corpus and you can build off that. I am CC’ing my USC IRDS team
>>
>>and my student Indhu who is working on this.
>>
>>
>>
>>Can you start working on your proposal by:
>>
>>
>>
>>1. Creating a JIRA issue here:
>>
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jir
>>a
>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l5
>>6
>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1m
>>s
>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>>
>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>>
>>
>>
>>2. Develop a proposal on the Tika wiki here:
>>
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_
>>G
>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU
>>8
>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoN
>>h
>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
>> (you will need permission, first
>>
>>sign up for your account on the wiki then tell me your username so I
>>
>>can add permissions for you)
>>
>>
>>
>>3. Apply through the Google Summer of Code 2016 program.
>>
>>
>>
>>4. Get in touch with me, and Indhu, and keep dev@tika.a.o<ma...@tika.a.o> and
>>
>>dev@openlp.a.o<ma...@openlp.a.o> and irds-L@usc.edu<ma...@usc.edu> in the loop so we can discuss together
>>
>>as a community.
>>
>>
>>
>>Cool?
>>
>>
>>
>>Cheers,
>>
>>Chris
>>
>>
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>Chris Mattmann, Ph.D.
>>
>>Chief Architect
>>
>>Instrument Software and Science Data Systems Section (398)
>>
>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>
>>Office: 168-519, Mailstop: 168-527
>>
>>Email: chris.a.mattmann@nasa.gov<ma...@nasa.gov>
>>
>>WWW:
>
>
>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>Director, Information Retrieval and Data Science Group (IRDS)
>>
>>Adjunct Associate Professor, Computer Science Department
>>
>>University of Southern California, Los Angeles, CA 90089 USA
>>
>>WWW: http://irds.usc.edu/
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>-----Original Message-----
>>
>>From: Madhawa Kasun Gunasekara <ma...@gmail.com>>
>>
>>Reply-To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <de...@opennlp.apache.org>>
>>
>>Date: Wednesday, March 16, 2016 at 10:51 PM
>>
>>To: "dev@opennlp.apache.org<ma...@opennlp.apache.org>" <de...@opennlp.apache.org>>
>>
>>Subject: GSOC2016 Sentiment Analysis
>>
>>
>>
>>>Hi
>>
>>>
>>
>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>>
>>>GSOC2016 this time. Since i have been engaging with some similar
>>>projects
>>
>>>i
>>
>>>think it will be a great experience for me.
>>
>>>
>>
>>>I am a final year student in IESL College of Engineering, Sri lanka. I
>>
>>>have
>>
>>>learned machine learning and natural language processing stuff when I'm
>>
>>>doing my first degree (Computer Science) in University of Sri
>>
>>>Jayewardhenapura.
>>
>>>
>>
>>>In my internship period, I have actively contributed to a Twitter based
>>
>>>NLP
>>
>>>project. and We have published an article on IEEE Conference, "Real-time
>>
>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>>
>>>
>>
>>>Please let me know what you think and what you suggest.
>>
>>>
>>
>>>Please kindly give me further information on how I could proceed. I
>>
>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
>>>Analysis
>>
>>>in Twitter: a Pattern-Based Approach"
>>
>>>[1]
>>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
>>r
>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
>>c
>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
>>0
>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>
>
>><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
>>i
>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
>>n
>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
>>t
>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>>
>>>[2]
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
>>p
>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
>>N
>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
>>Y
>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
>>x
>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
>>0
>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
>>i
>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>>
>>>
>>
>>>Thanks
>>
>>>Madhawa Gunasekara
>>
>>
>>
>>
>>
>>
>>
>>
>
>
>
>
>
>



Re: GSOC2016 Sentiment Analysis

Posted by Madhawa Kasun Gunasekara <ma...@gmail.com>.
Hi Chris,

Thanks for the reply, I tried to logging to [1], but I couldn't able to
login into that my username is "Madhawa Gunasekara"
[1] https://wiki.apache.org/tika/GSoC2016

I have created a jira issue on
https://issues.apache.org/jira/browse/TIKA-1911

Thanks,
Madhawa

Madhawa

On Sat, Mar 26, 2016 at 3:21 AM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
> is data related in there that can be used for sentiment analysis :)
> It can be adapted and is being used for that.
>
> Anyways, yes looking forward to the task. Please send in your proposal
> Madhawa.
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Harshavardhan Manjunatha <hm...@usc.edu>
> Date: Friday, March 25, 2016 at 2:45 PM
> To: jpluser <ch...@jpl.nasa.gov>
> Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
> Data Science Group USC List <ir...@mymaillists.usc.edu>,
> "kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
> <de...@tika.apache.org>
> Subject: Re: GSOC2016 Sentiment Analysis
>
> >Dear Prof. Mattmann,
> >
> >
> >Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
> >Translation b/w Spanish & Englosh.
> >
> >
> >I dont think it can be adapted to Sentiment Analysis.
> >
> >
> >Developing a generic training model/corpus for Sentiment Analysis that
> >encapsulates social media, movie reviews, etc, etc will be a Challenging
> >& Exciting Task !!
> >
> >
> >Regards,
> >Harsha
> >
> >
> >On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov> wrote:
> >
> >Sounds great Harsha. This is for Google Summer of Code, so collaborating
> >would be great, and in this case, we would be working with Madhawa, should
> >he choose to accept.
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Chris Mattmann, Ph.D.
> >Chief Architect
> >Instrument Software and Science Data Systems Section (398)
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >Office: 168-519, Mailstop: 168-527
> >Email: chris.a.mattmann@nasa.gov
> >WWW:
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >Director, Information Retrieval and Data Science Group (IRDS)
> >Adjunct Associate Professor, Computer Science Department
> >University of Southern California, Los Angeles, CA 90089 USA
> >WWW: http://irds.usc.edu/
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> >
> >
> >-----Original Message-----
> >From: Harshavardhan Manjunatha <hm...@usc.edu>
> >Date: Friday, March 25, 2016 at 2:38 PM
> >To: jpluser <ch...@jpl.nasa.gov>
> >Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
> >Data Science Group USC List <ir...@mymaillists.usc.edu>,
> >"kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
> ><de...@tika.apache.org>
> >Subject: Re: GSOC2016 Sentiment Analysis
> >
> >>Dear Prof. Mattmann,
> >>
> >>
> >>I would love to collaborate on this & am interested in developing
> >>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
> >>
> >>
> >>I have completed an Applied NLP course @ USC.
> >>
> >>
> >>I have done a Literature Review of Papers & Open Source Tools on the same
> >>recently.
> >>
> >>
> >>Regards,
> >>Harsha
> >>
> >>
> >>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
> >><ch...@jpl.nasa.gov> wrote:
> >>
> >>Hi Madhawa,
> >>
> >>
> >>
> >>So, how about a project that develops and contributes an Apache
> >>
> >>Tika and OpenNLP based SentimentAnalysisParser?
> >>
> >>
> >>
> >>I have some students currently doing work using the Fisher Callhome
> >>
> >>Corpus and you can build off that. I am CC’ing my USC IRDS team
> >>
> >>and my student Indhu who is working on this.
> >>
> >>
> >>
> >>Can you start working on your proposal by:
> >>
> >>
> >>
> >>1. Creating a JIRA issue here:
> >>
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jir
> >>a
> >>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l5
> >>6
> >>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1m
> >>s
> >>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
> >>
> >> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
> >>
> >>
> >>
> >>2. Develop a proposal on the Tika wiki here:
> >>
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_
> >>G
> >>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU
> >>8
> >>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoN
> >>h
> >>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> >> (you will need permission, first
> >>
> >>sign up for your account on the wiki then tell me your username so I
> >>
> >>can add permissions for you)
> >>
> >>
> >>
> >>3. Apply through the Google Summer of Code 2016 program.
> >>
> >>
> >>
> >>4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
> >>
> >>dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
> >>
> >>as a community.
> >>
> >>
> >>
> >>Cool?
> >>
> >>
> >>
> >>Cheers,
> >>
> >>Chris
> >>
> >>
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>Chris Mattmann, Ph.D.
> >>
> >>Chief Architect
> >>
> >>Instrument Software and Science Data Systems Section (398)
> >>
> >>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>
> >>Office: 168-519, Mailstop: 168-527
> >>
> >>Email: chris.a.mattmann@nasa.gov
> >>
> >>WWW:
> >
> >
> >>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>Director, Information Retrieval and Data Science Group (IRDS)
> >>
> >>Adjunct Associate Professor, Computer Science Department
> >>
> >>University of Southern California, Los Angeles, CA 90089 USA
> >>
> >>WWW: http://irds.usc.edu/
> >>
> >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>-----Original Message-----
> >>
> >>From: Madhawa Kasun Gunasekara <ma...@gmail.com>
> >>
> >>Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
> >>
> >>Date: Wednesday, March 16, 2016 at 10:51 PM
> >>
> >>To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
> >>
> >>Subject: GSOC2016 Sentiment Analysis
> >>
> >>
> >>
> >>>Hi
> >>
> >>>
> >>
> >>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
> >>
> >>>GSOC2016 this time. Since i have been engaging with some similar
> >>>projects
> >>
> >>>i
> >>
> >>>think it will be a great experience for me.
> >>
> >>>
> >>
> >>>I am a final year student in IESL College of Engineering, Sri lanka. I
> >>
> >>>have
> >>
> >>>learned machine learning and natural language processing stuff when I'm
> >>
> >>>doing my first degree (Computer Science) in University of Sri
> >>
> >>>Jayewardhenapura.
> >>
> >>>
> >>
> >>>In my internship period, I have actively contributed to a Twitter based
> >>
> >>>NLP
> >>
> >>>project. and We have published an article on IEEE Conference, "Real-time
> >>
> >>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
> >>
> >>>
> >>
> >>>Please let me know what you think and what you suggest.
> >>
> >>>
> >>
> >>>Please kindly give me further information on how I could proceed. I
> >>
> >>>couldn't able to find the mentioned paper "Multi-Class Sentiment
> >>>Analysis
> >>
> >>>in Twitter: a Pattern-Based Approach"
> >>
> >>>[1]
> >>
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
> >>r
> >>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
> >>c
> >>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
> >>0
> >>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
> >
> >
> >><
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
> >>i
> >>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
> >>n
> >>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
> >>t
> >>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
> >>
> >>>[2]
> >>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
> >>p
> >>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
> >>N
> >>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
> >>Y
> >>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
> >><
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
> >>x
> >>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
> >>0
> >>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
> >>i
> >>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
> >>
> >>>
> >>
> >>>Thanks
> >>
> >>>Madhawa Gunasekara
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >
> >
> >
> >
> >
> >
>
>

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
is data related in there that can be used for sentiment analysis :)
It can be adapted and is being used for that.

Anyways, yes looking forward to the task. Please send in your proposal
Madhawa.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Harshavardhan Manjunatha <hm...@usc.edu>
Date: Friday, March 25, 2016 at 2:45 PM
To: jpluser <ch...@jpl.nasa.gov>
Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
Data Science Group USC List <ir...@mymaillists.usc.edu>,
"kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
<de...@tika.apache.org>
Subject: Re: GSOC2016 Sentiment Analysis

>Dear Prof. Mattmann,
>
>
>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
>Translation b/w Spanish & Englosh.
>
>
>I dont think it can be adapted to Sentiment Analysis.
>
>
>Developing a generic training model/corpus for Sentiment Analysis that
>encapsulates social media, movie reviews, etc, etc will be a Challenging
>& Exciting Task !!
>
>
>Regards,
>Harsha
>
>
>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Sounds great Harsha. This is for Google Summer of Code, so collaborating
>would be great, and in this case, we would be working with Madhawa, should
>he choose to accept.
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov
>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Harshavardhan Manjunatha <hm...@usc.edu>
>Date: Friday, March 25, 2016 at 2:38 PM
>To: jpluser <ch...@jpl.nasa.gov>
>Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
>Data Science Group USC List <ir...@mymaillists.usc.edu>,
>"kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
><de...@tika.apache.org>
>Subject: Re: GSOC2016 Sentiment Analysis
>
>>Dear Prof. Mattmann,
>>
>>
>>I would love to collaborate on this & am interested in developing
>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>>
>>
>>I have completed an Applied NLP course @ USC.
>>
>>
>>I have done a Literature Review of Papers & Open Source Tools on the same
>>recently.
>>
>>
>>Regards,
>>Harsha
>>
>>
>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov> wrote:
>>
>>Hi Madhawa,
>>
>>
>>
>>So, how about a project that develops and contributes an Apache
>>
>>Tika and OpenNLP based SentimentAnalysisParser?
>>
>>
>>
>>I have some students currently doing work using the Fisher Callhome
>>
>>Corpus and you can build off that. I am CC’ing my USC IRDS team
>>
>>and my student Indhu who is working on this.
>>
>>
>>
>>Can you start working on your proposal by:
>>
>>
>>
>>1. Creating a JIRA issue here:
>>
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jir
>>a
>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l5
>>6
>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1m
>>s
>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>>
>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>>
>>
>>
>>2. Develop a proposal on the Tika wiki here:
>>
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_
>>G
>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU
>>8
>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoN
>>h
>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
>> (you will need permission, first
>>
>>sign up for your account on the wiki then tell me your username so I
>>
>>can add permissions for you)
>>
>>
>>
>>3. Apply through the Google Summer of Code 2016 program.
>>
>>
>>
>>4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
>>
>>dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
>>
>>as a community.
>>
>>
>>
>>Cool?
>>
>>
>>
>>Cheers,
>>
>>Chris
>>
>>
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>Chris Mattmann, Ph.D.
>>
>>Chief Architect
>>
>>Instrument Software and Science Data Systems Section (398)
>>
>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>
>>Office: 168-519, Mailstop: 168-527
>>
>>Email: chris.a.mattmann@nasa.gov
>>
>>WWW:
>
>
>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>Director, Information Retrieval and Data Science Group (IRDS)
>>
>>Adjunct Associate Professor, Computer Science Department
>>
>>University of Southern California, Los Angeles, CA 90089 USA
>>
>>WWW: http://irds.usc.edu/
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>-----Original Message-----
>>
>>From: Madhawa Kasun Gunasekara <ma...@gmail.com>
>>
>>Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>>
>>Date: Wednesday, March 16, 2016 at 10:51 PM
>>
>>To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>>
>>Subject: GSOC2016 Sentiment Analysis
>>
>>
>>
>>>Hi
>>
>>>
>>
>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>>
>>>GSOC2016 this time. Since i have been engaging with some similar
>>>projects
>>
>>>i
>>
>>>think it will be a great experience for me.
>>
>>>
>>
>>>I am a final year student in IESL College of Engineering, Sri lanka. I
>>
>>>have
>>
>>>learned machine learning and natural language processing stuff when I'm
>>
>>>doing my first degree (Computer Science) in University of Sri
>>
>>>Jayewardhenapura.
>>
>>>
>>
>>>In my internship period, I have actively contributed to a Twitter based
>>
>>>NLP
>>
>>>project. and We have published an article on IEEE Conference, "Real-time
>>
>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>>
>>>
>>
>>>Please let me know what you think and what you suggest.
>>
>>>
>>
>>>Please kindly give me further information on how I could proceed. I
>>
>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
>>>Analysis
>>
>>>in Twitter: a Pattern-Based Approach"
>>
>>>[1]
>>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
>>r
>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
>>c
>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
>>0
>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>
>
>><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
>>i
>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
>>n
>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
>>t
>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>>
>>>[2]
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
>>p
>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
>>N
>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
>>Y
>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
>>x
>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
>>0
>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
>>i
>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>>
>>>
>>
>>>Thanks
>>
>>>Madhawa Gunasekara
>>
>>
>>
>>
>>
>>
>>
>>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Thanks Harsha. Yes, I know about the Fisher Callhome Corpus. There
is data related in there that can be used for sentiment analysis :)
It can be adapted and is being used for that.

Anyways, yes looking forward to the task. Please send in your proposal
Madhawa.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Harshavardhan Manjunatha <hm...@usc.edu>
Date: Friday, March 25, 2016 at 2:45 PM
To: jpluser <ch...@jpl.nasa.gov>
Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
Data Science Group USC List <ir...@mymaillists.usc.edu>,
"kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
<de...@tika.apache.org>
Subject: Re: GSOC2016 Sentiment Analysis

>Dear Prof. Mattmann,
>
>
>Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
>Translation b/w Spanish & Englosh.
>
>
>I dont think it can be adapted to Sentiment Analysis.
>
>
>Developing a generic training model/corpus for Sentiment Analysis that
>encapsulates social media, movie reviews, etc, etc will be a Challenging
>& Exciting Task !!
>
>
>Regards,
>Harsha
>
>
>On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Sounds great Harsha. This is for Google Summer of Code, so collaborating
>would be great, and in this case, we would be working with Madhawa, should
>he choose to accept.
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattmann@nasa.gov
>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Director, Information Retrieval and Data Science Group (IRDS)
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>WWW: http://irds.usc.edu/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>-----Original Message-----
>From: Harshavardhan Manjunatha <hm...@usc.edu>
>Date: Friday, March 25, 2016 at 2:38 PM
>To: jpluser <ch...@jpl.nasa.gov>
>Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
>Data Science Group USC List <ir...@mymaillists.usc.edu>,
>"kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
><de...@tika.apache.org>
>Subject: Re: GSOC2016 Sentiment Analysis
>
>>Dear Prof. Mattmann,
>>
>>
>>I would love to collaborate on this & am interested in developing
>>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>>
>>
>>I have completed an Applied NLP course @ USC.
>>
>>
>>I have done a Literature Review of Papers & Open Source Tools on the same
>>recently.
>>
>>
>>Regards,
>>Harsha
>>
>>
>>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
>><ch...@jpl.nasa.gov> wrote:
>>
>>Hi Madhawa,
>>
>>
>>
>>So, how about a project that develops and contributes an Apache
>>
>>Tika and OpenNLP based SentimentAnalysisParser?
>>
>>
>>
>>I have some students currently doing work using the Fisher Callhome
>>
>>Corpus and you can build off that. I am CC’ing my USC IRDS team
>>
>>and my student Indhu who is working on this.
>>
>>
>>
>>Can you start working on your proposal by:
>>
>>
>>
>>1. Creating a JIRA issue here:
>>
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jir
>>a
>>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l5
>>6
>>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1m
>>s
>>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>>
>> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>>
>>
>>
>>2. Develop a proposal on the Tika wiki here:
>>
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_
>>G
>>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU
>>8
>>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoN
>>h
>>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
>> (you will need permission, first
>>
>>sign up for your account on the wiki then tell me your username so I
>>
>>can add permissions for you)
>>
>>
>>
>>3. Apply through the Google Summer of Code 2016 program.
>>
>>
>>
>>4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
>>
>>dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
>>
>>as a community.
>>
>>
>>
>>Cool?
>>
>>
>>
>>Cheers,
>>
>>Chris
>>
>>
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>Chris Mattmann, Ph.D.
>>
>>Chief Architect
>>
>>Instrument Software and Science Data Systems Section (398)
>>
>>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>
>>Office: 168-519, Mailstop: 168-527
>>
>>Email: chris.a.mattmann@nasa.gov
>>
>>WWW:
>
>
>>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>Director, Information Retrieval and Data Science Group (IRDS)
>>
>>Adjunct Associate Professor, Computer Science Department
>>
>>University of Southern California, Los Angeles, CA 90089 USA
>>
>>WWW: http://irds.usc.edu/
>>
>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>-----Original Message-----
>>
>>From: Madhawa Kasun Gunasekara <ma...@gmail.com>
>>
>>Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>>
>>Date: Wednesday, March 16, 2016 at 10:51 PM
>>
>>To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>>
>>Subject: GSOC2016 Sentiment Analysis
>>
>>
>>
>>>Hi
>>
>>>
>>
>>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>>
>>>GSOC2016 this time. Since i have been engaging with some similar
>>>projects
>>
>>>i
>>
>>>think it will be a great experience for me.
>>
>>>
>>
>>>I am a final year student in IESL College of Engineering, Sri lanka. I
>>
>>>have
>>
>>>learned machine learning and natural language processing stuff when I'm
>>
>>>doing my first degree (Computer Science) in University of Sri
>>
>>>Jayewardhenapura.
>>
>>>
>>
>>>In my internship period, I have actively contributed to a Twitter based
>>
>>>NLP
>>
>>>project. and We have published an article on IEEE Conference, "Real-time
>>
>>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>>
>>>
>>
>>>Please let me know what you think and what you suggest.
>>
>>>
>>
>>>Please kindly give me further information on how I could proceed. I
>>
>>>couldn't able to find the mentioned paper "Multi-Class Sentiment
>>>Analysis
>>
>>>in Twitter: a Pattern-Based Approach"
>>
>>>[1]
>>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
>>r
>>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
>>c
>>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
>>0
>>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>
>
>><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_j
>>i
>>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSf
>>n
>>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8
>>t
>>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>>
>>>[2]
>>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
>>p
>>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
>>N
>>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
>>Y
>>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_
>>x
>>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi
>>0
>>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLN
>>i
>>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>>
>>>
>>
>>>Thanks
>>
>>>Madhawa Gunasekara
>>
>>
>>
>>
>>
>>
>>
>>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by Harshavardhan Manjunatha <hm...@usc.edu>.
Dear Prof. Mattmann,

Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
Translation b/w Spanish & Englosh.

I dont think it can be adapted to Sentiment Analysis.

Developing a generic training model/corpus for Sentiment Analysis that
encapsulates social media, movie reviews, etc, etc will be a Challenging &
Exciting Task !!

Regards,
Harsha

On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Sounds great Harsha. This is for Google Summer of Code, so collaborating
> would be great, and in this case, we would be working with Madhawa, should
> he choose to accept.
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Harshavardhan Manjunatha <hm...@usc.edu>
> Date: Friday, March 25, 2016 at 2:38 PM
> To: jpluser <ch...@jpl.nasa.gov>
> Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
> Data Science Group USC List <ir...@mymaillists.usc.edu>,
> "kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
> <de...@tika.apache.org>
> Subject: Re: GSOC2016 Sentiment Analysis
>
> >Dear Prof. Mattmann,
> >
> >
> >I would love to collaborate on this & am interested in developing
> >Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
> >
> >
> >I have completed an Applied NLP course @ USC.
> >
> >
> >I have done a Literature Review of Papers & Open Source Tools on the same
> >recently.
> >
> >
> >Regards,
> >Harsha
> >
> >
> >On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov> wrote:
> >
> >Hi Madhawa,
> >
> >
> >
> >So, how about a project that develops and contributes an Apache
> >
> >Tika and OpenNLP based SentimentAnalysisParser?
> >
> >
> >
> >I have some students currently doing work using the Fisher Callhome
> >
> >Corpus and you can build off that. I am CC’ing my USC IRDS team
> >
> >and my student Indhu who is working on this.
> >
> >
> >
> >Can you start working on your proposal by:
> >
> >
> >
> >1. Creating a JIRA issue here:
> >
> >
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jira
> >_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56
> >W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1ms
> >1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
> >
> > tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
> >
> >
> >
> >2. Develop a proposal on the Tika wiki here:
> >
> >
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_G
> >SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8
> >xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoNh
> >rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> > (you will need permission, first
> >
> >sign up for your account on the wiki then tell me your username so I
> >
> >can add permissions for you)
> >
> >
> >
> >3. Apply through the Google Summer of Code 2016 program.
> >
> >
> >
> >4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
> >
> >dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
> >
> >as a community.
> >
> >
> >
> >Cool?
> >
> >
> >
> >Cheers,
> >
> >Chris
> >
> >
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >Chris Mattmann, Ph.D.
> >
> >Chief Architect
> >
> >Instrument Software and Science Data Systems Section (398)
> >
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >
> >Office: 168-519, Mailstop: 168-527
> >
> >Email: chris.a.mattmann@nasa.gov
> >
> >WWW:
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >Director, Information Retrieval and Data Science Group (IRDS)
> >
> >Adjunct Associate Professor, Computer Science Department
> >
> >University of Southern California, Los Angeles, CA 90089 USA
> >
> >WWW: http://irds.usc.edu/
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >-----Original Message-----
> >
> >From: Madhawa Kasun Gunasekara <ma...@gmail.com>
> >
> >Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
> >
> >Date: Wednesday, March 16, 2016 at 10:51 PM
> >
> >To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
> >
> >Subject: GSOC2016 Sentiment Analysis
> >
> >
> >
> >>Hi
> >
> >>
> >
> >>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
> >
> >>GSOC2016 this time. Since i have been engaging with some similar projects
> >
> >>i
> >
> >>think it will be a great experience for me.
> >
> >>
> >
> >>I am a final year student in IESL College of Engineering, Sri lanka. I
> >
> >>have
> >
> >>learned machine learning and natural language processing stuff when I'm
> >
> >>doing my first degree (Computer Science) in University of Sri
> >
> >>Jayewardhenapura.
> >
> >>
> >
> >>In my internship period, I have actively contributed to a Twitter based
> >
> >>NLP
> >
> >>project. and We have published an article on IEEE Conference, "Real-time
> >
> >>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
> >
> >>
> >
> >>Please let me know what you think and what you suggest.
> >
> >>
> >
> >>Please kindly give me further information on how I could proceed. I
> >
> >>couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
> >
> >>in Twitter: a Pattern-Based Approach"
> >
> >>[1]
> >
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jir
> >a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc
> >_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0
> >&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
> ><
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
> >ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
> >c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
> >0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
> >
> >>[2]
> >
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_xp
> >l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0N
> >U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiY
> >McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
> ><
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
> >pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
> >NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
> >YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
> >
> >>
> >
> >>Thanks
> >
> >>Madhawa Gunasekara
> >
> >
> >
> >
> >
> >
> >
> >
>
>

Re: GSOC2016 Sentiment Analysis

Posted by Harshavardhan Manjunatha <hm...@usc.edu>.
Dear Prof. Mattmann,

Thanks. But the Fisher Callhome Corpus is a training Corpus for Machine
Translation b/w Spanish & Englosh.

I dont think it can be adapted to Sentiment Analysis.

Developing a generic training model/corpus for Sentiment Analysis that
encapsulates social media, movie reviews, etc, etc will be a Challenging &
Exciting Task !!

Regards,
Harsha

On Fri, Mar 25, 2016 at 2:42 PM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Sounds great Harsha. This is for Google Summer of Code, so collaborating
> would be great, and in this case, we would be working with Madhawa, should
> he choose to accept.
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: Harshavardhan Manjunatha <hm...@usc.edu>
> Date: Friday, March 25, 2016 at 2:38 PM
> To: jpluser <ch...@jpl.nasa.gov>
> Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
> Data Science Group USC List <ir...@mymaillists.usc.edu>,
> "kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
> <de...@tika.apache.org>
> Subject: Re: GSOC2016 Sentiment Analysis
>
> >Dear Prof. Mattmann,
> >
> >
> >I would love to collaborate on this & am interested in developing
> >Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
> >
> >
> >I have completed an Applied NLP course @ USC.
> >
> >
> >I have done a Literature Review of Papers & Open Source Tools on the same
> >recently.
> >
> >
> >Regards,
> >Harsha
> >
> >
> >On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
> ><ch...@jpl.nasa.gov> wrote:
> >
> >Hi Madhawa,
> >
> >
> >
> >So, how about a project that develops and contributes an Apache
> >
> >Tika and OpenNLP based SentimentAnalysisParser?
> >
> >
> >
> >I have some students currently doing work using the Fisher Callhome
> >
> >Corpus and you can build off that. I am CC’ing my USC IRDS team
> >
> >and my student Indhu who is working on this.
> >
> >
> >
> >Can you start working on your proposal by:
> >
> >
> >
> >1. Creating a JIRA issue here:
> >
> >
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jira
> >_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56
> >W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1ms
> >1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
> >
> > tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
> >
> >
> >
> >2. Develop a proposal on the Tika wiki here:
> >
> >
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_G
> >SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8
> >xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoNh
> >rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> > (you will need permission, first
> >
> >sign up for your account on the wiki then tell me your username so I
> >
> >can add permissions for you)
> >
> >
> >
> >3. Apply through the Google Summer of Code 2016 program.
> >
> >
> >
> >4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
> >
> >dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
> >
> >as a community.
> >
> >
> >
> >Cool?
> >
> >
> >
> >Cheers,
> >
> >Chris
> >
> >
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >Chris Mattmann, Ph.D.
> >
> >Chief Architect
> >
> >Instrument Software and Science Data Systems Section (398)
> >
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >
> >Office: 168-519, Mailstop: 168-527
> >
> >Email: chris.a.mattmann@nasa.gov
> >
> >WWW:
> >http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >Director, Information Retrieval and Data Science Group (IRDS)
> >
> >Adjunct Associate Professor, Computer Science Department
> >
> >University of Southern California, Los Angeles, CA 90089 USA
> >
> >WWW: http://irds.usc.edu/
> >
> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >-----Original Message-----
> >
> >From: Madhawa Kasun Gunasekara <ma...@gmail.com>
> >
> >Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
> >
> >Date: Wednesday, March 16, 2016 at 10:51 PM
> >
> >To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
> >
> >Subject: GSOC2016 Sentiment Analysis
> >
> >
> >
> >>Hi
> >
> >>
> >
> >>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
> >
> >>GSOC2016 this time. Since i have been engaging with some similar projects
> >
> >>i
> >
> >>think it will be a great experience for me.
> >
> >>
> >
> >>I am a final year student in IESL College of Engineering, Sri lanka. I
> >
> >>have
> >
> >>learned machine learning and natural language processing stuff when I'm
> >
> >>doing my first degree (Computer Science) in University of Sri
> >
> >>Jayewardhenapura.
> >
> >>
> >
> >>In my internship period, I have actively contributed to a Twitter based
> >
> >>NLP
> >
> >>project. and We have published an article on IEEE Conference, "Real-time
> >
> >>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
> >
> >>
> >
> >>Please let me know what you think and what you suggest.
> >
> >>
> >
> >>Please kindly give me further information on how I could proceed. I
> >
> >>couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
> >
> >>in Twitter: a Pattern-Based Approach"
> >
> >>[1]
> >
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jir
> >a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc
> >_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0
> >&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
> ><
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
> >ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
> >c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
> >0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
> >
> >>[2]
> >
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_xp
> >l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0N
> >U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiY
> >McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
> ><
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
> >pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
> >NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
> >YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
> >
> >>
> >
> >>Thanks
> >
> >>Madhawa Gunasekara
> >
> >
> >
> >
> >
> >
> >
> >
>
>

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Sounds great Harsha. This is for Google Summer of Code, so collaborating
would be great, and in this case, we would be working with Madhawa, should
he choose to accept.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Harshavardhan Manjunatha <hm...@usc.edu>
Date: Friday, March 25, 2016 at 2:38 PM
To: jpluser <ch...@jpl.nasa.gov>
Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
Data Science Group USC List <ir...@mymaillists.usc.edu>,
"kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
<de...@tika.apache.org>
Subject: Re: GSOC2016 Sentiment Analysis

>Dear Prof. Mattmann,
>
>
>I would love to collaborate on this & am interested in developing
>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>
>
>I have completed an Applied NLP course @ USC.
>
>
>I have done a Literature Review of Papers & Open Source Tools on the same
>recently.
>
>
>Regards,
>Harsha
>
>
>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Hi Madhawa,
>
>
>
>So, how about a project that develops and contributes an Apache
>
>Tika and OpenNLP based SentimentAnalysisParser?
>
>
>
>I have some students currently doing work using the Fisher Callhome
>
>Corpus and you can build off that. I am CC’ing my USC IRDS team
>
>and my student Indhu who is working on this.
>
>
>
>Can you start working on your proposal by:
>
>
>
>1. Creating a JIRA issue here:
>
>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jira
>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56
>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1ms
>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>
> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>
>
>
>2. Develop a proposal on the Tika wiki here:
>
>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_G
>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8
>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoNh
>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> (you will need permission, first
>
>sign up for your account on the wiki then tell me your username so I
>
>can add permissions for you)
>
>
>
>3. Apply through the Google Summer of Code 2016 program.
>
>
>
>4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
>
>dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
>
>as a community.
>
>
>
>Cool?
>
>
>
>Cheers,
>
>Chris
>
>
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>Chris Mattmann, Ph.D.
>
>Chief Architect
>
>Instrument Software and Science Data Systems Section (398)
>
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>
>Office: 168-519, Mailstop: 168-527
>
>Email: chris.a.mattmann@nasa.gov
>
>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>Director, Information Retrieval and Data Science Group (IRDS)
>
>Adjunct Associate Professor, Computer Science Department
>
>University of Southern California, Los Angeles, CA 90089 USA
>
>WWW: http://irds.usc.edu/
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>
>
>
>
>
>
>-----Original Message-----
>
>From: Madhawa Kasun Gunasekara <ma...@gmail.com>
>
>Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>
>Date: Wednesday, March 16, 2016 at 10:51 PM
>
>To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>
>Subject: GSOC2016 Sentiment Analysis
>
>
>
>>Hi
>
>>
>
>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>
>>GSOC2016 this time. Since i have been engaging with some similar projects
>
>>i
>
>>think it will be a great experience for me.
>
>>
>
>>I am a final year student in IESL College of Engineering, Sri lanka. I
>
>>have
>
>>learned machine learning and natural language processing stuff when I'm
>
>>doing my first degree (Computer Science) in University of Sri
>
>>Jayewardhenapura.
>
>>
>
>>In my internship period, I have actively contributed to a Twitter based
>
>>NLP
>
>>project. and We have published an article on IEEE Conference, "Real-time
>
>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>
>>
>
>>Please let me know what you think and what you suggest.
>
>>
>
>>Please kindly give me further information on how I could proceed. I
>
>>couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
>
>>in Twitter: a Pattern-Based Approach"
>
>>[1] 
>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jir
>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc
>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0
>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>
>>[2] 
>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_xp
>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0N
>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiY
>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>
>>
>
>>Thanks
>
>>Madhawa Gunasekara
>
>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Sounds great Harsha. This is for Google Summer of Code, so collaborating
would be great, and in this case, we would be working with Madhawa, should
he choose to accept.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Harshavardhan Manjunatha <hm...@usc.edu>
Date: Friday, March 25, 2016 at 2:38 PM
To: jpluser <ch...@jpl.nasa.gov>
Cc: "dev@opennlp.apache.org" <de...@opennlp.apache.org>, Information and
Data Science Group USC List <ir...@mymaillists.usc.edu>,
"kamalaku@usc.edu" <ka...@usc.edu>, "dev@tika.apache.org"
<de...@tika.apache.org>
Subject: Re: GSOC2016 Sentiment Analysis

>Dear Prof. Mattmann,
>
>
>I would love to collaborate on this & am interested in developing
>Sentiment Analysis Tika Parsers leveraging Apache OpenNLP.
>
>
>I have completed an Applied NLP course @ USC.
>
>
>I have done a Literature Review of Papers & Open Source Tools on the same
>recently.
>
>
>Regards,
>Harsha
>
>
>On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980)
><ch...@jpl.nasa.gov> wrote:
>
>Hi Madhawa,
>
>
>
>So, how about a project that develops and contributes an Apache
>
>Tika and OpenNLP based SentimentAnalysisParser?
>
>
>
>I have some students currently doing work using the Fisher Callhome
>
>Corpus and you can build off that. I am CC’ing my USC IRDS team
>
>and my student Indhu who is working on this.
>
>
>
>Can you start working on your proposal by:
>
>
>
>1. Creating a JIRA issue here:
>
>https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jira
>_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56
>W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1ms
>1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>
> tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>
>
>
>2. Develop a proposal on the Tika wiki here:
>
>https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_G
>SoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8
>xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoNh
>rlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> (you will need permission, first
>
>sign up for your account on the wiki then tell me your username so I
>
>can add permissions for you)
>
>
>
>3. Apply through the Google Summer of Code 2016 program.
>
>
>
>4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
>
>dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
>
>as a community.
>
>
>
>Cool?
>
>
>
>Cheers,
>
>Chris
>
>
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>Chris Mattmann, Ph.D.
>
>Chief Architect
>
>Instrument Software and Science Data Systems Section (398)
>
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>
>Office: 168-519, Mailstop: 168-527
>
>Email: chris.a.mattmann@nasa.gov
>
>WWW:  
>http://sunset.usc.edu/~mattmann/ <http://sunset.usc.edu/~mattmann/>
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>Director, Information Retrieval and Data Science Group (IRDS)
>
>Adjunct Associate Professor, Computer Science Department
>
>University of Southern California, Los Angeles, CA 90089 USA
>
>WWW: http://irds.usc.edu/
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>
>
>
>
>
>
>-----Original Message-----
>
>From: Madhawa Kasun Gunasekara <ma...@gmail.com>
>
>Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>
>Date: Wednesday, March 16, 2016 at 10:51 PM
>
>To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>
>Subject: GSOC2016 Sentiment Analysis
>
>
>
>>Hi
>
>>
>
>>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>
>>GSOC2016 this time. Since i have been engaging with some similar projects
>
>>i
>
>>think it will be a great experience for me.
>
>>
>
>>I am a final year student in IESL College of Engineering, Sri lanka. I
>
>>have
>
>>learned machine learning and natural language processing stuff when I'm
>
>>doing my first degree (Computer Science) in University of Sri
>
>>Jayewardhenapura.
>
>>
>
>>In my internship period, I have actively contributed to a Twitter based
>
>>NLP
>
>>project. and We have published an article on IEEE Conference, "Real-time
>
>>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>
>>
>
>>Please let me know what you think and what you suggest.
>
>>
>
>>Please kindly give me further information on how I could proceed. I
>
>>couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
>
>>in Twitter: a Pattern-Based Approach"
>
>>[1] 
>https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jir
>a_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc
>_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0
>&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
><https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji
>ra_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfn
>c_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t
>0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=>
>
>>[2] 
>https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_xp
>l_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0N
>U5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiY
>McQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
><https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_x
>pl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0
>NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNi
>YMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=>
>
>>
>
>>Thanks
>
>>Madhawa Gunasekara
>
>
>
>
>
>
>
>


Re: GSOC2016 Sentiment Analysis

Posted by Harshavardhan Manjunatha <hm...@usc.edu>.
Dear Prof. Mattmann,

I would love to collaborate on this & am interested in developing Sentiment
Analysis Tika Parsers leveraging Apache OpenNLP.

I have completed an Applied NLP course @ USC.

I have done a Literature Review of Papers & Open Source Tools on the same
recently.

Regards,
Harsha

On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Hi Madhawa,
>
>
>
> So, how about a project that develops and contributes an Apache
>
> Tika and OpenNLP based SentimentAnalysisParser?
>
>
>
> I have some students currently doing work using the Fisher Callhome
>
> Corpus and you can build off that. I am CC’ing my USC IRDS team
>
> and my student Indhu who is working on this.
>
>
>
> Can you start working on your proposal by:
>
>
>
> 1. Creating a JIRA issue here:
>
>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jira_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1ms1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>
>  tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>
>
>
> 2. Develop a proposal on the Tika wiki here:
>
>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_GSoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoNhrlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> (you will need permission, first
>
> sign up for your account on the wiki then tell me your username so I
>
> can add permissions for you)
>
>
>
> 3. Apply through the Google Summer of Code 2016 program.
>
>
>
> 4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
>
> dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
>
> as a community.
>
>
>
> Cool?
>
>
>
> Cheers,
>
> Chris
>
>
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
> Chris Mattmann, Ph.D.
>
> Chief Architect
>
> Instrument Software and Science Data Systems Section (398)
>
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>
> Office: 168-519, Mailstop: 168-527
>
> Email: chris.a.mattmann@nasa.gov
>
> WWW:  http://sunset.usc.edu/~mattmann/
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
> Director, Information Retrieval and Data Science Group (IRDS)
>
> Adjunct Associate Professor, Computer Science Department
>
> University of Southern California, Los Angeles, CA 90089 USA
>
> WWW: http://irds.usc.edu/
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>
>
>
>
>
>
> -----Original Message-----
>
> From: Madhawa Kasun Gunasekara <ma...@gmail.com>
>
> Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>
> Date: Wednesday, March 16, 2016 at 10:51 PM
>
> To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>
> Subject: GSOC2016 Sentiment Analysis
>
>
>
> >Hi
>
> >
>
> >I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>
> >GSOC2016 this time. Since i have been engaging with some similar projects
>
> >i
>
> >think it will be a great experience for me.
>
> >
>
> >I am a final year student in IESL College of Engineering, Sri lanka. I
>
> >have
>
> >learned machine learning and natural language processing stuff when I'm
>
> >doing my first degree (Computer Science) in University of Sri
>
> >Jayewardhenapura.
>
> >
>
> >In my internship period, I have actively contributed to a Twitter based
>
> >NLP
>
> >project. and We have published an article on IEEE Conference, "Real-time
>
> >Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>
> >
>
> >Please let me know what you think and what you suggest.
>
> >
>
> >Please kindly give me further information on how I could proceed. I
>
> >couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
>
> >in Twitter: a Pattern-Based Approach"
>
> >[1]
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jira_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>
> >[2]
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_xpl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>
> >
>
> >Thanks
>
> >Madhawa Gunasekara
>
>
>
>

Re: GSOC2016 Sentiment Analysis

Posted by Harshavardhan Manjunatha <hm...@usc.edu>.
Dear Prof. Mattmann,

I would love to collaborate on this & am interested in developing Sentiment
Analysis Tika Parsers leveraging Apache OpenNLP.

I have completed an Applied NLP course @ USC.

I have done a Literature Review of Papers & Open Source Tools on the same
recently.

Regards,
Harsha

On Fri, Mar 25, 2016 at 2:07 PM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Hi Madhawa,
>
>
>
> So, how about a project that develops and contributes an Apache
>
> Tika and OpenNLP based SentimentAnalysisParser?
>
>
>
> I have some students currently doing work using the Fisher Callhome
>
> Corpus and you can build off that. I am CC’ing my USC IRDS team
>
> and my student Indhu who is working on this.
>
>
>
> Can you start working on your proposal by:
>
>
>
> 1. Creating a JIRA issue here:
>
>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__issues.apache.org_jira_browse_TIKA&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=BPBK1ms1hzt9Tb5RdkU5B7FqRxuyMu3BoROpgd8Tvdw&e=
>
>  tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please
>
>
>
> 2. Develop a proposal on the Tika wiki here:
>
>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.apache.org_tika_GSoC2016&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=GGQdxogPSoNhrlr5mALyeK4Jkn7og7u5K0Mr6qGuQ1s&e=
> (you will need permission, first
>
> sign up for your account on the wiki then tell me your username so I
>
> can add permissions for you)
>
>
>
> 3. Apply through the Google Summer of Code 2016 program.
>
>
>
> 4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
>
> dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
>
> as a community.
>
>
>
> Cool?
>
>
>
> Cheers,
>
> Chris
>
>
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
> Chris Mattmann, Ph.D.
>
> Chief Architect
>
> Instrument Software and Science Data Systems Section (398)
>
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>
> Office: 168-519, Mailstop: 168-527
>
> Email: chris.a.mattmann@nasa.gov
>
> WWW:  http://sunset.usc.edu/~mattmann/
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
> Director, Information Retrieval and Data Science Group (IRDS)
>
> Adjunct Associate Professor, Computer Science Department
>
> University of Southern California, Los Angeles, CA 90089 USA
>
> WWW: http://irds.usc.edu/
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
>
>
>
>
>
>
> -----Original Message-----
>
> From: Madhawa Kasun Gunasekara <ma...@gmail.com>
>
> Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>
> Date: Wednesday, March 16, 2016 at 10:51 PM
>
> To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
>
> Subject: GSOC2016 Sentiment Analysis
>
>
>
> >Hi
>
> >
>
> >I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>
> >GSOC2016 this time. Since i have been engaging with some similar projects
>
> >i
>
> >think it will be a great experience for me.
>
> >
>
> >I am a final year student in IESL College of Engineering, Sri lanka. I
>
> >have
>
> >learned machine learning and natural language processing stuff when I'm
>
> >doing my first degree (Computer Science) in University of Sri
>
> >Jayewardhenapura.
>
> >
>
> >In my internship period, I have actively contributed to a Twitter based
>
> >NLP
>
> >project. and We have published an article on IEEE Conference, "Real-time
>
> >Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>
> >
>
> >Please let me know what you think and what you suggest.
>
> >
>
> >Please kindly give me further information on how I could proceed. I
>
> >couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
>
> >in Twitter: a Pattern-Based Approach"
>
> >[1]
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jira_browse_OPENNLP-2D840&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=p9CPiDKtrgF3BYZ8nLSWUXFncDjBBYV2ejUW4wPXtCY&e=
>
> >[2]
> https://urldefense.proofpoint.com/v2/url?u=http-3A__ieeexplore.ieee.org_xpl_articleDetails.jsp-3Farnumber-3D7377667&d=CwIGaQ&c=clK7kQUTWtAVEOVIgvi0NU5BOUHhpN0H8p7CSfnc_gI&r=8l56W6EU8xpHKOeTqpG03w&m=FEfICxmcDheHndXqky_rLNiYMcQE9yeOn7RoOwpR8t0&s=V6EFcS7WaMwDxGZ5Ttm5-f-UTMLBlmIIBgkJYHB7P1w&e=
>
> >
>
> >Thanks
>
> >Madhawa Gunasekara
>
>
>
>

Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Hi Madhawa,

So, how about a project that develops and contributes an Apache
Tika and OpenNLP based SentimentAnalysisParser?

I have some students currently doing work using the Fisher Callhome
Corpus and you can build off that. I am CC’ing my USC IRDS team
and my student Indhu who is working on this.

Can you start working on your proposal by:

1. Creating a JIRA issue here:
http://issues.apache.org/jira/browse/TIKA
 tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please

2. Develop a proposal on the Tika wiki here:
http://wiki.apache.org/tika/GSoC2016 (you will need permission, first
sign up for your account on the wiki then tell me your username so I
can add permissions for you)

3. Apply through the Google Summer of Code 2016 program.

4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
as a community.

Cool?

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Madhawa Kasun Gunasekara <ma...@gmail.com>
Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
Date: Wednesday, March 16, 2016 at 10:51 PM
To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
Subject: GSOC2016 Sentiment Analysis

>Hi
>
>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>GSOC2016 this time. Since i have been engaging with some similar projects
>i
>think it will be a great experience for me.
>
>I am a final year student in IESL College of Engineering, Sri lanka. I
>have
>learned machine learning and natural language processing stuff when I'm
>doing my first degree (Computer Science) in University of Sri
>Jayewardhenapura.
>
>In my internship period, I have actively contributed to a Twitter based
>NLP
>project. and We have published an article on IEEE Conference, "Real-time
>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>
>Please let me know what you think and what you suggest.
>
>Please kindly give me further information on how I could proceed. I
>couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
>in Twitter: a Pattern-Based Approach"
>[1] https://issues.apache.org/jira/browse/OPENNLP-840
>[2] http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7377667
>
>Thanks
>Madhawa Gunasekara


Re: GSOC2016 Sentiment Analysis

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Hi Madhawa,

So, how about a project that develops and contributes an Apache
Tika and OpenNLP based SentimentAnalysisParser?

I have some students currently doing work using the Fisher Callhome
Corpus and you can build off that. I am CC’ing my USC IRDS team
and my student Indhu who is working on this.

Can you start working on your proposal by:

1. Creating a JIRA issue here:
http://issues.apache.org/jira/browse/TIKA
 tag it with ‘gsoc2016’, ‘memex’, and ‘irds’ please

2. Develop a proposal on the Tika wiki here:
http://wiki.apache.org/tika/GSoC2016 (you will need permission, first
sign up for your account on the wiki then tell me your username so I
can add permissions for you)

3. Apply through the Google Summer of Code 2016 program.

4. Get in touch with me, and Indhu, and keep dev@tika.a.o and
dev@openlp.a.o and irds-L@usc.edu in the loop so we can discuss together
as a community.

Cool?

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Madhawa Kasun Gunasekara <ma...@gmail.com>
Reply-To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
Date: Wednesday, March 16, 2016 at 10:51 PM
To: "dev@opennlp.apache.org" <de...@opennlp.apache.org>
Subject: GSOC2016 Sentiment Analysis

>Hi
>
>I am interesting on contribute to OPENNLP-840: "Sentiment Analysis" for
>GSOC2016 this time. Since i have been engaging with some similar projects
>i
>think it will be a great experience for me.
>
>I am a final year student in IESL College of Engineering, Sri lanka. I
>have
>learned machine learning and natural language processing stuff when I'm
>doing my first degree (Computer Science) in University of Sri
>Jayewardhenapura.
>
>In my internship period, I have actively contributed to a Twitter based
>NLP
>project. and We have published an article on IEEE Conference, "Real-time
>Natural Language Processing for Crowdsourced Road Traffic Alerts" [2] .
>
>Please let me know what you think and what you suggest.
>
>Please kindly give me further information on how I could proceed. I
>couldn't able to find the mentioned paper "Multi-Class Sentiment Analysis
>in Twitter: a Pattern-Based Approach"
>[1] https://issues.apache.org/jira/browse/OPENNLP-840
>[2] http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7377667
>
>Thanks
>Madhawa Gunasekara