You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Vidura Mudalige (JIRA)" <ji...@apache.org> on 2015/03/04 06:36:04 UTC

[jira] [Commented] (OPENNLP-758) Unsupervised WSD techniques

    [ https://issues.apache.org/jira/browse/OPENNLP-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346426#comment-14346426 ] 

Vidura Mudalige commented on OPENNLP-758:
-----------------------------------------

Hi,
I am Vidura, a third year Computer Science and Engineering undergraduate from University of Moratuwa. I'm very much interested in working with Apache OpenNLP project in GSoC 2015.

I have worked in some open source projects. Also I have used Apache OpenNLP and Apache UIMA for some of my previous projects. Nowadays I am working in a open source project called WSO2 User Engagement Server.[1]
 
I cloned and successfully built the apache/opennlp.git.[2] I would like to know more details about the issue and expected deliverables.

[1].https://github.com/wso2/product-ues/tree/dashboards-2.0
[2].https://github.com/apache/opennlp

> Unsupervised WSD techniques
> ---------------------------
>
>                 Key: OPENNLP-758
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-758
>             Project: OpenNLP
>          Issue Type: New Feature
>          Components: POS Tagger, Sentence Detector, Stemmer
>            Reporter: Mondher Bouazizi
>              Labels: gsoc, gsoc2015, java, nlp, wsd
>
> The objective of Word Sense Disambiguation (WSD) is to determine which sense of a word is meant in a particular context. Therefore, WSD is a classification task, where the classes are the different senses of the ambiguous word.
> Different techniques are proposed in the academic literature, which fall mainly into two categories: Supervised and Unsupervised.
> For this component, we focus on unsupervised techniques: these methods are based on unlabeled data, and do not exploit any manually tagged data.
> The object of this project is to create a WSD solution (for English) that implements some unsupervised techniques. For example:
> - Context Clustering
> - Word Clustering
> - Cooccurrence Graphs
> - Overlap of Sense Definitions
> - Selectional Preferences
> - Structural Approaches
> - Etc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)