You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Mondher Bouazizi (JIRA)" <ji...@apache.org> on 2015/02/12 10:58:12 UTC

[jira] [Created] (OPENNLP-758) Unsupervised WSD techniques

Mondher Bouazizi created OPENNLP-758:
----------------------------------------

             Summary: Unsupervised WSD techniques
                 Key: OPENNLP-758
                 URL: https://issues.apache.org/jira/browse/OPENNLP-758
             Project: OpenNLP
          Issue Type: New Feature
          Components: POS Tagger, Sentence Detector, Stemmer
            Reporter: Mondher Bouazizi


The objective of Word Sense Disambiguation (WSD) is to determine which sense of a word is meant in a particular context. Therefore, WSD is a classification task, where the classes are the different senses of the ambiguous word.
Different techniques are proposed in the academic literature, which fall mainly into two categories: Supervised and Unsupervised.
For this component, we focus on unsupervised techniques: these methods are based on unlabeled data, and do not exploit any manually tagged data.
The object of this project is to create a WSD solution (for English) that implements some unsupervised techniques. For example:
       Context Clustering
       Word Clustering
       Cooccurrence Graphs
       Overlap of Sense Definitions
       Selectional Preferences
       Structural Approaches
       Etc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)