Posted to issues@opennlp.apache.org by "Anthony Beylerian (JIRA)" <ji...@apache.org> on 2015/06/27 15:25:04 UTC
[jira] [Comment Edited] (OPENNLP-758) Unsupervised WSD techniques
[ https://issues.apache.org/jira/browse/OPENNLP-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14601507#comment-14601507 ]
Anthony Beylerian edited comment on OPENNLP-758 at 6/27/15 1:24 PM:
--------------------------------------------------------------------
Thank you very much for the help! I will write the docs as required.
Also, I will modify the pre-processing step accordingly, and maybe include that in the test section as an example.
was (Author: beylerian):
Thank you very much for the help! I will write the docs as required.
Also, I will modify the pre-processing step accordingly, and maybe include that in the test section as an example.
We will submit an ICLA asap.
> Unsupervised WSD techniques
> ---------------------------
>
> Key: OPENNLP-758
> URL: https://issues.apache.org/jira/browse/OPENNLP-758
> Project: OpenNLP
> Issue Type: New Feature
> Components: POS Tagger, Sentence Detector, Stemmer
> Reporter: Mondher Bouazizi
> Labels: gsoc, gsoc2015, java, nlp, wsd
> Attachments: lesk_parameters.patch, opennlp-tools-disambiguator.patch
>
>
> The objective of Word Sense Disambiguation (WSD) is to determine which sense of a word is meant in a particular context. WSD is therefore a classification task in which the classes are the different senses of the ambiguous word.
> The academic literature proposes various techniques, which fall mainly into two categories: supervised and unsupervised.
> For this component, we focus on unsupervised techniques: these methods rely on unlabeled data and do not use any manually tagged data.
> The objective of this project is to create a WSD solution (for English) that implements some unsupervised techniques. For example:
> - Context Clustering
> - Word Clustering
> - Co-occurrence Graphs
> - Overlap of Sense Definitions
> - Selectional Preferences
> - Structural Approaches
> - Etc.
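
To make the "Overlap of Sense Definitions" item concrete, here is a minimal sketch of a simplified Lesk-style approach: pick the sense whose definition shares the most words with the context. This is only an illustration, not the code in the attached patches. The class name `SimplifiedLesk`, the sense ids, and the two glosses are hypothetical toy data; a real implementation would read glosses from a lexical resource such as WordNet and would filter stop words.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

public class SimplifiedLesk {

    // Split text into a set of lowercase word tokens (no stop-word filtering, for brevity).
    static Set<String> tokens(String text) {
        return new HashSet<>(Arrays.asList(text.toLowerCase().split("\\W+")));
    }

    // Return the id of the sense whose gloss shares the most tokens with the context.
    // Ties are resolved in favour of the sense listed first; an empty map yields null.
    static String disambiguate(String context, Map<String, String> glosses) {
        Set<String> contextTokens = tokens(context);
        String best = null;
        int bestOverlap = -1;
        for (Map.Entry<String, String> sense : glosses.entrySet()) {
            Set<String> shared = tokens(sense.getValue());
            shared.retainAll(contextTokens); // keep only tokens also found in the context
            if (shared.size() > bestOverlap) {
                bestOverlap = shared.size();
                best = sense.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Hypothetical toy sense inventory for the word "bank".
        Map<String, String> bank = new LinkedHashMap<>();
        bank.put("bank#finance", "an institution that accepts deposits and lends money");
        bank.put("bank#river", "sloping land beside a river or lake");

        System.out.println(disambiguate("she sat on the bank of the river", bank));
    }
}
```

No labeled training data is involved: the only inputs are the context and the sense definitions, which is what makes the approach unsupervised in the sense described above.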
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)