Posted to dev@stanbol.apache.org by Fabian Christ <ch...@googlemail.com> on 2012/05/16 10:51:43 UTC

Re: [jira] [Created] (STANBOL-614) Enhancer returns inconsistent results

Hi Nosiert,

2012/5/15 Nosiert Batiste (JIRA) <ji...@apache.org>:
> I will try to work around this problem by simply converting everything to plain text.

Yes, that's the best way to handle this for the moment. Apache Stanbol
currently has no (good) support for annotating HTML sources. Maybe you
would like to implement an enhancement engine that converts your HTML
into plain text. Such an engine would have to run before the entity
extraction engines come into play, so that they only see plain text.
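
To illustrate the conversion step such an engine would perform, here is a
rough, standalone sketch using only the JDK. The class name HtmlToPlainText
and the method toPlainText are made up for this example; nothing here is
Stanbol API, and wiring it into an actual enhancement engine is left out.
It is a crude regex-based approach, so a real HTML parser (for example
jsoup or Apache Tika) would likely be more robust on messy input.

```java
import java.util.regex.Pattern;

/**
 * Hypothetical helper (not part of Stanbol): crude HTML-to-plain-text
 * conversion based on regular expressions.
 */
public final class HtmlToPlainText {

    // Whole <script>...</script> and <style>...</style> elements, including content.
    private static final Pattern SCRIPT_OR_STYLE =
            Pattern.compile("(?is)<(script|style)\\b.*?</\\1\\s*>");
    // HTML comments.
    private static final Pattern COMMENT = Pattern.compile("(?s)<!--.*?-->");
    // A small, incomplete set of block-level tags that should separate words.
    private static final Pattern BLOCK_TAG = Pattern.compile(
            "(?i)</?(p|div|br|li|ul|ol|h[1-6]|tr|td|th|table|blockquote)\\b[^>]*>");
    // Any remaining tag (inline markup such as <b> or <a href=...>).
    private static final Pattern TAG = Pattern.compile("<[^>]+>");
    private static final Pattern WHITESPACE = Pattern.compile("\\s+");

    private HtmlToPlainText() {
    }

    public static String toPlainText(String html) {
        String text = SCRIPT_OR_STYLE.matcher(html).replaceAll(" ");
        text = COMMENT.matcher(text).replaceAll(" ");
        // Block tags become a space so adjacent paragraphs do not run together.
        text = BLOCK_TAG.matcher(text).replaceAll(" ");
        // Inline tags are dropped without adding whitespace.
        text = TAG.matcher(text).replaceAll("");
        // Decode only a handful of common entities; "&amp;" goes last so that
        // "&amp;lt;" ends up as the literal text "&lt;".
        text = text.replace("&nbsp;", " ")
                .replace("&lt;", "<")
                .replace("&gt;", ">")
                .replace("&quot;", "\"")
                .replace("&amp;", "&");
        // Collapse runs of whitespace and trim the ends.
        return WHITESPACE.matcher(text).replaceAll(" ").trim();
    }
}
```

Calling HtmlToPlainText.toPlainText(html) on the content before passing it
to the enhancer is roughly the workaround you describe. Note that character
offsets in the resulting annotations then refer to the plain text, not to
the original HTML.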

Best,
 - Fabian

-- 
Fabian
http://twitter.com/fctwitt