You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "raghavendra prabhu (JIRA)" <ji...@apache.org> on 2005/11/25 13:56:55 UTC

[jira] Created: (NUTCH-129) rtf-parser does not work when opened with wordpad files and saved

rtf-parser does not work when opened with wordpad files and saved
-----------------------------------------------------------------

         Key: NUTCH-129
         URL: http://issues.apache.org/jira/browse/NUTCH-129
     Project: Nutch
        Type: Bug
  Components: indexer  
 Environment: A sample rtf file modified under windows wordpad and then indexed by nutch
windows
    Reporter: raghavendra prabhu


The above thing failed as wordpad seems to rewrite control information 

Cant we use RTFEdit kit to do the parser and it will be not a LGPL issue also

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


[jira] Commented: (NUTCH-129) rtf-parser does not work when opened with wordpad files and saved

Posted by "Andy Hedges (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-129?page=comments#action_12417694 ] 

Andy Hedges commented on NUTCH-129:
-----------------------------------

RTFEditKit requires a running X server on Solaris and Linux with Sun's VM. I originally wrote it using this library but it was rejected for this reason and so I rewrote it with the current implementation. If you upload a windows RTF with this problem I'll have a look into it.

> rtf-parser does not work when opened with wordpad files and saved
> -----------------------------------------------------------------
>
>          Key: NUTCH-129
>          URL: http://issues.apache.org/jira/browse/NUTCH-129
>      Project: Nutch
>         Type: Bug

>   Components: indexer
>  Environment: A sample rtf file modified under windows wordpad and then indexed by nutch
> windows
>     Reporter: raghavendra prabhu

>
> The above thing failed as wordpad seems to rewrite control information 
> Cant we use RTFEdit kit to do the parser and it will be not a LGPL issue also

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira