You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/27 05:18:01 UTC

[jira] Created: (NUTCH-705) parse-rtf plugin

parse-rtf plugin
----------------

                 Key: NUTCH-705
                 URL: https://issues.apache.org/jira/browse/NUTCH-705
             Project: Nutch
          Issue Type: New Feature
          Components: fetcher
    Affects Versions: 1.0.0
            Reporter: Dmitry Lihachev
             Fix For: 1.0.0




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-705) parse-rtf plugin

Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680411#action_12680411 ] 

Sami Siren commented on NUTCH-705:
----------------------------------

I think we should start looking at Apache Tika for most (or all) of our parsers.

> parse-rtf plugin
> ----------------
>
>                 Key: NUTCH-705
>                 URL: https://issues.apache.org/jira/browse/NUTCH-705
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 1.0.0
>            Reporter: Dmitry Lihachev
>            Priority: Minor
>             Fix For: 1.1
>
>         Attachments: NUTCH-705.patch
>
>
> Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (NUTCH-705) parse-rtf plugin

Posted by "Julien Nioche (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche resolved NUTCH-705.
---------------------------------

    Resolution: Fixed

RTF parsing is now handled by the TikaPlugin (NUTCH-766). Please open an issue on Tika if  the original problem with non-ascii chars still occurs

> parse-rtf plugin
> ----------------
>
>                 Key: NUTCH-705
>                 URL: https://issues.apache.org/jira/browse/NUTCH-705
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 1.0.0
>            Reporter: Dmitry Lihachev
>            Priority: Minor
>             Fix For: 1.1
>
>         Attachments: NUTCH-705.patch
>
>
> Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-705) parse-rtf plugin

Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677508#action_12677508 ] 

Sami Siren commented on NUTCH-705:
----------------------------------

I think that the patch contains some lgpl code that we cannot commit into apache repository.

> parse-rtf plugin
> ----------------
>
>                 Key: NUTCH-705
>                 URL: https://issues.apache.org/jira/browse/NUTCH-705
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 1.0.0
>            Reporter: Dmitry Lihachev
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-705.patch
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (NUTCH-705) parse-rtf plugin

Posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Lihachev updated NUTCH-705:
----------------------------------

    Attachment: NUTCH-705.patch

> parse-rtf plugin
> ----------------
>
>                 Key: NUTCH-705
>                 URL: https://issues.apache.org/jira/browse/NUTCH-705
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 1.0.0
>            Reporter: Dmitry Lihachev
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-705.patch
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-705) parse-rtf plugin

Posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677242#action_12677242 ] 

Dmitry Lihachev commented on NUTCH-705:
---------------------------------------

This parser correctly handles non ascii input

> parse-rtf plugin
> ----------------
>
>                 Key: NUTCH-705
>                 URL: https://issues.apache.org/jira/browse/NUTCH-705
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 1.0.0
>            Reporter: Dmitry Lihachev
>             Fix For: 1.0.0
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (NUTCH-705) parse-rtf plugin

Posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrzej Bialecki  updated NUTCH-705:
------------------------------------

      Description: Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.
         Priority: Minor  (was: Major)
    Fix Version/s:     (was: 1.0.0)
                   1.1

> parse-rtf plugin
> ----------------
>
>                 Key: NUTCH-705
>                 URL: https://issues.apache.org/jira/browse/NUTCH-705
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 1.0.0
>            Reporter: Dmitry Lihachev
>            Priority: Minor
>             Fix For: 1.1
>
>         Attachments: NUTCH-705.patch
>
>
> Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-705) parse-rtf plugin

Posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677878#action_12677878 ] 

Dmitry Lihachev commented on NUTCH-705:
---------------------------------------

Yes, it looks a bit like a problem... How can we handle this?

> parse-rtf plugin
> ----------------
>
>                 Key: NUTCH-705
>                 URL: https://issues.apache.org/jira/browse/NUTCH-705
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 1.0.0
>            Reporter: Dmitry Lihachev
>            Priority: Minor
>             Fix For: 1.1
>
>         Attachments: NUTCH-705.patch
>
>
> Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.