You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/27 05:18:01 UTC
[jira] Created: (NUTCH-705) parse-rtf plugin
parse-rtf plugin
----------------
Key: NUTCH-705
URL: https://issues.apache.org/jira/browse/NUTCH-705
Project: Nutch
Issue Type: New Feature
Components: fetcher
Affects Versions: 1.0.0
Reporter: Dmitry Lihachev
Fix For: 1.0.0
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-705) parse-rtf plugin
Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680411#action_12680411 ]
Sami Siren commented on NUTCH-705:
----------------------------------
I think we should start looking at Apache Tika for most (or all) of our parsers.
> parse-rtf plugin
> ----------------
>
> Key: NUTCH-705
> URL: https://issues.apache.org/jira/browse/NUTCH-705
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Affects Versions: 1.0.0
> Reporter: Dmitry Lihachev
> Priority: Minor
> Fix For: 1.1
>
> Attachments: NUTCH-705.patch
>
>
> Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Resolved: (NUTCH-705) parse-rtf plugin
Posted by "Julien Nioche (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Julien Nioche resolved NUTCH-705.
---------------------------------
Resolution: Fixed
RTF parsing is now handled by the TikaPlugin (NUTCH-766). Please open an issue on Tika if the original problem with non-ascii chars still occurs
> parse-rtf plugin
> ----------------
>
> Key: NUTCH-705
> URL: https://issues.apache.org/jira/browse/NUTCH-705
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Affects Versions: 1.0.0
> Reporter: Dmitry Lihachev
> Priority: Minor
> Fix For: 1.1
>
> Attachments: NUTCH-705.patch
>
>
> Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-705) parse-rtf plugin
Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677508#action_12677508 ]
Sami Siren commented on NUTCH-705:
----------------------------------
I think that the patch contains some lgpl code that we cannot commit into apache repository.
> parse-rtf plugin
> ----------------
>
> Key: NUTCH-705
> URL: https://issues.apache.org/jira/browse/NUTCH-705
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Affects Versions: 1.0.0
> Reporter: Dmitry Lihachev
> Fix For: 1.0.0
>
> Attachments: NUTCH-705.patch
>
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (NUTCH-705) parse-rtf plugin
Posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dmitry Lihachev updated NUTCH-705:
----------------------------------
Attachment: NUTCH-705.patch
> parse-rtf plugin
> ----------------
>
> Key: NUTCH-705
> URL: https://issues.apache.org/jira/browse/NUTCH-705
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Affects Versions: 1.0.0
> Reporter: Dmitry Lihachev
> Fix For: 1.0.0
>
> Attachments: NUTCH-705.patch
>
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-705) parse-rtf plugin
Posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677242#action_12677242 ]
Dmitry Lihachev commented on NUTCH-705:
---------------------------------------
This parser correctly handles non ascii input
> parse-rtf plugin
> ----------------
>
> Key: NUTCH-705
> URL: https://issues.apache.org/jira/browse/NUTCH-705
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Affects Versions: 1.0.0
> Reporter: Dmitry Lihachev
> Fix For: 1.0.0
>
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (NUTCH-705) parse-rtf plugin
Posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andrzej Bialecki updated NUTCH-705:
------------------------------------
Description: Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.
Priority: Minor (was: Major)
Fix Version/s: (was: 1.0.0)
1.1
> parse-rtf plugin
> ----------------
>
> Key: NUTCH-705
> URL: https://issues.apache.org/jira/browse/NUTCH-705
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Affects Versions: 1.0.0
> Reporter: Dmitry Lihachev
> Priority: Minor
> Fix For: 1.1
>
> Attachments: NUTCH-705.patch
>
>
> Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-705) parse-rtf plugin
Posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677878#action_12677878 ]
Dmitry Lihachev commented on NUTCH-705:
---------------------------------------
Yes, it looks a bit like a problem... How can we handle this?
> parse-rtf plugin
> ----------------
>
> Key: NUTCH-705
> URL: https://issues.apache.org/jira/browse/NUTCH-705
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Affects Versions: 1.0.0
> Reporter: Dmitry Lihachev
> Priority: Minor
> Fix For: 1.1
>
> Attachments: NUTCH-705.patch
>
>
> Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.