Posted to dev@uima.apache.org by "Marshall Schor (JIRA)" <de...@uima.apache.org> on 2011/03/22 20:26:06 UTC
[jira] [Created] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Treatment of URLs with blanks is incorrect in some places
---------------------------------------------------------
Key: UIMA-2097
URL: https://issues.apache.org/jira/browse/UIMA-2097
Project: UIMA
Issue Type: Bug
Components: Core Java Framework
Affects Versions: 2.3.1SDK
Reporter: Marshall Schor
Assignee: Marshall Schor
Priority: Minor
Fix For: 2.3.2SDK
A user reported that the Document Analyzer fails to write correct names in the output directory when the input directory has a blank in its name (on Windows). This was traced to faulty URL handling.
Proper URL handling seems to need to observe these principles:
1) A URL may contain blanks and other characters that are invalid in a URI.
2) A URL may contain %20-style encoding of blanks and of other characters that need to be escaped.
Creating files from these: need to use the URI form, new File(aUri).
Creating URIs from URLs: if the URL has unescaped blanks, etc., both new URI(aUrl.toString()) and aUrl.toURI() fail (i.e., they throw an exception because of characters that are illegal in a URI).
To make the URI creation insert the escape characters when they are missing, use one of the multi-argument forms of new URI (see the Javadocs).
Fix the patches in UIMA-1879 and UIMA-1748.
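The behavior described above can be sketched as follows. This is only an illustration, not the code from the actual patch: the class name UrlBlankDemo, its helper methods, and the path "/tmp/my docs/a.txt" are made up for the example. It shows that toURI() rejects a URL with an unescaped blank, while the multi-argument URI constructor quotes the blank so that new File(uri) can be used.

```java
import java.io.File;
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;

public class UrlBlankDemo {

    // Parse a URL string. java.net.URL does not reject blanks in the path.
    // (new URL(String) is deprecated in recent JDKs but still available.)
    static URL parse(String spec) {
        try {
            return new URL(spec);
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    // True when the direct conversion URL.toURI() rejects the URL.
    static boolean toUriFails(String spec) {
        try {
            parse(spec).toURI();
            return false;
        } catch (URISyntaxException e) {
            return true;
        }
    }

    // Rebuild the URI from the URL's components. The multi-argument
    // constructor quotes characters that are illegal in a URI, such as blanks.
    static URI quotedUri(String spec) {
        URL url = parse(spec);
        try {
            return new URI(url.getProtocol(), url.getAuthority(), url.getPath(),
                           url.getQuery(), url.getRef());
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        String spec = "file:/tmp/my docs/a.txt"; // hypothetical path with a blank
        System.out.println("toURI() fails: " + toUriFails(spec));
        URI uri = quotedUri(spec);
        System.out.println("raw path:      " + uri.getRawPath());
        // File has a constructor that takes a file: URI and decodes the escapes.
        System.out.println("file name:     " + new File(uri).getName());
    }
}
```

Note that the multi-argument constructors always quote the percent character, so this form is only suitable when the URL is not already escaped (principle 2 needs separate handling).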
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Reopened] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marshall Schor reopened UIMA-2097:
----------------------------------
[jira] [Updated] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marshall Schor updated UIMA-2097:
---------------------------------
Fix Version/s: 2.3.2AS, 2.3.1Addons
[jira] [Reopened] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marshall Schor reopened UIMA-2097:
----------------------------------
Reopening to fix more uses of URI.create().
[jira] [Closed] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marshall Schor closed UIMA-2097.
--------------------------------
[jira] [Closed] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marshall Schor closed UIMA-2097.
--------------------------------
Resolution: Fixed
After 2.3.2SDK is released, the copies of the URI quote method can be removed, and the common one in the base SDK reused. But for now, it's duplicated.
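For readers wondering what such a quote method might look like, here is a minimal sketch. It is an assumption made for illustration and not the method in the UIMA SDK: the class name UriQuoteSketch and both quote methods are hypothetical. The idea is to accept the URL as is when it already forms a legal URI (for example, blanks written as %20), and to fall back to the multi-argument URI constructor only when it does not.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;

public class UriQuoteSketch {

    // Hypothetical helper, not the UIMA method: convert a URL to a URI
    // whether or not its blanks are already escaped.
    static URI quote(URL url) throws URISyntaxException {
        try {
            // Already a legal URI, e.g. blanks written as %20: keep it unchanged.
            return url.toURI();
        } catch (URISyntaxException e) {
            // Illegal characters present: let the multi-argument constructor
            // quote them.
            return new URI(url.getProtocol(), url.getAuthority(), url.getPath(),
                           url.getQuery(), url.getRef());
        }
    }

    // Convenience overload for strings; wraps the checked exceptions.
    static URI quote(String spec) {
        try {
            return quote(new URL(spec));
        } catch (MalformedURLException | URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        // Hypothetical paths: one unescaped, one already escaped.
        System.out.println(quote("file:/tmp/my docs/a.txt").getPath());
        System.out.println(quote("file:/tmp/my%20docs/a.txt").getPath());
    }
}
```

One limitation of this sketch: a URL that mixes %20 escapes with raw blanks takes the fallback branch, where the percent character itself is quoted, so the existing escapes would be encoded a second time.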
[jira] [Reopened] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Tommaso Teofili (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tommaso Teofili reopened UIMA-2097:
-----------------------------------
Reopening to fix file:// URLs that contain blanks in Solrcas.
[jira] [Resolved] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marshall Schor resolved UIMA-2097.
----------------------------------
Resolution: Fixed
[jira] [Closed] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marshall Schor closed UIMA-2097.
--------------------------------
Resolution: Fixed
[jira] [Resolved] (UIMA-2097) Treatment of URLs with blanks is
incorrect in some places
Posted by "Tommaso Teofili (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tommaso Teofili resolved UIMA-2097.
-----------------------------------
Resolution: Fixed
This is resolved for Solrcas.