Posted to dev@uima.apache.org by "Marshall Schor (JIRA)" <de...@uima.apache.org> on 2011/03/22 20:26:06 UTC

[jira] [Created] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Treatment of URLs with blanks is incorrect in some places
---------------------------------------------------------

                 Key: UIMA-2097
                 URL: https://issues.apache.org/jira/browse/UIMA-2097
             Project: UIMA
          Issue Type: Bug
          Components: Core Java Framework
    Affects Versions: 2.3.1SDK
            Reporter: Marshall Schor
            Assignee: Marshall Schor
            Priority: Minor
             Fix For: 2.3.2SDK


A user reported that the Document Analyzer fails to write correct file names to the output directory when the input directory name contains a blank (on Windows). This was traced to faulty URL handling.

Proper URL handling appears to require allowing for both of these cases:

1) The URL may contain blanks and other characters that are invalid in a URI.
2) The URL may already use %20-style encoding for blanks and other characters that need to be escaped.

Creating a File from such a URL: use the File(URI) constructor, i.e. new File(aUri).

Creating a URI from a URL: if the URL contains unescaped blanks or other illegal characters, both
new URI(aUrl.toString()) and aUrl.toURI() fail (i.e., they throw URISyntaxException because of characters that are illegal in a URI).

To have the URI creation insert the escape characters when they are missing, use one of the multi-argument forms of new URI (see the Javadocs).
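
A minimal sketch of the approach described above, not the actual UIMA fix. The class and method names (UrlBlankSketch, toQuotedUri) are hypothetical. It first tries aUrl.toURI(), which succeeds when the URL is already escaped, and falls back to the seven-argument URI constructor only when that fails, because the multi-argument constructors always quote '%' and would otherwise turn %20 into %2520.

```java
import java.io.File;
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;

// Hypothetical helper class, for illustration only.
class UrlBlankSketch {

    /**
     * Converts a URL to a URI, quoting blanks and other illegal
     * characters when the URL is not already escaped.
     */
    static URI toQuotedUri(URL aUrl) throws URISyntaxException {
        try {
            // Succeeds when the URL is already a legal URI (e.g. uses %20).
            return aUrl.toURI();
        } catch (URISyntaxException e) {
            // The multi-argument constructor quotes illegal characters.
            // An empty host (typical for file: URLs) is passed as null
            // so that no authority component is generated.
            String host = aUrl.getHost();
            if (host != null && host.isEmpty()) {
                host = null;
            }
            return new URI(aUrl.getProtocol(), aUrl.getUserInfo(), host,
                    aUrl.getPort(), aUrl.getPath(), aUrl.getQuery(),
                    aUrl.getRef());
        }
    }

    /** Creates a File from a file: URL by going through the URI form. */
    static File toFile(URL aUrl) throws URISyntaxException {
        return new File(toQuotedUri(aUrl));
    }
}
```

For example, UrlBlankSketch.toQuotedUri(new URL("file:/tmp/my dir/a.txt")) yields the URI file:/tmp/my%20dir/a.txt, and an already-escaped URL passes through unchanged. A URL that mixes escaped and unescaped characters would still be double-escaped by the fallback, so this sketch does not cover that case.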

Fix the patches in UIMA-1879 and UIMA-1748.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Reopened] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor reopened UIMA-2097:
----------------------------------




[jira] [Updated] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor updated UIMA-2097:
---------------------------------

    Fix Version/s: 2.3.2AS
                   2.3.1Addons



[jira] [Reopened] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor reopened UIMA-2097:
----------------------------------


Reopening to fix more instances of URI.create().



[jira] [Closed] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor closed UIMA-2097.
--------------------------------




[jira] [Closed] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor closed UIMA-2097.
--------------------------------

    Resolution: Fixed

After 2.3.2SDK is released, the copies of the URI quote method can be removed and the common one in the base SDK reused. For now, the method is duplicated.



[jira] [Reopened] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Tommaso Teofili (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tommaso Teofili reopened UIMA-2097:
-----------------------------------


Reopening to fix the handling of file:// URLs containing blanks in Solrcas.



[jira] [Resolved] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor resolved UIMA-2097.
----------------------------------

    Resolution: Fixed



[jira] [Closed] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Marshall Schor (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor closed UIMA-2097.
--------------------------------

    Resolution: Fixed



[jira] [Resolved] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

Posted by "Tommaso Teofili (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tommaso Teofili resolved UIMA-2097.
-----------------------------------

    Resolution: Fixed

This is now resolved for Solrcas.

