You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by "Marshall Schor (JIRA)" <de...@uima.apache.org> on 2011/03/24 16:26:05 UTC

[jira] [Reopened] (UIMA-2097) Treatment of URLs with blanks is incorrect in some places

     [ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor reopened UIMA-2097:
----------------------------------


> Treatment of URLs with blanks is incorrect in some places
> ---------------------------------------------------------
>
>                 Key: UIMA-2097
>                 URL: https://issues.apache.org/jira/browse/UIMA-2097
>             Project: UIMA
>          Issue Type: Bug
>          Components: Core Java Framework
>    Affects Versions: 2.3.1SDK
>            Reporter: Marshall Schor
>            Assignee: Marshall Schor
>            Priority: Minor
>             Fix For: 2.3.2SDK
>
>
> User reported Document Analyzer failing to write correct names in the output directory, when the input directory had a blank in it (on Windows).  Traced this to failing URL handling.
> Proper URL handling seems to need to observe these principles:
> 1) URL may have "blanks" and other "invalid-in-URI chars"
> 2) URL may have %20 style encoding of blanks and other need-to-be-escaped characters
> Creating files from these: need to use File(a-uri-form).
> Creating URIs from URLs - if the URL has unescaped blanks, etc., the form
> new URI(aUrl) fails; the aUrl.toURI() fails (i.e., throws an exception due to illegal chars for URI).
> To make the URI creation put in the escape chars if they're not there, you need to use the multi-arg form of new URI (see Javadocs).
> Fix the patches in UIMA-1879 and UIMA-1748.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira