You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@commons.apache.org by "Leo-Andreas Ervik (JIRA)" <ji...@apache.org> on 2019/01/29 15:13:00 UTC
[jira] [Updated] (VALIDATOR-453) UrlValidator.isValidPath does not
support unicode characters
[ https://issues.apache.org/jira/browse/VALIDATOR-453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Leo-Andreas Ervik updated VALIDATOR-453:
----------------------------------------
Description:
LinkedIn allows unicode characters such as æøå in their urls, which fails due to the PATH_REGEX using w instead of pL
Old:
{code:java}
private static final String PATH_REGEX = "^(/[-\\w:@&?=+,.!/~*'%$_;\\(\\)]*)?$";
{code}
New:
{code:java}
private static final String PATH_REGEX = "^(/[-\\pL:@&?=+,.!/~*'%$_;\\(\\)]*)?$";
{code}
was:
LinkedIn allows unicode characters such as æøå in their urls, which fails due to the PATH_REGEX using w instead of pL
{code:java}
private static final String PATH_REGEX = "^(/[-\\w:@&?=+,.!/~*'%$_;\\(\\)]*)?$";
{code}
> UrlValidator.isValidPath does not support unicode characters
> ------------------------------------------------------------
>
> Key: VALIDATOR-453
> URL: https://issues.apache.org/jira/browse/VALIDATOR-453
> Project: Commons Validator
> Issue Type: Bug
> Components: Routines
> Affects Versions: 1.6
> Reporter: Leo-Andreas Ervik
> Priority: Major
> Labels: Unicode, UrlValidator
>
> LinkedIn allows unicode characters such as æøå in their urls, which fails due to the PATH_REGEX using w instead of pL
> Old:
> {code:java}
> private static final String PATH_REGEX = "^(/[-\\w:@&?=+,.!/~*'%$_;\\(\\)]*)?$";
> {code}
> New:
> {code:java}
> private static final String PATH_REGEX = "^(/[-\\pL:@&?=+,.!/~*'%$_;\\(\\)]*)?$";
> {code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)