Posted to dev@manifoldcf.apache.org by "Karl Wright (JIRA)" <ji...@apache.org> on 2019/02/19 01:07:00 UTC
[jira] [Commented] (CONNECTORS-1584) regex documentation
[ https://issues.apache.org/jira/browse/CONNECTORS-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16771462#comment-16771462 ]
Karl Wright commented on CONNECTORS-1584:
-----------------------------------------
The mailing list is users@manifoldcf.apache.org.
The regular expressions are standard Java regular expressions, as documented in the Javadoc for java.util.regex.Pattern. You can also experiment with regular expressions in a Java applet online at: https://www.cis.upenn.edu/~matuszek/General/RegexTester/regex-tester.html
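Since these are Java regular expressions, the embedded flag (?i) should make a pattern case-insensitive, so one line can cover .pdf, .PDF, .Pdf and so on. The following is a minimal sketch for trying this outside ManifoldCF. The class and method names are hypothetical, and it assumes the pattern only needs to be found somewhere in the URL (find() semantics); how the Web connector actually applies the pattern is not stated here.

```java
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of ManifoldCF.
class PdfExclusionSketch {
    // (?i) enables case-insensitive matching for the whole pattern.
    // \\. matches a literal dot (an unescaped . matches any character).
    // $ anchors the match at the end of the URL.
    static final Pattern PDF_URL = Pattern.compile("(?i)\\.pdf$");

    // Assumption: "found anywhere in the URL" semantics, i.e. find() rather than matches().
    static boolean isPdfUrl(String url) {
        return PDF_URL.matcher(url).find();
    }

    public static void main(String[] args) {
        String[] urls = {
            "website.com/document/path/this.pdf",
            "website.com/document/path/other.PDF",
            "website.com/pdfs/index.html"
        };
        for (String url : urls) {
            System.out.println(url + " -> " + isPdfUrl(url));
        }
    }
}
```

In a Java string literal the backslash is doubled; typed into a text field, the same pattern would be (?i)\.pdf$ with a single backslash.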
> regex documentation
> -------------------
>
> Key: CONNECTORS-1584
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1584
> Project: ManifoldCF
> Issue Type: Improvement
> Components: Web connector
> Affects Versions: ManifoldCF 2.12
> Reporter: Tim Steenbeke
> Priority: Minor
>
> What type of regexes do ManifoldCF's include and exclude rules support, and what regex support is there in general?
> At the moment I'm using a web repository connection and an Elastic output connection.
> I'm trying to exclude URLs that link to documents.
> e.g. website.com/document/path/this.pdf and website.com/document/path/other.PDF
> The issue I'm having is that the regex I have found so far is not case-insensitive, so I have to add a new line for every possible case.
> e.g.:
> {code:java}
> .*.pdf$ and .*.PDF$ and .*.Pdf and ... .{code}
> Is it possible to add documentation on what type of regex can be used, or maybe a tool to test a regex and see whether it is supported by ManifoldCF?
> I tried mailing this question to [user@manifoldcf.apache.org|mailto:user@manifoldcf.apache.org] but this mail address returns a failure notice.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)