Posted to issues@lucene.apache.org by "Mark Harwood (Jira)" <ji...@apache.org> on 2020/05/19 17:41:00 UTC

[jira] [Resolved] (LUCENE-9371) Make RegExp internal state more visible to support more rendering formats

     [ https://issues.apache.org/jira/browse/LUCENE-9371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mark Harwood resolved LUCENE-9371.
----------------------------------
    Resolution: Fixed

Fixed in master and backported to branch_8x in https://github.com/apache/lucene-solr/commit/9d6c738ffce0c3164691f161ba8b92a615b1e062

> Make RegExp internal state more visible to support more rendering formats
> -------------------------------------------------------------------------
>
>                 Key: LUCENE-9371
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9371
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/search
>            Reporter: Mark Harwood
>            Assignee: Mark Harwood
>            Priority: Minor
>
> This is a proposal to open up read-only access to the internal state of RegExp objects.
> The RegExp parser provides a useful parsed object model for regular expressions. Today it offers three rendering functions:
>  1) To Automaton (for query execution)
>  2) To string (for machine-readable regular expressions)
>  3) To StringTree (for debug purposes)
> At least two other rendering functions would be useful:
>  a) To "Explain" format (like the plain-English descriptions used in [regex debugging tools|https://regex101.com/r/2DUzac/1])
>  b) To Query (queries used to accelerate regex searches by providing an approximation of the search terms and [hitting an ngram index|https://github.com/wikimedia/search-extra/blob/master/docs/source_regex.md])
> To support these and other renderings/transformations it would be useful to open read-only access to the fields held in RegExp objects, either by making them public final fields or by offering getter methods. External code could then implement new transformations itself, which would free the RegExp class from having to support every possible one.
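
As an illustration of the idea in the description above, here is a minimal sketch of how read-only fields let code outside the parser add an "Explain"-style rendering. It does not use Lucene's RegExp class: the Node type, its Kind values, its field names and the explain() method are all hypothetical stand-ins invented for this example, and the wording of the output is arbitrary.

```java
// Hypothetical stand-in for a parsed regex tree; this is NOT Lucene's RegExp
// class, and none of these names are taken from the Lucene API.
final class RegExpExplainSketch {

    // Node kinds chosen for illustration only.
    enum Kind { STRING, CONCATENATION, UNION, REPEAT }

    // Immutable node: every field is public and final, so callers can read
    // the parsed structure but cannot modify it.
    static final class Node {
        public final Kind kind;
        public final Node left;   // first operand, or null for STRING nodes
        public final Node right;  // second operand, or null if unused
        public final String text; // literal text for STRING nodes, else null

        private Node(Kind kind, Node left, Node right, String text) {
            this.kind = kind;
            this.left = left;
            this.right = right;
            this.text = text;
        }

        static Node string(String s)        { return new Node(Kind.STRING, null, null, s); }
        static Node concat(Node a, Node b)  { return new Node(Kind.CONCATENATION, a, b, null); }
        static Node union(Node a, Node b)   { return new Node(Kind.UNION, a, b, null); }
        static Node repeat(Node a)          { return new Node(Kind.REPEAT, a, null, null); }
    }

    // A renderer that lives outside the node class and relies only on
    // read access to the node's fields.
    static String explain(Node n) {
        switch (n.kind) {
            case STRING:
                return "the literal \"" + n.text + "\"";
            case CONCATENATION:
                return explain(n.left) + " followed by " + explain(n.right);
            case UNION:
                return "either " + explain(n.left) + " or " + explain(n.right);
            case REPEAT:
                return "zero or more of (" + explain(n.left) + ")";
            default:
                throw new IllegalStateException("unknown kind: " + n.kind);
        }
    }

    public static void main(String[] args) {
        // Hand-built tree modelling the expression ab(c|d)*
        Node tree = Node.concat(
                Node.string("ab"),
                Node.repeat(Node.union(Node.string("c"), Node.string("d"))));
        System.out.println(explain(tree));
    }
}
```

Because explain() only reads fields, a second renderer (for example one that collects literal fragments for an ngram lookup) could be added the same way without touching the node class.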