You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@jmeter.apache.org by "PAULO LOURENCO (RE-ESI-Workflow)" <pa...@esi.pt> on 2008/12/18 15:51:28 UTC

regular expression extractor

Greetings,

 

I know this is going to look like a basic question, which I apologize
for in advance, but I'm having problems with a simple regular
expression.

Basically I want to extract a number that comes after a word and that is
followed by a space and something else. Ex:

Word 123 - more stuff

>From the previous sentence, what I want o extract is the 123, regardless
of what comes after ('-', '.', etc) knowing that the number comes always
after a word.

 

Thanks in advance!

Re: regular expression extractor

Posted by Serpent_Guard <bl...@gmail.com>.

Whoops, that's '[\w^\d]+'.
-- 
View this message in context: http://www.nabble.com/regular-expression-extractor-tp21074258p21086088.html
Sent from the JMeter - User mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: jmeter-user-help@jakarta.apache.org

Re: regular expression extractor

Posted by sebb <se...@gmail.com>.

On 22/12/2008, PAULO LOURENCO (RE-ESI-Workflow) <pa...@esi.pt> wrote:
> Greetings!
>
>  Let me just say that I added the \s to the end of your proposed solution:
>
> [a-zA-Z]{min,max}(\d+)
>
> To look lik:
>  [a-zA-Z]{min,max}(\d+)\s
>
>  To ensure the number is followed by a white space and everything is looking good!!!

If you refer back to my posting, you'll see that:

'[a-zA-Z]{min,max}(\d+) '

already has a space after the number...

Your original posting said the word was followed by a space, rather
than white space (which includes tab etc), otherwise I would have used
\s.

>  Thanks!
>
>
>  -----Original Message-----
>  From: sebb [mailto:sebbaz@gmail.com]
>  Sent: sábado, 20 de Dezembro de 2008 1:47
>  To: JMeter Users List
>  Subject: Re: regular expression extractor
>
>  On 19/12/2008, Serpent_Guard <bl...@gmail.com> wrote:
>  >
>  >  Whoops, that's '[\w^\d]+'.
>
>  [^\d] means all but 0-9 but [\w^\d] means \w, ^, or \d so it will
>  match more than is required.
>  The ^ only negates a class if present at the start.
>
>  This can be seen by using the demo at:
>
>  http://jakarta.apache.org/oro/demo.html
>
>  For example, try the input 'abcd12^3xyz' - '[\w^\d]+' matches in its entirety.
>
>  The original poster wrote:
>
>  > Basically I want to extract a number that comes after a word and that is
>  > followed by a space and something else. Ex:
>  >
>  > Word 123 - more stuff
>
>  If by a word you mean only alphabetic characters, then
>
>  '[a-zA-Z](\d+) '
>
>  (without the ')
>
>  should extract the digits following the last letter of a "word".
>
>  Or, if you know the min and max lengths of a word, you could use:
>
>  '[a-zA-Z]{min,max}(\d+) '
>
>
>  >  --
>  >
>  > View this message in context: http://www.nabble.com/regular-expression-extractor-tp21074258p21086104.html
>  >
>  > Sent from the JMeter - User mailing list archive at Nabble.com.
>  >
>  >
>  >  ---------------------------------------------------------------------
>  >  To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
>  >  For additional commands, e-mail: jmeter-user-help@jakarta.apache.org
>  >
>  >
>
>  ---------------------------------------------------------------------
>  To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
>  For additional commands, e-mail: jmeter-user-help@jakarta.apache.org
>
>
>  ---------------------------------------------------------------------
>  To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
>  For additional commands, e-mail: jmeter-user-help@jakarta.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: jmeter-user-help@jakarta.apache.org

RE: regular expression extractor

Posted by "PAULO LOURENCO (RE-ESI-Workflow)" <pa...@esi.pt>.

Greetings!

Let me just say that I added the \s to the end of your proposed solution:
[a-zA-Z]{min,max}(\d+) 
To look lik:
[a-zA-Z]{min,max}(\d+)\s

To ensure the number is followed by a white space and everything is looking good!!!

Thanks!

-----Original Message-----
From: sebb [mailto:sebbaz@gmail.com] 
Sent: sábado, 20 de Dezembro de 2008 1:47
To: JMeter Users List
Subject: Re: regular expression extractor

On 19/12/2008, Serpent_Guard <bl...@gmail.com> wrote:
>
>  Whoops, that's '[\w^\d]+'.

[^\d] means all but 0-9 but [\w^\d] means \w, ^, or \d so it will
match more than is required.
The ^ only negates a class if present at the start.

This can be seen by using the demo at:

http://jakarta.apache.org/oro/demo.html

For example, try the input 'abcd12^3xyz' - '[\w^\d]+' matches in its entirety.

The original poster wrote:

> Basically I want to extract a number that comes after a word and that is
> followed by a space and something else. Ex:
>
> Word 123 - more stuff

If by a word you mean only alphabetic characters, then

'[a-zA-Z](\d+) '

(without the ')

should extract the digits following the last letter of a "word".

Or, if you know the min and max lengths of a word, you could use:

'[a-zA-Z]{min,max}(\d+) '


>  --
>
> View this message in context: http://www.nabble.com/regular-expression-extractor-tp21074258p21086104.html
>
> Sent from the JMeter - User mailing list archive at Nabble.com.
>
>
>  ---------------------------------------------------------------------
>  To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
>  For additional commands, e-mail: jmeter-user-help@jakarta.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: jmeter-user-help@jakarta.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: jmeter-user-help@jakarta.apache.org

Re: regular expression extractor

Posted by sebb <se...@gmail.com>.

On 19/12/2008, Serpent_Guard <bl...@gmail.com> wrote:
>
>  Whoops, that's '[\w^\d]+'.

[^\d] means all but 0-9 but [\w^\d] means \w, ^, or \d so it will
match more than is required.
The ^ only negates a class if present at the start.

This can be seen by using the demo at:

http://jakarta.apache.org/oro/demo.html

For example, try the input 'abcd12^3xyz' - '[\w^\d]+' matches in its entirety.

The original poster wrote:

> Basically I want to extract a number that comes after a word and that is
> followed by a space and something else. Ex:
>
> Word 123 - more stuff

If by a word you mean only alphabetic characters, then

'[a-zA-Z](\d+) '

(without the ')

should extract the digits following the last letter of a "word".

Or, if you know the min and max lengths of a word, you could use:

'[a-zA-Z]{min,max}(\d+) '


>  --
>
> View this message in context: http://www.nabble.com/regular-expression-extractor-tp21074258p21086104.html
>
> Sent from the JMeter - User mailing list archive at Nabble.com.
>
>
>  ---------------------------------------------------------------------
>  To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
>  For additional commands, e-mail: jmeter-user-help@jakarta.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: jmeter-user-help@jakarta.apache.org

Re: regular expression extractor

Posted by Serpent_Guard <bl...@gmail.com>.

Whoops, that's '[\w^\d]+'.
-- 
View this message in context: http://www.nabble.com/regular-expression-extractor-tp21074258p21086104.html
Sent from the JMeter - User mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: jmeter-user-help@jakarta.apache.org

Re: regular expression extractor

Posted by Serpent_Guard <bl...@gmail.com>.

I think you want '[\w+] (\d+) .+' (w/o quotes).

[\w+] - \w is a character class that matches any word character
(alphanumeric plus underscores).  '+' changes it from matching a single
character to matching one or more characters.

(\d+) - \d is a character class that matches digits.  '+' makes it match one
or more digits.

.+ - '.' matches any non-newline character.  '+' makes it match one or more
of those characters.

If you specifically don't want to allow digits in the first word, you can
change '[\w+]' to '[\w^\d]'

I'm no expert at regex, but I've been learning a lot about it lately and I'm
reasonably sure this'll do the trick.

PAULO LOURENCO (RE-ESI-Workflow) wrote:
> 
> Greetings,
> 
>  
> 
> I know this is going to look like a basic question, which I apologize
> for in advance, but I'm having problems with a simple regular
> expression.
> 
> Basically I want to extract a number that comes after a word and that is
> followed by a space and something else. Ex:
> 
> Word 123 - more stuff
> 
> From the previous sentence, what I want o extract is the 123, regardless
> of what comes after ('-', '.', etc) knowing that the number comes always
> after a word.
> 
>  
> 
> Thanks in advance!
> 
>  
> 
> 
> 

-- 
View this message in context: http://www.nabble.com/regular-expression-extractor-tp21074258p21086078.html
Sent from the JMeter - User mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: jmeter-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: jmeter-user-help@jakarta.apache.org