Extract first word after specific word

Question

I'm having difficulty writing a Perl program to extract the word following a certain word.

For example:

Today i'm not  going anywhere except to office.

I want the word after anywhere, so the output should be except.

I have tried this:

my $words = "Today i'm not  going anywhere except to office.";
my $w_after = ( $words =~ /anywhere (\S+)/ );

but it seems this is wrong.

@ssr1012: One may also wait a day or two to see if a better answer appears — Borodin, Jan 16 at 15:28
@Borodin: The OP confirmed that Jim Garrison's answer helped, which is why I made the request here. – ssr1012 Jan 17 at 6:50

Jim Garrison · Answer 1 · 2017-01-16 06:11:30Z


Very close:

my ($w_after) = ($words =~ /anywhere\s+(\S+)/);
   ^        ^                       ^^^
   +--------+                        |
     Note 1                        Note 2

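Note 1: In list context, =~ returns the list of captured substrings; in scalar context it returns only a true/false success flag. The parentheses around $w_after make the assignment a list assignment, so the match is evaluated in list context.

Note 2: \s+ allows one or more whitespace characters after anywhere.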
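
To make the difference in Note 1 concrete, here is a minimal sketch that runs the same match in both contexts. The variable name $flag is only illustrative and does not come from the question.

```perl
use strict;
use warnings;

my $words = "Today i'm not  going anywhere except to office.";

# Scalar assignment: the match runs in scalar context and yields
# only a success flag (1 on success), not the captured text.
my $flag = ( $words =~ /anywhere\s+(\S+)/ );

# List assignment: the parentheses around $w_after put the match in
# list context, so it returns the captured substrings.
my ($w_after) = ( $words =~ /anywhere\s+(\S+)/ );

print "scalar context: $flag\n";      # prints "scalar context: 1"
print "list context:   $w_after\n";   # prints "list context:   except"
```

This is why the original attempt ended up with 1 in $w_after instead of the word.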


Thanks Jim..it helps!!! – Azizul Jan 16 at 6:27

@JimGarrison Can you explain the use of the outer ()? my ($w_after) = $words =~ /anywhere\s+(\S+)/; also gives the same result, so why are they needed? – mkHun Jan 16 at 7:13


@mkHun Those parentheses are only there for operator precedence. In this case =~ has higher precedence than =, which is why you get the same result without them. – Arunesh Singh Jan 16 at 7:36


Hynek -Pichi- Vychodil · Answer 2 · 2017-01-20 09:44:15Z

First, you have to put parentheses around the left-hand side of the = operator to force list context for the regexp match. See m// in the perlop documentation.[1] You can also put parentheses around the =~ binding expression to improve readability, but they are not necessary because =~ has fairly high precedence.

Use the word character class (\w, equivalent to the POSIX-style class [[:word:]]):

my ($w_after) = ($words =~ / \b anywhere \W+ (\w+) \b /x);

Note that I'm using the /x modifier, so whitespace in the regexp is ignored. The \b word boundaries anchor the regexp so that it matches whole words only.
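
As a rough sketch of how this pattern behaves, the snippet below runs it over a few inputs. Only the first sentence comes from the question; the other two are made-up samples to show punctuation after anywhere and a failed match.

```perl
use strict;
use warnings;

# Hypothetical sample sentences; only the first comes from the question.
my @samples = (
    "Today i'm not  going anywhere except to office.",
    "Not going anywhere; except to office.",
    "I could be anywhere",
);

my @results;
for my $text (@samples) {
    # A failed match returns an empty list, leaving $w_after undefined.
    my ($w_after) = $text =~ / \b anywhere \W+ (\w+) \b /x;
    push @results, $w_after;
    print defined $w_after ? "found: $w_after\n" : "no match\n";
}
```

Because \W+ accepts any run of non-word characters, the second sample is handled without special-casing the semicolon. The third has nothing after anywhere, so $w_after stays undefined and should be checked before use.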

[1]: I write my ($w_after) just for convenience. You can write my ($a, $b, $c, @rest) as the equivalent of (my $a, my $b, my $c, my @rest), but you can also control the scope of each variable individually, as in (my $a, our $UGLY_GLOBAL, local $_, @_).

Whilst your answer is correct, the thing the OP needs is the my ( $w_after ) so it does the assignment in a list context. I think it would be useful to spell that out. — Sobrique, Jan 16 at 10:40

khw · Answer 3 · 2017-01-24 02:19:22Z

In Perl v5.22 and later, you can use \b{wb} to get better results for natural language. The pattern could be

/anywhere\b{wb}.+?\b{wb}(.+?\b{wb})/

"wb" stands for word break, and it will account for words that have apostrophes in them, like "I'll", that plain \b doesn't.

.+?\b{wb}

matches the shortest non-empty sequence of characters that don't have a word break in them. The first one matches the span of spaces in your sentence; and the second one matches "except". It is enclosed in parentheses, so upon completion $1 contains "except".

\b{wb} is documented most fully in perlrebackslash.
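
A small sketch of the apostrophe case, assuming Perl 5.22 or later. The sentence is a made-up sample (not from the question), chosen so that the word after anywhere contains an apostrophe; it compares a plain \w+ capture with the \b{wb} pattern above.

```perl
use v5.22;       # \b{wb} needs Perl 5.22 or later; this also enables strict
use warnings;

# Hypothetical sentence chosen so the next word contains an apostrophe.
my $text = "If I go anywhere I'll tell you.";

# Plain \w+ stops at the apostrophe.
my ($plain) = $text =~ /anywhere\s+(\w+)/;

# \b{wb} treats "I'll" as a single word.
my ($wb) = $text =~ /anywhere\b{wb}.+?\b{wb}(.+?\b{wb})/;

say "\\w+ captured:    $plain";
say "\\b{wb} captured: $wb";
```

With this input the plain version should capture only I, while the \b{wb} version should capture I'll.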

ssr1012 · Answer 4 · 2017-01-16 06:16:20Z


This regex should match:

my ($expect) = ($words=~m/anywhere\s+([^\s]+)\s+/);

[^\s]+ matches the word between the two runs of whitespace.

Thanks.
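
One thing to keep in mind with this pattern: the trailing \s+ requires whitespace after the captured word. A minimal sketch, using a made-up input in which the wanted word is the last thing in the string:

```perl
use strict;
use warnings;

# Hypothetical input where the wanted word ends the string.
my $tail = "I am not going anywhere else";

# With the trailing \s+ there is no whitespace left to match, so the
# whole match fails and the variable stays undefined.
my ($with_trailing) = $tail =~ /anywhere\s+([^\s]+)\s+/;

# Without it, the word is captured.
my ($without_trailing) = $tail =~ /anywhere\s+([^\s]+)/;

print defined $with_trailing
    ? "with trailing \\s+: $with_trailing\n"
    : "with trailing \\s+: no match\n";
print "without trailing \\s+: $without_trailing\n";
```

For the sentence in the question both forms give except, since the word is followed by a space there.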


Thanks @ssr1012..it helps!!! – Azizul Jan 16 at 6:28


Selçuk Cihan · Answer 5 · 2017-01-16 08:33:20Z


If you want to also take into consideration the punctuation marks, like in:

my $words = "Today i'm not going anywhere; except to office.";

Then try this:

my ($w_after) = ($words =~ /anywhere[[:punct:]\s]+(\S+)/);
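
A short sketch of this approach on two inputs. The character class is written here as [[:punct:]\s], because inside a bracketed class | is a literal pipe rather than alternation, and [[:punct:]] already includes it. The second sentence is a made-up sample to show that (\S+) also picks up punctuation attached to the word.

```perl
use strict;
use warnings;

my $words = "Today i'm not going anywhere; except to office.";
my ($w_after) = $words =~ /anywhere[[:punct:]\s]+(\S+)/;
print "$w_after\n";   # prints "except"

# Hypothetical sample: \S+ keeps the comma that follows the word.
my $other = "Not going anywhere -- except, maybe, home.";
my ($w_other) = $other =~ /anywhere[[:punct:]\s]+(\S+)/;
print "$w_other\n";   # prints "except,"
```

If trailing punctuation is unwanted, capturing with (\w+) instead of (\S+) is one option.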
