Hello shapper,
On Nov 20, 2:33 am, Arne Vajhøj <a...@vajhoej.dkwrote:
>shapper wrote:
>>I have a text as follows:
"My email is someth...@something.xyz and I posted this @ 2 am"
I need to replace the @ by (AT) bu only the ones that are in email
addresses.
All other @ shouldn't be replaced.
I know how to replace all @ but I am having problems in replacing
only the @'s in the email addresses.
How can I do this?
Regex.Replace with a reasonable regex expression for valid email
addresses should give a reasonable correct replacement.
Arne
Hi,
I was trying that but I have two problems:
return Regex.Replace(text, "\w+([-+.]\w+)*@\w+([-.]\w+)*\.\w+([-.]\w+)
*", "(at)");
First I get a lot of errors on the pattern in each \ character:
Unrecognized escape sequence
The second problem is that this identifies all emails on my string but
I don't want to replace the emails but the @ in the emails.
There are two options here: Either capture the first and second part of the
email in a group and put them back in.
Second, match only the @, but use lookahead and lookbehind constructions
to make sure you only replace the @ in an emailaddress
So here we go:
capture the first and last parts and replace:
(?<name>\w+([-+.]\w+)+)@(?<domain>\w+([-.]\w+)*\.\w+([-.]\w+)*))
and replace that with:
${name}(at)${domain}
This should be quite easy to understand. Capture the parts you want to keep
in a named group (?<name>..) and put them back into the replacement pattern
${name}.
The second option using look arounds:
(?<=\w+([-+.]\w+)+)(at)(?>\w+([-.]\w+)*\.\w+([-.]\w+)*))
and replace that with:
(at)
This one is usually harder to understand. (?<=...) looks back in your string
and tries to find the pattern that is defined at the '...'. If it cannot
find this exact pattern the whole expression fails.
(?>...) essentially does the same, but looks ahead, instead of backwards.
The funny thing is, that even though you search for something in your regex,
it doesn't become part of the actual Match and therefore doesn't get replaced.
Now to touch on your issue with th escape sequences. Any special character
sequence in regex starts with a \. And any escape sequence in C# code starts
with a \ as well, so if you put a regex in a string you again have two options:
First, escape all your \'s in your regex by putting an additional \ in front
of it. eg. \w becomes \\w. Second option is to use verbatim strings in C#
by placing an @ in front of the string definition like this: @"\w...more
regex goes here".
Hope this helps
--
Jesse Houwing
jesse.houwing at sogeti.nl