By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
424,694 Members | 2,121 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 424,694 IT Pros & Developers. It's quick & easy.

splitting string with a string

P: n/a
I have a html page that I retrived and stored in a string and I want
to split it based on <tdelements. I know only way you can split
using string.split is using characters. The other option is traverse
and split. Is there any other way you can split a string using string
token itself?
Thanks,
Jul 30 '08 #1
Share this Question
Share on Google+
10 Replies


P: n/a
On Jul 30, 3:24*pm, CSharper <cshar...@gmx.comwrote:
I have a html page that I retrived and stored in a string and I want
to split it based on <tdelements. I know only way you can split
using string.split is using characters. The other option is traverse
and split. Is there any other way you can split a string using string
token itself?
Use Regex.Split.

Jon
Jul 30 '08 #2

P: n/a
On Jul 30, 6:24*pm, CSharper <cshar...@gmx.comwrote:
I have a html page that I retrived and stored in a string and I want
to split it based on <tdelements. I know only way you can split
using string.split is using characters. The other option is traverse
and split. Is there any other way you can split a string using string
token itself?
Yes; use String.Split. It has an overload which takes String (not
char) delimiters:

public string[] Split(
string[] separator,
StringSplitOptions options
)

Jul 30 '08 #3

P: n/a
CSharper,

Have you taken a look at the RegEx class? Specifically, the Split
method on the RegEx class?
--
- Nicholas Paldino [.NET/C# MVP]
- mv*@spam.guard.caspershouse.com

"CSharper" <cs******@gmx.comwrote in message
news:57**********************************@r66g2000 hsg.googlegroups.com...
>I have a html page that I retrived and stored in a string and I want
to split it based on <tdelements. I know only way you can split
using string.split is using characters. The other option is traverse
and split. Is there any other way you can split a string using string
token itself?
Thanks,

Jul 30 '08 #4

P: n/a
On Jul 30, 9:27*am, "Jon Skeet [C# MVP]" <sk...@pobox.comwrote:
On Jul 30, 3:24*pm, CSharper <cshar...@gmx.comwrote:
I have a html page that I retrived and stored in a string and I want
to split it based on <tdelements. I know only way you can split
using string.split is using characters. The other option is traverse
and split. Is there any other way you can split a string using string
token itself?

Use Regex.Split.

Jon
Duh???

Thanks.
Jul 30 '08 #5

P: n/a
CSharper wrote:
I have a html page that I retrived and stored in a string and I want
to split it based on <tdelements. I know only way you can split
using string.split is using characters. The other option is traverse
and split. Is there any other way you can split a string using string
token itself?
Thanks,
As suggested, the Regex class also has a Split method, but you can do
better than that with a regular expression.

You can use the pattern "<td[^>]*>([\w\W]*?)</td>" with the Regex.Match
method to find the contents of all td elements in the string.

<td[^>]*matches the starting tag even if it has arguments
[^>] matches any character except >
* means zero or more matches
() catches the value
[\w\W] matches any character
*? makes a non-gready match, so that it ends at the first </td>, not the
last

Note: This doesn't work well if you have nested tables.

--
Göran Andersson
_____
http://www.guffa.com
Jul 30 '08 #6

P: n/a
On Jul 30, 3:55*pm, CSharper <cshar...@gmx.comwrote:
Use Regex.Split.

Duh???
Which part didn't you understand? In the RegEx class, there's a Split
method. Construct an appropriate regex, and call the Split method.

As Pavel mentioned, String also now contains an overload for
String.Split which takes an array of delimiter strings instead of
chars. It's "new" to 2.0, but hopefully that won't be an issue for
you.

Jon

Jul 30 '08 #7

P: n/a
haha, he was talking about himself I believe. As in "Duh, why didn't I
figure that out"
--
- Nicholas Paldino [.NET/C# MVP]
- mv*@spam.guard.caspershouse.com

"Jon Skeet [C# MVP]" <sk***@pobox.comwrote in message
news:5b**********************************@s50g2000 hsb.googlegroups.com...
On Jul 30, 3:55 pm, CSharper <cshar...@gmx.comwrote:
Use Regex.Split.

Duh???
Which part didn't you understand? In the RegEx class, there's a Split
method. Construct an appropriate regex, and call the Split method.

As Pavel mentioned, String also now contains an overload for
String.Split which takes an array of delimiter strings instead of
chars. It's "new" to 2.0, but hopefully that won't be an issue for
you.

Jon
Jul 30 '08 #8

P: n/a
Nicholas Paldino [.NET/C# MVP] <mv*@spam.guard.caspershouse.comwrote:
haha, he was talking about himself I believe. As in "Duh, why didn't I
figure that out"
Ah, that would explain it :)

--
Jon Skeet - <sk***@pobox.com>
Web site: http://www.pobox.com/~skeet
Blog: http://www.msmvps.com/jon.skeet
C# in Depth: http://csharpindepth.com
Jul 30 '08 #9

P: n/a
I actually used this functionality quite heavily recently, to narrow in on
an encoded url in a webpage source. I split the string after a "<td
id=\"...\">" element, or something similar, that occurred once and was
unique, and took the second part.
Then I took the first part of the split at "</td>".
Then I took the second part of "<a href=\"".
Then I took the first part of ">".

"Jon Skeet [C# MVP]" <sk***@pobox.comwrote in message
news:5b**********************************@s50g2000 hsb.googlegroups.com...
On Jul 30, 3:55 pm, CSharper <cshar...@gmx.comwrote:
Use Regex.Split.

Duh???
Which part didn't you understand? In the RegEx class, there's a Split
method. Construct an appropriate regex, and call the Split method.

As Pavel mentioned, String also now contains an overload for
String.Split which takes an array of delimiter strings instead of
chars. It's "new" to 2.0, but hopefully that won't be an issue for
you.

Jon
Jul 30 '08 #10

P: n/a
They may be using .NET 1.1, which doesn't have the string parameter
overloads.

"Pavel Minaev" <in****@gmail.comwrote in message
news:c9**********************************@34g2000h sf.googlegroups.com...
On Jul 30, 6:24 pm, CSharper <cshar...@gmx.comwrote:
I have a html page that I retrived and stored in a string and I want
to split it based on <tdelements. I know only way you can split
using string.split is using characters. The other option is traverse
and split. Is there any other way you can split a string using string
token itself?
Yes; use String.Split. It has an overload which takes String (not
char) delimiters:

public string[] Split(
string[] separator,
StringSplitOptions options
)
Jul 30 '08 #11

This discussion thread is closed

Replies have been disabled for this discussion.