Extracting XML from html file

I am trying to extract data from an HTML file in XML format. The HTML
contains JavaScript, and I only want a small part of the file to be
extracted. Can anyone tell me where I can get C source code or an .exe file
that does this? Any help that points me in the right direction would be
appreciated. Thanks in advance. Below is the segment of code I am concerned with:

// -->
</SCRIPT>

<TABLE border=0 CELLSPACING=1 CELLPADDING=2 width="100%">
<tr BGCOLOR="#D4D4D4">
<td ALIGN=CENTER><font face=Verdana class=sm><b>Price</b></font></td>
<td ALIGN=CENTER><font face=Verdana class=sm><b>Change (%)</b></font></td>

<SCRIPT>
<!--
if (isopen==1) {
d("<td ALIGN=CENTER><font face=Verdana class=sm><b>Bid-Ask</b></font></td>");
} else {
d("<td ALIGN=CENTER><font face=Verdana class=sm><b>Bid-Ask (at Close)</b></font></td>");
}

// -->
</SCRIPT>

<td ALIGN=CENTER><font face=Verdana class=sm><b>Currency</b></font></td>
<td ALIGN=CENTER><font face=Verdana class=sm><b>Volume</b></font></td>
<td ALIGN=CENTER><font face=Verdana class=sm><b>Trades</b></font></td>

<SCRIPT>
<!--
if (isopen==1) {
d("<td ALIGN=CENTER><font face=Verdana class=sm><b>Close</b></font></td>");
} else {
d("<td ALIGN=CENTER><font face=Verdana class=sm><b>Prev. Close</b></font></td>");
}
// -->
</SCRIPT>

<td ALIGN=CENTER><font face=Verdana class=sm><b>High</b></font></td>
<td ALIGN=CENTER><font face=Verdana class=sm><b>Low</b></font></td></tr>

<tr BGCOLOR="#f0f0f0">
<td ALIGN=CENTER><FONT class=down>

<SCRIPT>
<!--
d(nmp);
// -->
</SCRIPT>

</FONT></td>
<td ALIGN=CENTER><FONT class=down><NOBR>- 0.5 (0.42)</NOBR></FONT></td>
<td ALIGN=CENTER><font class=ystrip>117.5 - 117.75</font></td>
<td ALIGN=CENTER><font class=data>GBX</FONT></td>
<td ALIGN=CENTER><font class=data>95716425</font></td>
<td ALIGN=CENTER><font class=data>1785</font></td>
<td ALIGN=CENTER><font class=data>118</font></td>
<td ALIGN=CENTER><font class=data>119</font></td>
<td ALIGN=CENTER><font class=data>117.25</font></td></tr>
</TABLE>

Nov 12 '05 #1
"gee57" <ge***@discussions.microsoft.com> wrote in message news:A7**********************************@microsoft.com...
I am trying to extract data from an HTML file in XML format.
: :
<TABLE border=0 CELLSPACING=1 CELLPADDING=2 width="100%">


What portion of the markup you included is in XML format? XML
syntax requires that attribute values be delimited by single or double
quotes, and XHTML requires element names to be lowercase.
Derek Harmon
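
The point about quoting can be checked with any XML parser. The sketch below is only an illustration, not something from the original posts: it uses Python's standard `xml.etree.ElementTree` rather than the C the poster asked about, and the helper name `is_well_formed` is made up for this example. It feeds the parser the opening tag from the post (closed off so it forms a complete element) and then a quoted, lowercase version of the same tag.

```python
import xml.etree.ElementTree as ET

# The tag as posted (unquoted attribute values), closed so it is a full element.
unquoted = '<TABLE border=0 CELLSPACING=1 CELLPADDING=2 width="100%"></TABLE>'
# The same tag with quoted values and lowercase names, as XHTML would want it.
quoted = '<table border="0" cellspacing="1" cellpadding="2" width="100%"></table>'


def is_well_formed(markup: str) -> bool:
    """Return True if the XML parser accepts the markup, False otherwise."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


print(is_well_formed(unquoted))  # the parser rejects unquoted attribute values
print(is_well_formed(quoted))
```

So the page cannot be loaded with an XML parser as it stands; it has to be read as HTML first, or cleaned up into XHTML.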
Nov 12 '05 #2
The source is the part of the HTML file that I am concerned with. I want to pick out
the table data and some of the variable data. The source is available at
http://www.axlquotes.com/axl-dlls/pu...uote&page2=VOD
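
One possible way to pick out the table data is sketched below. It is a rough illustration under several assumptions: it is written in Python instead of C, it relies on the standard library's `html.parser` (which tolerates unquoted attributes and mixed-case tags), the sample is an abridged copy of the markup posted above, and the class name `TableCells`, the function `table_to_xml`, and the output element names `table`, `row`, and `cell` are all invented for this example.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Abridged from the markup in the first post.
SAMPLE = """
<TABLE border=0 CELLSPACING=1 CELLPADDING=2 width="100%">
<tr BGCOLOR="#D4D4D4">
<td ALIGN=CENTER><font face=Verdana class=sm><b>Price</b></font></td>
<td ALIGN=CENTER><font face=Verdana class=sm><b>Currency</b></font></td>
<td ALIGN=CENTER><font face=Verdana class=sm><b>Volume</b></font></td></tr>
<tr BGCOLOR="#f0f0f0">
<td ALIGN=CENTER><FONT class=down>
<SCRIPT>
<!--
d(nmp);
// -->
</SCRIPT>
</FONT></td>
<td ALIGN=CENTER><font class=data>GBX</FONT></td>
<td ALIGN=CENTER><font class=data>95716425</font></td></tr>
</TABLE>
"""


class TableCells(HTMLParser):
    """Collect the text of every <td>, grouped by <tr>, skipping script bodies."""

    def __init__(self):
        super().__init__()
        self.rows = []          # one list of cell strings per <tr>
        self.in_cell = False
        self.in_script = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so <TD> and <td> arrive the same.
        if tag == "script":
            self.in_script = True
        elif tag == "tr":
            self.rows.append([])
        elif tag == "td" and self.rows:
            self.rows[-1].append("")
            self.in_cell = True

    def handle_endtag(self, tag):
        if tag == "script":
            self.in_script = False
        elif tag in ("td", "tr"):
            self.in_cell = False

    def handle_data(self, data):
        # Script source is delivered as data too, so it is filtered out here.
        if self.in_cell and not self.in_script:
            self.rows[-1][-1] += data


def table_to_xml(html: str) -> str:
    """Return the table cells as a small XML document (element names are made up)."""
    parser = TableCells()
    parser.feed(html)
    parser.close()
    root = ET.Element("table")
    for cells in parser.rows:
        row = ET.SubElement(root, "row")
        for text in cells:
            ET.SubElement(row, "cell").text = text.strip()
    return ET.tostring(root, encoding="unicode")


print(table_to_xml(SAMPLE))
```

Cells whose content is written by script, such as the one that calls `d(nmp)`, come out empty with this approach, because the value only exists once the JavaScript runs. Getting that "variable data" would mean reading it from wherever the page assigns `nmp`, which is not shown in the excerpt.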

Nov 12 '05 #3
