473,789 Members | 2,671 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

a problem help me

hello everyone
Do you have any information how to generate a tool using .net which is
used to translate the web page contents to html format.

Plz reply me asap

Thanks in advance

Dhananjay

Nov 28 '06 #1
8 1577
Given that most web pages are *in* HTML (or a variant), it wouldn't
have a lot to do...

Can you clarify what you mean?

Marc

Nov 28 '06 #2

Marc Gravell wrote:
Given that most web pages are *in* HTML (or a variant), it wouldn't
have a lot to do...

Can you clarify what you mean?

Marc
hello marc
my problem is
first thing i have to import a client to a website(specifi ed website
,and there may may be more than one website) then i have to generate a
tool which has to convert web page contents to html format save this
html format to a database(sql server).
how to achieve this
could you plz help me to do this.

Reply me asap

Thanks
Dhananjay

Nov 28 '06 #3
You don't need to "convert" anything here. The website is probably in
HTML already. If it isn't you won't be able to do it. You may be able
to use WebClient here to simply download the text (see MSDN2) - but
even then it won't be a usable static copy, as all images, scripts,
cookies, links etc will probably be dead if you *just* use the HTML in
isolation of the other stuff.

You could try and use WebBrowser to export as an mht; never tried it -
might work. Alternatively if it is for later reference you might try
using tools like HTMLDOC to create a standalone PDF of the page (hence
including the images but not scripts).

Alternatively you can find lots of crawlers on google to do this for
you.

It really depends on what *exactly* you need. And unless this is
somehow a C# issue you may find other groups more useful.

Marc

Nov 28 '06 #4

Marc Gravell wrote:
You don't need to "convert" anything here. The website is probably in
HTML already. If it isn't you won't be able to do it. You may be able
to use WebClient here to simply download the text (see MSDN2) - but
even then it won't be a usable static copy, as all images, scripts,
cookies, links etc will probably be dead if you *just* use the HTML in
isolation of the other stuff.

You could try and use WebBrowser to export as an mht; never tried it -
might work. Alternatively if it is for later reference you might try
using tools like HTMLDOC to create a standalone PDF of the page (hence
including the images but not scripts).

Alternatively you can find lots of crawlers on google to do this for
you.

It really depends on what *exactly* you need. And unless this is
somehow a C# issue you may find other groups more useful.

Marc

hello marc
anyway thanks for spending time on me.
what you have suggested i tried it but its not working, its saying
namespace problem.i think this feature is different. i am using vs2005
C#
will you tell me one thing either my problem will be solved by creating
windows appln or web appln.first i have to import client to a website
and then generate a tool to convert webpage contents to html format
save it to sql server databse.
first i was doing with vb.net i have generated a tool which converts
webpage contents to html format , but same thing its not working
inC#.net.

plz reply me
Thanks
waiting for your reply asap
Dhananjay

Nov 28 '06 #5
Hi Dhananjay,

Ok Working on the HttpRequest, and Response objects, These are very
help full for u. Simple give a Request to the specifc URL by using the
HTTPRequest or HttpWebrequest, and then save the content stream of the
response in to the U R DataBase. this part will be simple get the page
u put the request, But u r aim is to get the whole Website, so search
the other links in the Main response stream and form a URL and process
same way...

On Nov 28, 10:45 am, "Dhananjay" <dhananjay...@y ahoo.co.inwrote :
hello everyone
Do you have any information how to generate a tool using .net which is
used to translate the web page contents to html format.

Plz reply me asap

Thanks in advance

Dhananjay
Nov 28 '06 #6
Hello Dhananjay,

First off, your English is vague. This leads to some misunderstandin g.
More on that below.

Secondly, it is not clear what BUSINESS PROBLEM you are trying to solve.
Before you jump to "what is wrong with my solution," please help us to
understand what problem your code is trying solve. There may be a better
way than writing code!

Thirdly, if you have written code, and it is not working, please post it.
That provides a great deal of information for us to help you.

Now, back to your request.

You said:
>first i have to import client to a website
and then generate a tool to convert webpage contents to html format
save it to sql server databse.
1. I do not know what this phrase means "import client to a website" I have
no idea what you are trying to accomplish. Can you use different words to
describe what you mean?

2. I do not know what is difficult about this: "convert webpage contents to
html format" since nearly all web pages are already in HTML format. That is
the nature of the web. All browsers begin by reading HTML. Note that if
the HTML in your target web page is constructed on the fly using Javascript,
then you are going to have a TOUGH time emulating that in C# code.

3. You want to "save it to a sql server database". What is "it" that you
are saving? Each page? Each element on a page? The content of the page?
Why save it to SQL? Do you intend to look up pages using SQL queries? Why
not save it as a web site and use HTTP to get the pages?

I want to help. But until you answer some of these questions, I won't be
terribly helpful.

Note: Are you looking for something like WinHTTrack? This tool is useful
for visiting a web site and creating, on your hard drive, a complete copy of
the site with links intact. It's fairly friendly and easy to use.
--
--- Nick Malik [Microsoft]
MCSD, CFPS, Certified Scrummaster
http://blogs.msdn.com/nickmalik

Disclaimer: Opinions expressed in this forum are my own, and not
representative of my employer.
I do not answer questions on behalf of my employer. I'm just a
programmer helping programmers.
--
"Dhananjay" <dh**********@y ahoo.co.inwrote in message
news:11******** **************@ 14g2000cws.goog legroups.com...
>
Marc Gravell wrote:
>You don't need to "convert" anything here. The website is probably in
HTML already. If it isn't you won't be able to do it. You may be able
to use WebClient here to simply download the text (see MSDN2) - but
even then it won't be a usable static copy, as all images, scripts,
cookies, links etc will probably be dead if you *just* use the HTML in
isolation of the other stuff.

You could try and use WebBrowser to export as an mht; never tried it -
might work. Alternatively if it is for later reference you might try
using tools like HTMLDOC to create a standalone PDF of the page (hence
including the images but not scripts).

Alternativel y you can find lots of crawlers on google to do this for
you.

It really depends on what *exactly* you need. And unless this is
somehow a C# issue you may find other groups more useful.

Marc


hello marc
anyway thanks for spending time on me.
what you have suggested i tried it but its not working, its saying
namespace problem.i think this feature is different. i am using vs2005
C#
will you tell me one thing either my problem will be solved by creating
windows appln or web appln.first i have to import client to a website
and then generate a tool to convert webpage contents to html format
save it to sql server databse.
first i was doing with vb.net i have generated a tool which converts
webpage contents to html format , but same thing its not working
inC#.net.

plz reply me
Thanks
waiting for your reply asap
Dhananjay

Nov 28 '06 #7

Nick Malik [Microsoft] wrote:
Hello Dhananjay,

First off, your English is vague. This leads to some misunderstandin g.
More on that below.

Secondly, it is not clear what BUSINESS PROBLEM you are trying to solve.
Before you jump to "what is wrong with my solution," please help us to
understand what problem your code is trying solve. There may be a better
way than writing code!

Thirdly, if you have written code, and it is not working, please post it.
That provides a great deal of information for us to help you.

Now, back to your request.

You said:
first i have to import client to a website
and then generate a tool to convert webpage contents to html format
save it to sql server databse.

1. I do not know what this phrase means "import client to a website" I have
no idea what you are trying to accomplish. Can you use different words to
describe what you mean?

2. I do not know what is difficult about this: "convert webpage contents to
html format" since nearly all web pages are already in HTML format. That is
the nature of the web. All browsers begin by reading HTML. Note that if
the HTML in your target web page is constructed on the fly using Javascript,
then you are going to have a TOUGH time emulating that in C# code.

3. You want to "save it to a sql server database". What is "it" that you
are saving? Each page? Each element on a page? The content of the page?
Why save it to SQL? Do you intend to look up pages using SQL queries? Why
not save it as a web site and use HTTP to get the pages?

I want to help. But until you answer some of these questions, I won't be
terribly helpful.

Note: Are you looking for something like WinHTTrack? This tool is useful
for visiting a web site and creating, on your hard drive, a complete copy of
the site with links intact. It's fairly friendly and easy to use.
--
--- Nick Malik [Microsoft]
MCSD, CFPS, Certified Scrummaster
http://blogs.msdn.com/nickmalik

Disclaimer: Opinions expressed in this forum are my own, and not
representative of my employer.
I do not answer questions on behalf of my employer. I'm just a
programmer helping programmers.
--
"Dhananjay" <dh**********@y ahoo.co.inwrote in message
news:11******** **************@ 14g2000cws.goog legroups.com...

Marc Gravell wrote:
You don't need to "convert" anything here. The website is probably in
HTML already. If it isn't you won't be able to do it. You may be able
to use WebClient here to simply download the text (see MSDN2) - but
even then it won't be a usable static copy, as all images, scripts,
cookies, links etc will probably be dead if you *just* use the HTML in
isolation of the other stuff.

You could try and use WebBrowser to export as an mht; never tried it -
might work. Alternatively if it is for later reference you might try
using tools like HTMLDOC to create a standalone PDF of the page (hence
including the images but not scripts).

Alternatively you can find lots of crawlers on google to do this for
you.

It really depends on what *exactly* you need. And unless this is
somehow a C# issue you may find other groups more useful.

Marc

hello marc
anyway thanks for spending time on me.
what you have suggested i tried it but its not working, its saying
namespace problem.i think this feature is different. i am using vs2005
C#
will you tell me one thing either my problem will be solved by creating
windows appln or web appln.first i have to import client to a website
and then generate a tool to convert webpage contents to html format
save it to sql server databse.
first i was doing with vb.net i have generated a tool which converts
webpage contents to html format , but same thing its not working
inC#.net.

plz reply me
Thanks
waiting for your reply asap
Dhananjay
=============== =============== =============== =============== =
hello nick
As you have asked some questions.in a simple way i am trying to achieve
this:-
my plan on building a Cache System. It will import content from
different Dhananjay-Sites, translate the dhananjay-Code into HTML and
republish it in a specific format on a file system.

now will you plz guide me how to proceed so that i can achieve it
or have u developed something like this previously then send me the
resources, so that i acn easily proceed towards the target
or u want in more detail ? let me know

plz reply me asap
Thanks
Dhananjay

Nov 28 '06 #8
Hello Dhananjay,
hello nick
As you have asked some questions.in a simple way i am trying to achieve
this:-
my plan on building a Cache System. It will import content from
different Dhananjay-Sites, translate the dhananjay-Code into HTML and
republish it in a specific format on a file system.

now will you plz guide me how to proceed so that i can achieve it
or have u developed something like this previously then send me the
resources, so that i acn easily proceed towards the target
or u want in more detail ? let me know

You are building a cache system. I assume from your statement that the goal
is for a person, using their web browser, to be able to visit a web site
while online, cache the site, and then visit it again when offline. Is this
true? (Are you aware that this is built-in functionality in the IE browser?
Simply add the site to favorites and check the "make available offline"
check box.)

I will assume, given the fact that this is trivial for an individual user,
that you intend for this cache to be visited by more than one user.
Therefore, I assume that the source sites are somehow more 'difficult' to
reach or less reliable than your cache server. In that case, you need to
provide what is called a 'proxy cache' in that the users will hit your site,
looking for the web pages that they want, and your app will get the data
from the remote system, update the local cache, and serve the pages.

Of course, there is no need to write code for any of this. Simply install
ISA server. http://www.microsoft.com/isaserver/default.mspx

On the off chance that you posted on a developer forum because you'd rather
develop software than install existing stuff (;-), then perhaps the code on
this link would be helpful. It is not a proxy server. It is, instead, a
web site spider. That actually sounds more like what you are saying you
want. This link provides complete C# source code for downloading web sites
to a local hard drive: See open source code at
http://www.codeproject.com/useritems/ZetaWebSpider.asp

For a more full-featured system that caches web sites, but one that is not
written in C# (to the best of my knowledge) but is still free, check out
HTTrack. The windows version is WinHTTrack? (www.httrack.com)

I hope this helps,
--
--- Nick Malik [Microsoft]
MCSD, CFPS, Certified Scrummaster
http://blogs.msdn.com/nickmalik

Disclaimer: Opinions expressed in this forum are my own, and not
representative of my employer.
I do not answer questions on behalf of my employer. I'm just a
programmer helping programmers.
Nov 28 '06 #9

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

21
6561
by: Dave | last post by:
After following Microsofts admonition to reformat my system before doing a final compilation of my app I got many warnings/errors upon compiling an rtf file created in word. I used the Help Workshop program: hcw.exe that's included with Visual Basic. This exact same file compiled perfectly with no notes, warnings or errors prior to reformatting my system. Prior to the reformatting, I copied the help.rtf file onto a CD and checked the box to...
9
4415
by: Tom | last post by:
A question for gui application programmers. . . I 've got some GUI programs, written in Python/wxPython, and I've got a help button and a help menu item. Also, I've got a compiled file made with the microsoft HTML workshop utility, lets call it c:\path\help.chm. My question is how do you launch it from the GUI? What logic do I put behind the "help" button, in other words. I thought it would be os.spawnv(os.P_DETACH,...
6
4355
by: wukexin | last post by:
Help me, good men. I find mang books that introduce bit "mang header files",they talk too bit,in fact it is my too fool, I don't learn it, I have do a test program, but I have no correct doing result in any way. Who can help me, I thank you very very much. list.cpp(main program) //-------------------------------------------------------------------------- - #pragma hdrstop #pragma argsused
3
3367
by: Colin J. Williams | last post by:
Python advertises some basic service: C:\Python24>python Python 2.4.1 (#65, Mar 30 2005, 09:13:57) on win32 Type "help", "copyright", "credits" or "license" for more information. >>> With numarray, help gives unhelpful responses:
7
5392
by: Corepaul | last post by:
Missing Help Files When I enter "recordset" as the keyword and search the Visual Basic Help index, I get many topics of interest in the resulting list. But there isn't any information available from clicking on many of the available topics (mostly methods but some properties are also unavailable). This same problem occurs with many, if not most, keywords. Is there any way I can activate these "missing" help topics? HELP!
5
3281
by: Steve | last post by:
I have written a help file (chm) for a DLL and referenced it using Help.ShowHelp My expectation is that a developer using my DLL would be able to access this help file during his development time using "F1" help within the VB IDE. Is this expectation achievable In trying to test my help file in the IDE, I have a solution with 2 projects: the DLL and a tester. VB does not look for my help file; instead, it looks for path to my source code...
8
3237
by: Mark | last post by:
I have loaded Visual Studio .net on my home computer and my laptop, but my home computer has an abbreviated help screen not 2% of the help on my laptop. All the settings look the same on both including search the internet for help, but the help is worthless. Any ideas?
10
3367
by: JonathanOrlev | last post by:
Hello everybody, I wrote this comment in another message of mine, but decided to post it again as a standalone message. I think that Microsoft's Office 2003 help system is horrible, probably the worst I ever seen. I almost cannot find anything I need, including things I
1
6140
by: trunxnirvana007 | last post by:
'UPGRADE_WARNING: Array has a new behavior. Click for more: 'ms-help://MS.VSCC.v80/dv_commoner/local/redirect.htm?keyword="9B7D5ADD-D8FE-4819-A36C-6DEDAF088CC7"' 'UPGRADE_WARNING: Couldn't resolve default property of object Label. Click for more: 'ms-help://MS.VSCC.v80/dv_commoner/local/redirect.htm?keyword="6A50421D-15FE-4896-8A1B-2EC21E9037B2"' Label = New Object(){Box1, Box2, Box3, Box4, Box5, Box6, Box7, Box8, Box9, Box10, Box11,...
0
2895
by: hitencontractor | last post by:
I am working on .NET Version 2003 making an SDI application that calls MS Excel 2003. I added a menu item called "MyApp Help" in the end of the menu bar to show Help-> About. The application calls MS Excel, so the scenario is that I am supposed to see the Excel Menu bar, FILE EDIT VIEW INSERT ... HELP. I am able to see the menu bar, but in case of Help, I see the Help of Excel and help of my application, both as a submenu of help. ...
0
9511
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it. First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
0
10412
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. Here is my compilation command: g++-12 -std=c++20 -Wnarrowing bit_field.cpp Here is the code in...
0
10200
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that captivates audiences and drives business growth. The Art of Business Website Design Your website is...
1
10142
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
0
9986
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
1
7529
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
6769
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...
0
5422
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
3
2909
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.