Hi guys,
This is a weird problem, and I'm not sure if I got it right.
Just a practical example, that will describe my problem:
I'm connecting to google.com host on port 80 using fsock open, and I
send a regular GET header without any specific HTTP headers regarding
the type of encoding accepted, cookies, accepted charset, conditional
headers etc
What happens, is after sending the headers to this stream opened using
fsockopen, I start grabbing the headers, and then, comes the body of
the web page, everything seems logic until this point.
The problem is, just after the headers are received, the body of the
page, contains few odd alphanumeric values , about 4 elements in
length, and it seems it's a hexa value. e.g.. 2A, or two values
maybe: 8c9d... then comes the regular HTML code of the page if any.
At the end of the grabbed content, there's also one of these
alphanumeric groups, or a "0" (zero).
For some reason I tend to believe the characters right after the
headers are sent are used by browsers to identify the type of the
encoding of the stream? e.g. bytes that decide that my page is going
to come as UTF-8 encoding?
Anyways, the problem is, how to make sure I get the page right, and
why the file_Get_conten ts (url_goes_here) doesn't grab those
alphanumeric characters, considering they're stripping the returned
headers of the request already.
I am still thinking it's some sort of "stream's first byte" that
informs the app about the encoding of the content, but I'm here to
hear your input and solution on this.
Thank you,
Vladimir Ghetau http://www.Vladimirated.com/ 2 4386
Hi,
You could try using HTTP/1.0 or simply leaving off the HTTP version.
HTTP/1.1 clients must be able to handle "chunked transfer coding",
which is the encoding you're seeing. Each segment is preceded by it's
size in hex.
Details: http://www.w3.org/Protocols/rfc2616/....html#sec3.6.1
Peace,
John Peters
On Jan 19, 3:18 am, Vladimir Ghetau <vladi...@pixel tomorrow.com>
wrote:
Hi guys,
This is a weird problem, and I'm not sure if I got it right.
Just a practical example, that will describe my problem:
I'm connecting to google.com host on port 80 using fsock open, and I
send a regular GET header without any specific HTTP headers regarding
the type of encoding accepted, cookies, accepted charset, conditional
headers etc
What happens, is after sending the headers to this stream opened using
fsockopen, I start grabbing the headers, and then, comes the body of
the web page, everything seems logic until this point.
The problem is, just after the headers are received, the body of the
page, contains few odd alphanumeric values , about 4 elements in
length, and it seems it's a hexa value. e.g.. 2A, or two values
maybe: 8c9d... then comes the regular HTML code of the page if any.
At the end of the grabbed content, there's also one of these
alphanumeric groups, or a "0" (zero).
For some reason I tend to believe the characters right after the
headers are sent are used by browsers to identify the type of the
encoding of the stream? e.g. bytes that decide that my page is going
to come as UTF-8 encoding?
Anyways, the problem is, how to make sure I get the page right, and
why the file_Get_conten ts (url_goes_here) doesn't grab those
alphanumeric characters, considering they're stripping the returned
headers of the request already.
I am still thinking it's some sort of "stream's first byte" that
informs the app about the encoding of the content, but I'm here to
hear your input and solution on this.
Thank you,
Vladimir Ghetau
http://www.Vladimirated.com/
Hello,
on 01/19/2008 06:18 AM Vladimir Ghetau said the following:
I'm connecting to google.com host on port 80 using fsock open, and I
send a regular GET header without any specific HTTP headers regarding
the type of encoding accepted, cookies, accepted charset, conditional
headers etc
What happens, is after sending the headers to this stream opened using
fsockopen, I start grabbing the headers, and then, comes the body of
the web page, everything seems logic until this point.
The problem is, just after the headers are received, the body of the
page, contains few odd alphanumeric values , about 4 elements in
length, and it seems it's a hexa value. e.g.. 2A, or two values
maybe: 8c9d... then comes the regular HTML code of the page if any.
At the end of the grabbed content, there's also one of these
alphanumeric groups, or a "0" (zero).
For some reason I tend to believe the characters right after the
headers are sent are used by browsers to identify the type of the
encoding of the stream? e.g. bytes that decide that my page is going
to come as UTF-8 encoding?
Anyways, the problem is, how to make sure I get the page right, and
why the file_Get_conten ts (url_goes_here) doesn't grab those
alphanumeric characters, considering they're stripping the returned
headers of the request already.
I am still thinking it's some sort of "stream's first byte" that
informs the app about the encoding of the content, but I'm here to
hear your input and solution on this.
Those are chunked transfer encoding blocks. You need to decode and
assemble the blocks. They are useful to know when the server response
has ended for responses with unpredicted length, like for instance those
generated by dynamically generated pages with PHP.
You may want to take a look at this HTTP client class to learn how to
decode them: http://www.phpclasses.org/httpclient
--
Regards,
Manuel Lemos
PHP professionals looking for PHP jobs http://www.phpclasses.org/professionals/
PHP Classes - Free ready to use OOP components written in PHP http://www.phpclasses.org/ This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics |
by: Michael T. Peterson |
last post by:
I am trying to get the file referenced by the following url:
http://waterdata.usgs.gov/wa/nwis/uv?dd_cd=01&dd_cd=02&format=rdb&period=1&site_no=12149000
I'm using parse_url to get the host, path, and query. My call to
fsockopen() works correctly. Unfortunately, I can find no documentation for
how to construct a request that will correctly instruct the server (i.e.,
waterdata.usgs.gov) to send me the file and all the combinations of...
|
by: Mike Verdone |
last post by:
Hello all,
I'm trying to implement a PHP program that can handle streaming HTTP
data through Apache. I need to somehow get access to the incoming data
of the HTTP request as it arrives (i.e. I don't want to access the
FILE array).
Is this possible with mod_php? I think that, in general, it should be
possible as I think mod_python can do it.
|
by: Lu |
last post by:
Hi, i am currently working on ASP.Net v1.0 and is encountering the following
problem. In javascript, I'm passing in:
"somepage.aspx?QSParameter=<RowID>Chèques</RowID>" as part of the query
string. However, in the code behind when I tried to get the query string
value by calling Request.QueryString("QSParameter"), the value I got is:
"<RowID>Chques</RowID>". The special character "è" has been stripped out.
The web.config file is...
|
by: WIWA |
last post by:
Hi,
I want to login to a password protected website and fetch the content of the
page behind. I have based my code on
http://weblogs.asp.net/jdennany/archive/2005/04/23/403971.aspx. When I use
tools like ieHTTPHeaders v1.6, and I perform a normal login (using the
normal website), I see that the viewstate is
__VIEWSTATE=dDwxMzU4OTE3NTA2Ozs%2BTK88jS63JXN181X3N8zKivua8co%3D&txt_username=xxx&txt_password=xxxxxx&btn_login=Login.
When I...
|
by: lazypig06 |
last post by:
Hi !
Yesterday, I posted a topic regarding to XML problem that I've been
having. The old topic can be found at:
http://groups.google.com/group/comp.lang.php/browse_thread/thread/401e2b5cb251c21/8a0eeb03626018f4?lnk=st&q=lazypig06%40gmail.com&rnum=1&hl=en#8a0eeb03626018f4
Thanks to Álvaro G. Vicario for pointing out "chunked" transfer thing.
Anyway, I tried to avoid "chunked" transfer in HTTP protocol. I have
looked at...
| |
by: webEater |
last post by:
Hey,
I am writing a file that reads in an external file in the web and
prints it out including the response header of the http protocol. I do
this to enable cross domain XMLHttpRequests.
I implemented it via fsockopen, like this:
<?
$url = $_REQUEST; // take the param as $uri
//... more ...
|
by: Boldgeek |
last post by:
I am trying to develop an app that will allow automatic updating of a
web form which uses multipart/form-data enctype (as it MIGHT be sending
an image)
I have an example form, which when submitted displays the results as
you would expect.
However when I send the same data using the PHP script I get a 400
error:
Bad Request
|
by: xieliwei |
last post by:
I have a freshly installed openSuSe 10.2 with PHP4 from
http://download.opensuse.org/repositories/home:/michal-m:/php4/openSUSE_10.2/
(openSuSe abandoned PHP4 since version 10, but I have customers who
need php4 support)
The version strings are as follows:
# uname -a
Linux server2 2.6.18.2-34-default #1 SMP Mon Nov 27 11:46:27 UTC 2006
i686 athlon i386 GNU/Linux
|
by: beau.moore |
last post by:
Hi all,
I need some help, I am trying to access a service that binds to the
loopback address on my linux FC8 server. I think PHP functions but I
am unable to get any response back from the fgets() command below.
The service is GoogleDesktop for Linux, I am aware of all of the
restrictions in place to prevent what I am trying to do but I want to
try anyway. It seems to me that my http request should look just like
an http request from...
|
by: marktang |
last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main usage, and What is the difference between ONU and Router. Let’s take a closer look !
Part I. Meaning of...
|
by: Hystou |
last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it.
First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
| |
by: Oralloy |
last post by:
Hello folks,
I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>".
The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed.
This is as boiled down as I can make it.
Here is my compilation command:
g++-12 -std=c++20 -Wnarrowing bit_field.cpp
Here is the code in...
|
by: jinu1996 |
last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that captivates audiences and drives business growth.
The Art of Business Website Design
Your website is...
|
by: agi2029 |
last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own....
Now, this would greatly impact the work of software developers. The idea...
|
by: isladogs |
last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM).
In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules.
He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms.
Adolph will...
|
by: TSSRALBI |
last post by:
Hello
I'm a network technician in training and I need your help.
I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs.
The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols.
I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
|
by: adsilva |
last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
| |
by: muto222 |
last post by:
How can i add a mobile payment intergratation into php mysql website.
| | |