Hi,
I am using the XmlValidatingReader class in VS.NET 2003 (targeting .NET
v1.1) to validate an XML message against a set of schemas.
Within the schema, a type is defined as follows:
<xs:simpleType name="atypename">
  <xs:restriction base="xs:token">
    <xs:pattern value="\w{1,6}"/>
  </xs:restriction>
</xs:simpleType>
The "\w" construct should match any character except those in the Unicode
"punctuation", "separator" and "other" categories.
The W3C documentation indicates that the underscore character is punctuation
(it falls in the "connector punctuation" category, Pc) and should therefore
be excluded. However, no validation event is raised when the content has an
underscore character in it. I think this is probably because the Unicode
recommendation says that "\w" should allow underscores, given their common
use in programming languages.
However, I would have thought that XmlValidatingReader would follow the W3C
recommendation?
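
To make the difference concrete, here is a rough sketch in Python (not .NET, and not a claim about how XmlValidatingReader works internally). It checks a value against my reading of the W3C definition of "\w" using Unicode general categories, and compares that with Python's own regex "\w", which accepts the underscore just as I am seeing in .NET. The helper names is_xsd_word_char and matches_atypename are made up for illustration, and the sketch ignores the whitespace handling of xs:token.

```python
import re
import unicodedata


def is_xsd_word_char(ch: str) -> bool:
    """Return True if ch is a word character under the W3C reading of \\w.

    That reading: any character whose Unicode general category is not
    P* (punctuation), Z* (separator) or C* (other).
    """
    return unicodedata.category(ch)[0] not in ("P", "Z", "C")


def matches_atypename(value: str) -> bool:
    """Hypothetical stand-in for the pattern \\w{1,6} under the W3C reading."""
    return 1 <= len(value) <= 6 and all(is_xsd_word_char(c) for c in value)


# The underscore is "connector punctuation", so the W3C reading rejects it.
print(unicodedata.category("_"))    # Pc
print(matches_atypename("ab_cd"))   # False

# A programming-language style \w (here Python's) accepts it.
print(re.fullmatch(r"\w{1,6}", "ab_cd") is not None)   # True
```

So the same value "ab_cd" is invalid under the first reading and valid under the second, which is exactly the mismatch I am asking about.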
I can't see this listed as a known bug anywhere. Is this because it is not
seen as a bug?
Could someone tell me whether, if I move to .NET v2.0, this will behave in
the way recommended by the W3C, or is the behaviour the same as in .NET v1.1?
Many thanks
--
Phil Hobgen
Southampton, UK