Say I have a Windows-1252 character in some text because someone,
somewhere, copied and pasted something from Microsoft Word into my
online CMS, but I've declared the encoding as UTF-8. Is there a way to
clean that up using PHP? I already run the text through
htmlspecialchars and htmlentities, but that is not enough. I'm getting
an error because of something on my weblog that I copied and pasted
from elsewhere. This validator flags the error:
http://rss.scripting.com/?url=http%3...%2Fpage938.xml
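
For illustration, here is a minimal sketch of the kind of cleanup being asked about. It assumes the mbstring extension is available and that any input which is not valid UTF-8 is actually Windows-1252, which is a guess about the pasted text, not something PHP can detect. The function name to_utf8_for_feed is hypothetical. It also escapes only the XML special characters, since named entities such as &rsquo; (which htmlentities can produce) are not predefined in XML.

```php
<?php
// Hypothetical helper: the name and the Windows-1252 fallback are
// assumptions made for this sketch, not part of any library.
function to_utf8_for_feed(string $text): string
{
    // Valid UTF-8 passes through untouched; anything else is assumed
    // to be Windows-1252 (e.g. Word "smart quotes") and is converted.
    if (!mb_check_encoding($text, 'UTF-8')) {
        $text = mb_convert_encoding($text, 'UTF-8', 'Windows-1252');
    }

    // Escape only the XML special characters (&, <, >, quotes).
    // ENT_XML1 avoids HTML-only named entities in the output.
    return htmlspecialchars($text, ENT_QUOTES | ENT_XML1, 'UTF-8');
}

// "\x93" and "\x94" are the Windows-1252 curly double quotes.
echo to_utf8_for_feed("He said \x93hi\x94 & left"), "\n";
```

One limitation of this approach: a string that mixes valid UTF-8 with stray Windows-1252 bytes fails the validity check as a whole, so its UTF-8 portions would be converted a second time and come out garbled. Converting at the point where the text enters the CMS, before it is mixed with other content, avoids that case.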