I want to convert large files of prose to xhtml and so I need a way to
remove unwanted line wraps. So, I'm looking for a freebee editor that
has the capability of searching for a single "carriage return/line feed"
or "line feed/carriage return" and removing them. I quess what I need is
an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Thank you...
Rich 23 6846
In article <1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>, Se*********@hotmail.com enlightened us with... I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Sed, VI.... ;)
--
--
~kaeli~
To steal ideas from one person is plagiarism; to steal from
many is research. http://www.ipwebdesign.net/wildAtHeart http://www.ipwebdesign.net/kaelisSpace
SeaPlusPlus wrote: I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Thank you...
Rich
Rich,
Try NoteTab Pro trial version set for no-wrap to convert the text to
XHTML and see if that works for you. You should easily see any
other discrepancies in the browser view. Also, any "off-spring" of
the original UNIX "vi" should allow you to search for (and
substitute) special characters. It just requires a knowledge of
control characters. The second suggested is not recommended unless
you are truly adventurous.
Cheers,
Del Ferguson
kaeli wrote: In article <1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>, Se*********@hotmail.com enlightened us with...
I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Sed, VI.... ;)
Qrs, TU.... :-(
Thank you... (i guess)...
Rich
> SeaPlusPlus wrote: I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Del Ferguson wrote:
Try NoteTab Pro trial version set for no-wrap to convert the text to XHTML and see if that works for you. You should easily see any other discrepancies in the browser view. Also, any "off-spring" of the original UNIX "vi" should allow you to search for (and substitute) special characters. It just requires a knowledge of control characters. The second suggested is not recommended unless you are truly adventurous.
Wrap and no-wrap is only how the editor "presents" the text...
NoteTab'll still wrap a line that contains the cr/lf or lf/cr or
whatever terminates the line... the question is... will NoteTab search
for special characters ? ? ? I thought not ! ! !
<facetious>
So, I guess I need to get the VT100 out of the cellar and fire up
RSX11/M in order to use EDT or EDN which worked for me in 1979 because I
can't find a replacement in 2005... AAARRRGGGHHHH ! ! !
</facetious>
Thank you...
Rich
SeaPlusPlus wrote: SeaPlusPlus wrote:
I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings. Does anyone know of on that will allow this?
Del Ferguson wrote:
Try NoteTab Pro trial version set for no-wrap to convert the text to XHTML and see if that works for you. You should easily see any other discrepancies in the browser view. Also, any "off-spring" of the original UNIX "vi" should allow you to search for (and substitute) special characters. It just requires a knowledge of control characters. The second suggested is not recommended unless you are truly adventurous.
Wrap and no-wrap is only how the editor "presents" the text... NoteTab'll still wrap a line that contains the cr/lf or lf/cr or whatever terminates the line... the question is... will NoteTab search for special characters ? ? ? I thought not ! ! !
<facetious> So, I guess I need to get the VT100 out of the cellar and fire up RSX11/M in order to use EDT or EDN which worked for me in 1979 because I can't find a replacement in 2005... AAARRRGGGHHHH ! ! ! </facetious>
No need. TextPad (free evaluation version)can search and replace
line-breaks, tabs, spaces. Just need to tick the box that says
'regular expression'.
As for wrapping in the editor, TextPad can wrap the lines or leave
them run 'off the screen', it's your choice.
--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -
Now playing: The Trammps - Ten Percent
In article <1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>,
SeaPlusPlus <Se*********@hotmail.com> wrote: I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Why is this necessary are part of converting the files to xhtml?
--
= Eric Bustad, Norwegian bachelor programmer
> In article <1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>, SeaPlusPlus <Se*********@hotmail.com> wrote:
I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Eric Kenneth Bustad wrote:
Why is this necessary are part of converting the files to xhtml?
I needed to reference xhtml to pose an on-topic question. If I didn't
say that, you (the 'anal' police) would be posting that my query is
off-topic and I should be posting to some 'editor' newsgroup.
OBTW, why do you have 'is and 'are' in the same sentence?
Thank you... (i think)
Rich
SeaPlusPlus wrote: SeaPlusPlus wrote:
I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings. Does anyone know of on that will allow this?
Del Ferguson wrote:
Try NoteTab Pro trial version set for no-wrap to convert the text to XHTML and see if that works for you. You should easily see any other discrepancies in the browser view. Also, any "off-spring" of the original UNIX "vi" should allow you to search for (and substitute) special characters. It just requires a knowledge of control characters. The second suggested is not recommended unless you are truly adventurous.
Wrap and no-wrap is only how the editor "presents" the text... NoteTab'll still wrap a line that contains the cr/lf or lf/cr or whatever terminates the line... the question is... will NoteTab search for special characters ? ? ? I thought not ! ! !
<facetious> So, I guess I need to get the VT100 out of the cellar and fire up RSX11/M in order to use EDT or EDN which worked for me in 1979 because I can't find a replacement in 2005... AAARRRGGGHHHH ! ! ! </facetious>
Thank you...
Rich
Rich,
If you no-wrap text and convert to XHTML in NoteTab, the special
character conversions are obvious; however, if you want to do it the
"hard" way, I can send you DOS versions of vi, ed, sed, etc. Works
for me. You are right, NoteTab does not search special characters.
Why bother if XHTML will do all the work? Are you assuming your
text is screwed up?
Cheers,
Del Ferguson
in comp.infosystems. www.authoring.html, SeaPlusPlus wrote: In article <1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>, SeaPlusPlus <Se*********@hotmail.com> wrote:I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps.
Why they are unwanted?
So, I'm looking for a freebee editor thathas the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Emacs will do that... This question was asked in alt.html last week...
Does anyone know of on that will allow this?
Eric Kenneth Bustad wrote:
Why is this necessary are part of converting the files to xhtml?
I needed to reference xhtml to pose an on-topic question. If I didn't say that, you (the 'anal' police) would be posting that my query is off-topic and I should be posting to some 'editor' newsgroup.
Well, why didn't you? As there is not much use remove linefeeds when
converting to xhtml. Or any other html...
OBTW, why do you have 'is and 'are' in the same sentence?
plonk.
--
Lauri Raittila <http://www.iki.fi/lr> <http://www.iki.fi/zwak/fonts>
Utrecht, NL.
Support me, buy Opera: https://secure.bmtmicro.com/opera/bu...tml?AID=882173
SeaPlusPlus wrote: In article <1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>, SeaPlusPlus <Se*********@hotmail.com> wrote:
I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings. Does anyone know of on that will allow this?
Eric Kenneth Bustad wrote:
Why is this necessary are part of converting the files to xhtml?
I needed to reference xhtml to pose an on-topic question. If I didn't say that, you (the 'anal' police) would be posting that my query is off-topic and I should be posting to some 'editor' newsgroup.
So your question is on-topic and we still don't know why you think you
need to remove the carriage returns from the text of the pages.
In article <1114797063.c887feaca18820caf971c4ea08f041ab@teran ews>,
SeaPlusPlus <Se*********@hotmail.com> wrote: In article <1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>, SeaPlusPlus <Se*********@hotmail.com> wrote:I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.Does anyone know of on that will allow this?
Eric Kenneth Bustad wrote:
Why is this necessary are part of converting the files to xhtml?
I needed to reference xhtml to pose an on-topic question. If I didn't say that, you (the 'anal' police) would be posting that my query is off-topic and I should be posting to some 'editor' newsgroup.
I wasn't trying to netcop, just curious. "Line wraps" would be taken
as just another bit of white space, so I didn't see why they would be
a problem in xhtml.
OBTW, why do you have 'is and 'are' in the same sentence?
Typo. I meant to type ".. necessary as part ..".
Thank you... (i think)
You're welcome...
--
= Eric Bustad, Norwegian bachelor programmer
SeaPlusPlus wrote: the question is... will NoteTab search for special characters ? ? ?
Yes (the Pro version for sure, at least.)
To find/replace a newline, use ^P
Notetab also detects and handles different newline formats
(Unix/Windows/Mac) automatically. It could also convert your text to
(X)HTML automatically too.
I thought not ! ! !
Second thoughts? ;-)
Andrew
Andrew Donaldson wrote: SeaPlusPlus wrote:
the question is... will NoteTab search for special characters ? ? ?
Yes (the Pro version for sure, at least.)
To find/replace a newline, use ^P
Notetab also detects and handles different newline formats (Unix/Windows/Mac) automatically. It could also convert your text to (X)HTML automatically too.
I thought not ! ! !
Second thoughts? ;-)
Andrew
Andrew,
I spoke too soon in my reply about NoteTab Pro. I just don't do my
text "problems" that way. I convert to XHTML and then look for the
obvious formatting problems.
Cheers,
Del Ferguson
>>> SeaPlusPlus wrote: I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Del Ferguson wrote:
If you no-wrap text and convert to XHTML in NoteTab, the special character conversions are obvious; however, if you want to do it the "hard" way, I can send you DOS versions of vi, ed, sed, etc. Works for me. You are right, NoteTab does not search special characters. Why bother if XHTML will do all the work? Are you assuming your text is screwed up?
Del,
What are you talking about? XHTML won't 'do all the work' now would it?
As you can see, in my original question I said... "large files of prose"
to "xhtml".
There're <carriage return>'s and/or <line feed>'s sprinkled throughout
these "large files of prose". I want to, under program control, search
them out, then replace or delete them (with extreme prejudice). ;-)
Listen, two line breaks in a row can be considered inter-paragraph and
one line break standing alone (surrounded by printable text) can be
considered intra-paragraph. I want my editor to search out and dispense
the appropriate action.
I'd like to change all the line break pairs and any space preceeding or
succeeding them to become...
<space><new line>
</p><new line>
<new line>
<p><new line>
and I'd like to change all the single line breaks and any space
preceeding or succeeding them to a single word space.
Then, voila, I'll have big chunk of the conversion process completed.
Thank you...
Rich
>>>SeaPlusPlus wrote: I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
Els wrote:
No need. TextPad (free evaluation version)can search and replace line-breaks, tabs, spaces. Just need to tick the box that says 'regular expression'.
That, Els, is THE ANSWER ! ! ! Thank you. I used it to edit the file
I'd been working on and finished it no time at all.
As for wrapping in the editor, TextPad can wrap the lines or leave them run 'off the screen', it's your choice.
Wraping in the editor was never the problem. So, thanks again.
It was all those <new line>'s that needed to be stripped so that they
wouldn't force the BROWSER to wrap where the BROWSER didn't want to...
Rich
SeaPlusPlus wrote: SeaPlusPlus wrote: I want to convert large files of prose to xhtml and so I need a way > to remove unwanted line wraps. So, I'm looking for a freebee editor > that has the capability of searching for a single "carriage > return/line feed" or "line feed/carriage return" and removing them. > I quess what I need is an editor that allows non-printable > characters in it's search strings. Does anyone know of on that will allow this?
Del Ferguson wrote:
If you no-wrap text and convert to XHTML in NoteTab, the special character conversions are obvious; however, if you want to do it the "hard" way, I can send you DOS versions of vi, ed, sed, etc. Works for me. You are right, NoteTab does not search special characters. Why bother if XHTML will do all the work? Are you assuming your text is screwed up?
Del,
What are you talking about? XHTML won't 'do all the work' now would it?
As you can see, in my original question I said... "large files of prose" to "xhtml".
There're <carriage return>'s and/or <line feed>'s sprinkled throughout these "large files of prose". I want to, under program control, search them out, then replace or delete them (with extreme prejudice). ;-)
Listen, two line breaks in a row can be considered inter-paragraph and one line break standing alone (surrounded by printable text) can be considered intra-paragraph. I want my editor to search out and dispense the appropriate action.
I'd like to change all the line break pairs and any space preceeding or succeeding them to become...
<space><new line> </p><new line> <new line> <p><new line>
and I'd like to change all the single line breaks and any space preceeding or succeeding them to a single word space.
Then, voila, I'll have big chunk of the conversion process completed.
NoteTab Pro can do all that.
>>>SeaPlusPlus wrote: I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps.
Lauri Raittila wrote: Why they are unwanted?
HUH? This reads like an incomplete thought terminated by a question
mark... hmmm... even as a question it seems... well... questionable...
Yes, Lauri, they ARE unwanted. the prose in these files are littered
with <new line>'s which would inhibit ANY browser from flowing the text
properly. I realize this is a difficult concept for the uninitiated, but... So, I'm looking for a freebee editor thathas the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Emacs will do that... This question was asked in alt.html last week...
Does anyone know of on that will allow this?
Eric Kenneth Bustad wrote:
Why is this necessary are part of converting the files to xhtml?
I needed to reference xhtml to pose an on-topic question. If I didn't say that, you (the 'anal' police) would be posting that my query is off-topic and I should be posting to some 'editor' newsgroup.
Well, why didn't you? As there is not much use remove linefeeds when converting to xhtml. Or any other html...
The hell there isn't! what the hell are you talking about?
OBTW, why do you have 'is and 'are' in the same sentence?
plonk.
Good, remember that next time I post a query.
Thank you
Rich
Harlan Messinger wrote: SeaPlusPlus wrote:
In article <1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>, SeaPlusPlus <Se*********@hotmail.com> wrote: I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings. Does anyone know of on that will allow this?
Eric Kenneth Bustad wrote:
Why is this necessary are part of converting the files to xhtml?
I needed to reference xhtml to pose an on-topic question. If I didn't say that, you (the 'anal' police) would be posting that my query is off-topic and I should be posting to some 'editor' newsgroup.
So your question is on-topic and we still don't know why you think you need to remove the carriage returns from the text of the pages.
I have a lot of what you people may call "CONTENT" which is the whole
idea of web pages to start with to provide information. The content I
have is in the form of text files. These text files are formatted (as
the vast majority of text files are formatted) with <cr><lf> throughout.
This content would not flow correctly in a browser if all the new line
information was allowed to remain in the (x)html generated from the
content file. So, the correct and proper thing to do is to remove the
unwanted new lines. Surely, you understand why I would like to remove
the <cr><lf>'s now... am I right?
Thank you...
Rich
C A Upsdell > wrote: SeaPlusPlus wrote:
> SeaPlusPlus wrote: >> I want to convert large files of prose to xhtml and so I need a >> way to remove unwanted line wraps. So, I'm looking for a freebee >> editor that has the capability of searching for a single "carriage >> return/line feed" or "line feed/carriage return" and removing >> them. I quess what I need is an editor that allows non-printable >> characters in it's search strings. >> Does anyone know of on that will allow this? Del Ferguson wrote:
If you no-wrap text and convert to XHTML in NoteTab, the special character conversions are obvious; however, if you want to do it the "hard" way, I can send you DOS versions of vi, ed, sed, etc. Works for me. You are right, NoteTab does not search special characters. Why bother if XHTML will do all the work? Are you assuming your text is screwed up? Del,
What are you talking about? XHTML won't 'do all the work' now would it?
As you can see, in my original question I said... "large files of prose" to "xhtml".
There're <carriage return>'s and/or <line feed>'s sprinkled throughout these "large files of prose". I want to, under program control, search them out, then replace or delete them (with extreme prejudice). ;-)
Listen, two line breaks in a row can be considered inter-paragraph and one line break standing alone (surrounded by printable text) can be considered intra-paragraph. I want my editor to search out and dispense the appropriate action.
I'd like to change all the line break pairs and any space preceeding or succeeding them to become...
<space><new line> </p><new line> <new line> <p><new line>
and I'd like to change all the single line breaks and any space preceeding or succeeding them to a single word space.
Then, voila, I'll have big chunk of the conversion process completed.
NoteTab Pro can do all that.
C A,
You are completely right about NoteTab being able to do all of that.
Using the "Modify" options, the text document can be changed
almost any way you want it. Sea found the way he wanted to do it,
anyway.
Cheers,
Del Ferguson
In our last episode,
<1114788162.bb3a5b381a3b298cf70ea6acc336deb1@teran ews>,
the lovely and talented SeaPlusPlus
broadcast on comp.infosystems. www.authoring.html: I want to convert large files of prose to xhtml and so I need a way to remove unwanted line wraps. So, I'm looking for a freebee editor that has the capability of searching for a single "carriage return/line feed" or "line feed/carriage return" and removing them. I quess what I need is an editor that allows non-printable characters in it's search strings.
Does anyone know of on that will allow this?
I don't know of any unix editors (joe, (x)emacs, vi, etc. and
their various flavors and work-alikes) that won't do this.
However, what you want to do is a one-liner in perl, and if you
are going to be doing things like this often on multiple files,
you should look into perl.
Also, you almost certainly do not want just to remove the end of
line character(s). You want to replace them with spaces.
However, in most elements, xhtml and html will treat linefeeds
as white space to begin with and will collapse multiple white
spaces, so I am not sure why you need to do this at all.
Thank you...
Rich
--
Lars Eighner ei*****@io.com http://www.larseighner.com/
War on Terrorism: Okay, Unleash OUR Extreme Fundamentalists
"... all of them who have tried to secularize America, I point the finger in
their face and say, 'You helped this happen.'" --Jerry Falwell
On Fri, 29 Apr 2005 16:45:59 -0400, SeaPlusPlus
<Se*********@hotmail.com> wrote:
[...] I have a lot of what you people may call "CONTENT"... ...in the form of text files ... formatted ... with <cr><lf> throughout.
This content would not flow correctly in a browser if all the new line information was allowed to remain in the (x)html generated from the content file.
Sure it will.
So, the correct and proper thing to do is to remove the unwanted new lines. Surely, you understand why I would like to remove the <cr><lf>'s now... am I right?
Sorry; no.
(X)HTML browsers will normalize any sequence of characters #9, #10, #13
and #32 into one single #32 (space character) at presentation time.
Even if your source markup should happen to look something like this...
<p
Hello
World,
here
I
come !
</p
Any decent browser will output that as...
Hello World, Here I come !
....white space normalization has been a basic function of HTML browsers
from the start and for SGML browsers even before them.
It seems to me that you got yourself hung up on something that you maybe
did not understand from the start. Set up an experiment with one of your
"content" files and see for your self.
I can guarantee you that any random CR,LF,LF,LF,CR,CR,LF (etc...)
combination in your content will not generate a line break in your
browser presentation, it will be converted to one single space character
if you have your content properly marked up.
One exception, the 'pre' element will suggest to browsers to keep
original formatting for presentation, but surely you are not planning on
using that element for large volumes of content?
--
Rex
SeaPlusPlus wrote: I have a lot of what you people may call "CONTENT" which is the whole idea of web pages to start with to provide information. The content I have is in the form of text files. These text files are formatted (as the vast majority of text files are formatted) with <cr><lf> throughout. This content would not flow correctly in a browser if all the new line information was allowed to remain in the (x)html generated from the content file.
New line sequences in (X)HTML are treated as ordinary whitespace.
Consecutive whitespace characters are collapsed automatically into a
single space. Therefore, there is no need for you to do this yourself.
For example, if your text file looks like this:
Line one
line two
line three
and you embed it into an HTML body as follows:
<body>
<p>
Line one
line two
line three
</p>
</body>
it will look as follows in a browser:
Line one line two line three
So unless a preliminary conversion would be entertaining for you, you
might as well not bother. This discussion thread is closed Replies have been disabled for this discussion. Similar topics
7 posts
views
Thread by Mike Kamermans |
last post: by
|
10 posts
views
Thread by AnnMarie |
last post: by
|
3 posts
views
Thread by mark4asp |
last post: by
|
3 posts
views
Thread by jj |
last post: by
|
reply
views
Thread by bettervssremoting |
last post: by
|
63 posts
views
Thread by John Salerno |
last post: by
| |
9 posts
views
Thread by maya |
last post: by
|
10 posts
views
Thread by Greg Lovern |
last post: by
| | | | | | | | | | |