HTML (XHTML) in netnews

Discussion:

(too old to reply)

Ivan Shmakov

2012-04-30 05:49:11 UTC

[Cross-posting to news:news.misc, for the discussion is not
quite about the newsreading software.]

[…]

Actually, I tend advocate /for/ the use of HTML (or, rather, XHTML)
in e-mail in news, despite that my user agent of choice has poor
support for it. (And I think of it as an opportunity to improve the
latter.)
At the very least, it allows one to avoid such an ad-hockery as
/slashes/ /for/ /slanted/, and *stars* *for* *bold*. Not to mention
"proper" tables, formatted vs. "preformatted" text distinction, and
inline MathML formulae and SVG graphics.

I have never seen an HTML e-mail that wouldn't have become better by
rewriting it in plain text.
People that use HTML in e-mail tend to be unaware of the underlying
medium of all "them fancy formatting features" and abuse them badly
and without taste. They paste lists and tables into Outlook directly
from MS Word and are happy about it, while the result it outright
horrible. Preformatted text viewed in a monospace font is always
better than that.

As I've said before, there're many different ways to author
“poor” HTML. It doesn't mean that any HTML is /necessarily/
bad.

As you have shown, it is possible to highlight words in plain text.

Yes, but then my Gnus shows /foo/ as highlighted foo, because it
thinks that slashes mean slanted text here, not the Unix
directory separators they in fact are.

Formulas can expressed like they are in programming languages or
LaTeX source,

I disagree.

Would LaTeX code be a preferable form of presentation, I guess
we'd see a lot of scientific journals switching to it from the
now-ubiquitous mathematic notation.

Using * to mean ⋅ or ×, or introducing “new” functions, such as
abs, expt, pow, sqrt, etc., doesn't seem sane, either.

That being said, Unicode-based plain-text allows ❘x❘, x¹²³, and
even √x̅. However, I don't know of any easy way to author such
formulae, while there're a few translators from a subset of
LaTeX to MathML.

and graphics should be converted to ASCII-art,

What for?

linked, or attached, depending on the circumstances.

The graphics, both linked and provided as separate MIME parts,
could be presented in a specific place of an HTML-based message
referencing it. Not so for a message based on plain-text.

Apart from the aesthetic reasons, I like plain text for it's utter
simplicity, transparecy, and suitabilty for automated searching and
processing.

I disagree. None of the automated formatters I know can handle
hyphenated text, and none of them could discern “verbatim” text
(such as a source code fragment, or ASCII art) from the
“normal”, formatted text.

Or think of the slanted (or bold) face examples above. What if
I wish to drop all the slanted face markup, while leaving Unix
absolute filenames alone?

One doesn't have to use a parser to comfortably read and operate
plain-text files.

I disagree. Think of parsing a formatted table, like:

Column 1 Columns 2, 3
Column 2 Column 3

Whatever there is 1 Foo bar
And here's the other 2 Baz

(If that's not hard enough, consider adding various box drawing
characters to the formatting, like it was common for the DOS-era
plain text.)

While it's easy to convert (or format) an HTML table to a
plain-text presentation like the above (think of Lynx or
html2text), it's much harder the other way around.

XML is much more suitable for automated processing. Consider,
e. g., XPath, XQuery, XSLT, xmlstarlet(1), etc.

[…]

--
FSF associate member #7257

Anton Shepelev

2012-04-30 22:12:02 UTC