ms word vs html ?

Karsten M. Self kmself at ix.netcom.com
Sun Oct 20 20:47:07 PDT 2002


on Mon, Oct 07, 2002, Colin Marquardt (colin at marquardt-home.de) wrote:
> "Karsten M. Self" <kmself at ix.netcom.com> writes:
> 
> > Went to work on the manager's HTML export.  It blew up both Netscape and
> > Mozilla.  Ran the docs through W3C's 'tidy' utility to clean up the
> > HTML.  One document finally validated (after a bunch of hand-edits to
> > fix errors).  The other blew up _tidy_ itself.  The HTML was _so_
> > nonstandard it blew up the HTML validator.  I ended up rendering the
> > document via Lynx and re-tagging it by hand.  In both cases, the tidied
> > HTML was ~30% the size of the original MS Word generated document.
> 
> Wasn't there a cleaning tool especially for Word-generated HTML
> code? Called demoronizer or so... indeed:
>    http://www.fourmilab.ch/webtools/demoroniser/

tidy is largely a replacement for the demoroniser.  It's more current,
more capable, and generally more standards conformant.
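
For what it's worth, here is a rough sketch of a tidy run aimed at
Word-exported HTML, wrapped in Python.  The helper names and file names
are hypothetical, and the flags are the ones documented for current HTML
Tidy releases, which may differ from the version discussed in this thread.

```python
import shutil
import subprocess


def tidy_command(src, dest):
    """Build a tidy invocation for Word-exported HTML (hypothetical helper)."""
    return [
        "tidy",
        "-quiet",              # suppress nonessential output
        "-clean",              # replace presentational tags with CSS
        "-bare",               # strip smart quotes, em dashes, etc.
        "--word-2000", "yes",  # drop Word 2000 specific markup
        "-output", dest,       # write the cleaned document here
        src,
    ]


def run_tidy(src, dest):
    """Run tidy if it is installed; return its exit status, or None if missing."""
    if shutil.which("tidy") is None:
        return None
    # tidy exits 0 when clean, 1 on warnings, 2 on errors
    return subprocess.run(tidy_command(src, dest)).returncode
```

Something like run_tidy("export.html", "clean.html") would then do the
cleanup pass.  A status of 2 means tidy gave up on errors, at which point
hand-editing is probably still the fallback.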

Peace.

-- 
Karsten M. Self <kmself at ix.netcom.com>        http://kmself.home.netcom.com/
 What Part of "Gestalt" don't you understand?
   Data corrupts.  Absolute data corrupts absolutely.
    -- Ed Self's corollary of Atkinson's Law.