Wiki-ing the Vista Monograph

Revision as of 03:08, 16 February 2008

I started with the MS Word version of the VistA Monograph, vista_monograph2005_06.doc, and then:

  • opened it with OpenOffice
  • saved it as HTML
  • cleaned it up with Dave Raggett's HTML Tidy (http://tidy.sourceforge.net/)
  • converted it to MediaWiki markup using HTML::WikiConverter (http://search.cpan.org/~diberri/HTML-WikiConverter-0.61/lib/HTML/WikiConverter.pm)
  • manually edited out a lot of junk HTML. Trust your browser to get things right without all this junk:
    • br
    • font
    • div
    • span
    • funny characters (these went away once I got the UTF-8 settings right in HTML Tidy)
  • To do: redo the manual cleanup with a sed script.
  • a script to remove excess blank lines.
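
The manual cleanup and the blank-line pass in the list above could be scripted roughly as follows. This is only a sketch in Python rather than sed, and it is not the script actually used for this page. The function names (`strip_junk_tags`, `collapse_blank_lines`, `clean_wikitext`) are made up for illustration, and it assumes the junk tags can simply be deleted while the text between them is kept.

```python
import re

# Tags named in the list above. Treating them as safe to delete outright
# is an assumption of this sketch.
JUNK_TAGS = ("br", "font", "div", "span")

# Matches opening, closing, and self-closing forms, with or without attributes.
# The \b keeps longer tag names that merely start with one of these from matching.
_TAG_RE = re.compile(r"</?(?:%s)\b[^>]*>" % "|".join(JUNK_TAGS), re.IGNORECASE)


def strip_junk_tags(text):
    """Delete the junk tags but keep the text between them."""
    return _TAG_RE.sub("", text)


def collapse_blank_lines(text):
    """Reduce any run of blank (or whitespace-only) lines to one blank line."""
    lines = [line.rstrip() for line in text.split("\n")]
    return re.sub(r"\n{3,}", "\n\n", "\n".join(lines))


def clean_wikitext(text):
    """Run both passes: tag removal first, then blank-line cleanup."""
    return collapse_blank_lines(strip_junk_tags(text))


if __name__ == "__main__":
    import sys
    # Filter usage: read converted wiki text on stdin, write cleaned text to stdout.
    sys.stdout.write(clean_wikitext(sys.stdin.read()))
```

Run as a filter, it reads the converted wiki text on standard input and writes the cleaned text to standard output. Tags are removed before blank lines are collapsed because deleting a tag that sat alone on a line leaves a new blank line behind.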

It's no wonder it has glitches. I'm surprised it came out as well as it did.