Wiki-ing the Vista Monograph: Difference between revisions

From OLPC

Revision as of 03:10, 16 February 2008

I started with the MS Word version of the VistA Monograph, vista_monograph2005_06.doc, and then:

  • opened it with OpenOffice
  • saved it as HTML
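  • cleaned it up with Dave Raggett's HTML Tidy (http://tidy.sourceforge.net/)
    • the funny characters went away once I got the UTF-8 settings right in HTML Tidy
  • converted it to MediaWiki markup using HTML::WikiConverter (http://search.cpan.org/~diberri/HTML-WikiConverter-0.61/lib/HTML/WikiConverter.pm)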
  • cleaned out a lot of junk HTML, first by manual editing and later with sed scripts. Trust your browser to get things right without all this junk! The tags removed:
    • br
    • font
    • div
    • span
  • ran a script to remove excess blank lines
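
The last two cleanup steps can be sketched roughly as follows. The sed scripts themselves are not shown on this page, so this is a Python stand-in rather than the scripts actually used; the function names and the regular expressions are assumptions made for illustration. It strips the four tags listed above (keeping the text they wrap) and squeezes runs of blank lines down to one.

```python
import re

# Tags the notes above list as junk: br, font, div, span.
JUNK_TAGS = ("br", "font", "div", "span")

# Matches an opening, closing, or self-closing junk tag, with any attributes.
# The \b keeps longer tag names that merely start with "br", "div", etc. intact.
JUNK_TAG_RE = re.compile(
    r"</?(?:%s)\b[^>]*>" % "|".join(JUNK_TAGS),
    re.IGNORECASE,
)


def strip_junk_tags(text):
    """Remove the junk tags themselves but keep the text they wrap."""
    return JUNK_TAG_RE.sub("", text)


def collapse_blank_lines(text):
    """Squeeze any run of blank (or whitespace-only) lines down to one."""
    return re.sub(r"\n(?:[ \t]*\n){2,}", "\n\n", text)


def clean_wikitext(text):
    """Hypothetical helper: apply both cleanup steps in order."""
    return collapse_blank_lines(strip_junk_tags(text))


if __name__ == "__main__":
    sample = (
        '<div class="Section1"><font face="Arial">VistA</font> is '
        '<span lang="en">big</span>.<br>\n\n\n\n</div>Next line'
    )
    print(clean_wikitext(sample))
```

Note that dropping br outright joins text that was separated only by a line break, so a real run may prefer to replace it with a newline instead.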

It's no wonder it has glitches. I'm surprised it came out as well as it did.