Wiki-ing the Vista Monograph
Revision by Drew.einhorn
Revision as of 03:08, 16 February 2008
I started with the MS Word version of the VistA Monograph, vista_monograph2005_06.doc, and then:
- opened it with OpenOffice
- saved it as HTML
- cleaned it up with Dave Raggett's HTML Tidy
- converted it to MediaWiki markup using HTML::WikiConverter
- manually edited it to clean out a lot of junk HTML. Trust your browser to get things right without all this junk:
  - br
  - font
  - div
  - span
  - funny characters, which went away once I got the UTF-8 settings right in HTML Tidy
- Still need to redo this cleanup with a sed script.
- script to remove excess blank lines.
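The manual cleanup and the blank-line pass could be scripted roughly as follows. This is a minimal sketch in Python rather than the sed script mentioned above, and it is not what was actually run on the monograph. The function names `strip_junk_tags` and `collapse_blank_lines` are hypothetical, and turning `br` into a newline (instead of simply deleting it) is an assumption made so that adjacent words do not run together.

```python
import re

# Tags named in the list above; assumed here to be pure presentation markup.
JUNK_TAGS = ("font", "div", "span")


def strip_junk_tags(text: str) -> str:
    """Drop junk opening/closing tags but keep the text they wrap."""
    # Assumption: <br>, <br/> and <br /> become plain newlines.
    text = re.sub(r"<br\s*/?>", "\n", text, flags=re.IGNORECASE)
    # Match <font ...>, </font>, <div ...>, </div>, <span ...>, </span>.
    pattern = r"</?(?:%s)\b[^>]*>" % "|".join(JUNK_TAGS)
    return re.sub(pattern, "", text, flags=re.IGNORECASE)


def collapse_blank_lines(text: str) -> str:
    """Squeeze each run of blank or whitespace-only lines down to one."""
    kept = []
    previous_blank = False
    for line in text.split("\n"):
        blank = line.strip() == ""
        if blank and previous_blank:
            continue  # skip the extra blank line
        kept.append("" if blank else line)
        previous_blank = blank
    return "\n".join(kept)


if __name__ == "__main__":
    sample = '<div><font size="2">VistA</font> <span>Monograph</span></div>\n\n\n\nNext paragraph'
    print(collapse_blank_lines(strip_junk_tags(sample)))
```

A regex pass like this is only reasonable because the input has already been normalized by HTML Tidy; on arbitrary HTML a real parser would be the safer choice.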
It's no wonder it has glitches. I'm surprised it came out as well as it did.