View Single Post
  #2   Report Post  
Posted to microsoft.public.word.newusers
Bob Buckland ?:-\) Bob   Buckland ?:-\) is offline
external usenet poster
 
Posts: 2,073
Default File size and HTML

Hi Rob,

Word 2007's new features (langauge neutral architecture, quick style sets, font pairs in themes...) can put quite a bit of
information into a Word web document to allow restoring to a .doc,.docX/M file type from a web page.

If you use Office Button=Save As=Other File Types=Web Page-Filtered
you may see quite a bit of that removed.

What is the DOS utility you're using to filter the HTML output?

=============
"Rob van Albada" wrote in message ...
Hi,

I am using Word2007 to edit a rather largish bilingual dictionary.
When I strip all superfluous HLML-tags, the size is around 6 MB.
The file produced by Word used to be around 1 MB larger, about 7 MB.
I use a DOS32 program to strip the file of its superfluous tags for
advanced processing.
However, lately, the file size has increased enormously.
Under Word-2007 (before I used Word-2000) the file size has increased
from 6 MB to 15.9 MB approx.
For instance, the header now contains a list of all available fonts
(several hundred, while I use only two: Times New Roman and Symbol).
Also, every two or three words the file contains totally superfluous
information of the font, language and font size.
How can I bring back the file size to something more normal?
Word slows down considerably with a file of this size.

Thanks for your help,

Rob in Amsterdam
--

Bob Buckland ?:-)
MS Office System Products MVP

*Courtesy is not expensive and can pay big dividends*