View Single Post
  #4   Report Post  
Posted to microsoft.public.word.docmanagement
Rebecca Rebecca is offline
external usenet poster
 
Posts: 49
Default A Better Solution?

Yes, Robert, the scanned book contains dozens and dozens of color pictures,
and yes, that's the actual size of the file. I know, at first I thought it
was a bug (say, my computer was not reading the file size correctly) or I was
losing my eyesight or my mind.

It does seem impossible (and I've been experimenting with various scanned
images for years to get the file sizes down). Try it out and you'll see.
It's almost a miracle (if you've got a ton of scanned material in PDF files,
that is). And frankly, navigating PDF files in Acrobat is a pain (slow as
molasses, despite some nice functions, though). But with a Tablet PC, you
can ink and do other thinks with the images in MS Word with no problem, and
it still does not increase the file size too much (though I haven't been
highlighting that much yet).

I don't think there are other big (connecting) files lurking somewhere on my
hard disk, and if there are, well, this would be a first, too. I saved the
htm files as MS Word files, so go figure. But who knows, maybe you're right
-- maybe there's a catch somewhere. But as I recommended, try it with a big
PDF in Acrobat, save it as a htm file, open it in MS Word, and save it as a
MS Word doc. Viola!

"Robert M. Franz (RMF)" wrote:

Hi Rebecca

Rebecca wrote:
Using a Fijitsu ScanSnap scanner, I scanned (at a "very good" resolution) an
entire book of 350 pages, which has many color pictures, using Acrobat 7.0.
The resulting PDF was 154 megabytes. I then saved this PDF as a htm file,
opened it in MS Word 2003 or 2007 (same results in both), and saved it as a
word document. The resulting doc size is a teensy-weensy 21 KB, and the file


Wait a minute: how many high-color pictures are there in your 350 page
document? At 21 KByte, I doubt there can be much text in a 350 page Word
document, and no pictures to speak of. When you save as HTML, the
pictures and other stuff are most probably external (that's what Word
does, anyway, when you save a document to HTML there).

There seems to be either a couple of other big files around, or your
resulting document cannot be much more then a mere text file ...

BTW, have you tried saving as RTF from Acrobat?

Greetinx
Robert
--
/"\ ASCII Ribbon Campaign | MS
\ / | MVP
X Against HTML | for
/ \ in e-mail & news | Word