
Hi all,
Does anyone know if it's possible to make a list of all the unique words in
a document without having to strip out all the punctuation and formatting
first? I know you can make a concordance index, but for that you have to
know all the words in advance. I'm an amateur Java programmer, and in Java
you can use a StringTokenizer and a HashSet to do this for small strings.
Is there a way to do the same thing on a larger scale, for a Word file
that's a few hundred pages long? (I know Word uses a different programming
language; the Java was just an example.)
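
For reference, this is roughly the kind of thing I mean in Java. It's only a sketch: the class name UniqueWords, the method uniqueWords and the delimiter list are made up for this example, and it works on a plain String, not on a Word file.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.StringTokenizer;

class UniqueWords {
    // Characters treated as word separators: whitespace plus common punctuation.
    // This list is an assumption for the example; extend it as needed.
    private static final String DELIMITERS = " \t\n\r\f.,;:!?\"()[]{}<>/\\";

    // Returns the distinct words in text, lower-cased so that
    // "Hello" and "hello" count as the same word.
    static Set<String> uniqueWords(String text) {
        Set<String> words = new HashSet<>();
        StringTokenizer tokens = new StringTokenizer(text, DELIMITERS);
        while (tokens.hasMoreTokens()) {
            words.add(tokens.nextToken().toLowerCase());
        }
        return words;
    }

    public static void main(String[] args) {
        // The input string itself is left untouched; only the tokens are collected.
        System.out.println(uniqueWords("Hello, world! Hello again."));
    }
}
```

The tokenizer skips the punctuation while splitting, so the original text never has to be changed. A HashSet has no defined order, so swap in a TreeSet if you want the list sorted alphabetically.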
Thanks!


