#1
Posted to microsoft.public.word.docmanagement
Hi all,
Does anyone know whether it's possible to list all the unique words in a document without first stripping out the punctuation and formatting? I know you can build a concordance index, but that requires knowing all the words in advance. I'm an amateur Java programmer, and in Java you can do this for small strings with a StringTokenizer and a HashSet. Is there a way to do the same thing on a larger scale, for a Word file that's a few hundred pages long? (I realize Word uses a different programming language; the Java was just an example.) Thanks!
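To show what I mean by the small-string approach, here is a rough Java sketch. It uses a regular expression instead of a StringTokenizer so the source text is only read, never modified, and a TreeSet instead of a HashSet so the result comes out sorted. The class name `UniqueWords`, the method `uniqueWords`, and the definition of a "word" (a run of letters, optionally joined by internal apostrophes or hyphens) are my own assumptions for illustration. It also assumes the text is already available as a plain string; it does not read a Word file.

```java
import java.util.Locale;
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class UniqueWords {
    // Assumed definition of a word: one or more letters, optionally followed by
    // groups of an apostrophe or hyphen plus more letters (e.g. "cat's", "well-known").
    private static final Pattern WORD = Pattern.compile("\\p{L}+(?:['-]\\p{L}+)*");

    // Returns the distinct words in the text, lowercased and sorted.
    // The input string is scanned in place; punctuation is skipped, not removed.
    static Set<String> uniqueWords(String text) {
        Set<String> words = new TreeSet<>();
        Matcher m = WORD.matcher(text);
        while (m.find()) {
            words.add(m.group().toLowerCase(Locale.ROOT));
        }
        return words;
    }

    public static void main(String[] args) {
        String sample = "The cat sat. The cat's hat, the well-known hat!";
        System.out.println(uniqueWords(sample));
    }
}
```

Because a set only stores each word once, memory grows with the vocabulary rather than with the length of the text, which is why I suspect the same idea could scale to a few hundred pages if Word offers something equivalent.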
Thread | Forum
Word should catalog misspelled words to study. | Microsoft Word Help
How to find a series of words and then changing formats | Microsoft Word Help
Catalog all words in document | Microsoft Word Help
How can I find if there are doubles in my list of words? | Microsoft Word Help
Frequency count in Word | Microsoft Word Help