Reply
 
Thread Tools Display Modes
  #1   Report Post  
Posted to microsoft.public.word.docmanagement
da9ve da9ve is offline
external usenet poster
 
Posts: 15
Default search/find by unicode codepoint

I've been trying to use the search/Find and Replace tool to search by Unicode
codepoint. I know I'm using the right syntax/punctuation, because when I
search on ^100, it correctly finds all the 'd' characters. But when I try
searching on a codepoint in a "private use" codepage *, it just doesn't find
anything.

Specifically, there are numerous 0xF020 characters plopped into this
document alongside (badly used) Symbol font versions of ‰¤ ‰¥ characters, and
Word displays them sometimes as combining dots underneath these adjacent
characters, and sometimes as little question-mark boxes (ï€*ï€*) beside them.

(Hm, when I paste them in this browser window, they show up as the combining
dots, and mess up the cursor position on the line of text where they sit.)
But either way, when I toggle the Unicode, Word tells me they're 0xF020's,
and that hex translates to 61472 decimal - and a search on that always
returns "Word has finished searching the document. The search item was not
found."

(* Yeah, long story: I get documents from the EMEA - European Medicines
Agency - and some of them have,... strange characters and formatting. I know
these private use characters aren't SUPPOSED to be used, but for some reason
they're there, and I need to be able to programmatically find and replace
them.

Is this just a known loophole, that Word doesn't bother searching on
characters in "private use" or undefined codepages, or is there something
else that it's hiding from me?
  #2   Report Post  
Posted to microsoft.public.word.docmanagement
Klaus Linke Klaus Linke is offline
external usenet poster
 
Posts: 413
Default search/find by unicode codepoint

To search for (decimal) Unicode, you can use ^u61472 ...

But since you usually don't know the decimal code, it's easier to copy/paste
the character into the Find dialog (Ctrl+C/Ctrl+V), or to type the hex code
into "Find what" (F020) and the toggle it with Ctrl+X (ToggleCharacterCode):
That works in most dialogs, too.

Regards,
Klaus

  #3   Report Post  
Posted to microsoft.public.word.docmanagement
Klaus Linke Klaus Linke is offline
external usenet poster
 
Posts: 413
Default search/find by unicode codepoint

To search for (decimal) Unicode, you can use ^u61472 ...

But since you usually don't know the decimal code, it's easier to copy/paste
the character into the Find dialog (Ctrl+C/Ctrl+V), or to type the hex code
into "Find what" (F020) and the toggle it with Ctrl+X (ToggleCharacterCode):
That works in most dialogs, too.

Regards,
Klaus

Reply
Thread Tools
Display Modes

Posting Rules

Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Symbol "font" - not Unicode compliant, how to Search/Replace? da9ve Microsoft Word Help 4 April 8th 10 07:08 PM
dealing forcefully with Unicode and non-Unicode characters da9ve Microsoft Word Help 4 April 9th 09 06:14 PM
SEARCH / FIND Jetpilot Microsoft Word Help 2 February 15th 06 10:02 PM
Search/Find two words that are not next to each other Philos Microsoft Word Help 12 August 30th 05 10:38 PM
Can't search using find a recipient Charlene Mailmerge 3 August 8th 05 07:21 AM


All times are GMT +1. The time now is 12:41 AM.

Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright ©2004-2024 Microsoft Office Word Forum - WordBanter.
The comments are property of their posters.
 

About Us

"It's about Microsoft Word"