Reply
 
Thread Tools Display Modes
  #1   Report Post  
Posted to microsoft.public.word.docmanagement
Jeff Hook Jeff Hook is offline
external usenet poster
 
Posts: 1
Default Removing or Converting HTML code when Web text is pasted into Word 2007

I've just searched this group for prior threads about this topic and I
didn't find any guidance.

I save large amounts of text which I copy from Web pages and which I
then paste into my local Word 2007 files.

I value the content which is displayed at one blog. The content is
detailed and it discusses difficult topics which I prefer to study
carefully by reviewing the content as digital text in my own local
files. When I've saved Web text to my local files I can use Word
2007's capabilities to reformat the text, to highlight sections in
different colors and fonts, to add footnotes, etc. This is all
helpful in my study of the content.

The author of the blog generously offers the blog's contents without
restrictions. He doesn't charge a subscription fee. He offers the
content to anyone who's interested and it seems clear that his
intention is to help as many readers as he can by freely publishing
his guidance. However, he's not directly managing the technical
operation of his own blog. He hired a contractor who created the blog
and who maintains the site. The contractor has disabled the blog's
context menu, apparently in an attempt to protect the author's content
from plagiarism. The author clearly wants to distribute his blog
content widely. I'm not "stealing" his content. I'm only saving it
to my own files so I can study it. I'm not republishing it in any
way, let alone claiming it as my own, etc. I do think my use of
copies of the content is ethical.

I can't simply select text at the blog and copy it to my own files as
I do with the content which I obtain from most sites but I am able to
view the site's Source Code by using the View\Source menu of my Web
browser, IE8. I can then select that digital text, and I can copy
that text either by using the "Copy" option in the context menu of
those source code pages, or by using the pages' "Edit" menus.

I'm able to paste that code into my local Word 2007 files as a way of
working around the disablement of the usual copy capability at the
site, but this leaves me with large amounts of unwanted HTML code in
the text which I paste to my local files from the blog.

I have three questions:

1. Is Word 2007 able to remove all HTML code from text of this type?

2. Is Word 2007 able to "convert" such code to "Rich Text Format"?

3. Can I only process this code in this way by using a separate
utility which is intended for this purpose?

I've tried to find guidance in Microsoft's own online Help pages, but
I wasn't successful. I think Microsoft offers some excellent Word
2007 Help content, but the Bing search interface doesn't seem to me to
be helpful.

I found that third party utilities can be obtained for the removal of
all HTML code from text (i.e. for the extraction of Plain Text from
HTML code) and for the conversion of HTML code to formatted text but I
wonder if Word 2007 can do this itself. I've come to think that this
excellent application can do almost anything I want it to do but I've
found that some of these impressive capabilities aren't see at the
surface level where I tend to operate in this application.
  #2   Report Post  
Posted to microsoft.public.word.docmanagement
Suzanne S. Barnhill Suzanne S. Barnhill is offline
external usenet poster
 
Posts: 33,624
Default Removing or Converting HTML code when Web text is pasted into Word 2007

Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the blog
at all? If so, is Ctrl+C to copy also disabled?

Another thought: Are you able to print the page? If so, can you print it to
a PDF printer or "Generic (Text Only)" printer? It's possible that either of
those would leave you with just as much of a mess as the raw HTML, but it
might be worth a try.

I can't offer any guidance about the conversion abilities of Word 2007
(though perhaps others can), but if you'll provide a link to the blog, I
might be able to figure out some other workaround.

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Jeff Hook" wrote in message
...
I've just searched this group for prior threads about this topic and I
didn't find any guidance.

I save large amounts of text which I copy from Web pages and which I
then paste into my local Word 2007 files.

I value the content which is displayed at one blog. The content is
detailed and it discusses difficult topics which I prefer to study
carefully by reviewing the content as digital text in my own local
files. When I've saved Web text to my local files I can use Word
2007's capabilities to reformat the text, to highlight sections in
different colors and fonts, to add footnotes, etc. This is all
helpful in my study of the content.

The author of the blog generously offers the blog's contents without
restrictions. He doesn't charge a subscription fee. He offers the
content to anyone who's interested and it seems clear that his
intention is to help as many readers as he can by freely publishing
his guidance. However, he's not directly managing the technical
operation of his own blog. He hired a contractor who created the blog
and who maintains the site. The contractor has disabled the blog's
context menu, apparently in an attempt to protect the author's content
from plagiarism. The author clearly wants to distribute his blog
content widely. I'm not "stealing" his content. I'm only saving it
to my own files so I can study it. I'm not republishing it in any
way, let alone claiming it as my own, etc. I do think my use of
copies of the content is ethical.

I can't simply select text at the blog and copy it to my own files as
I do with the content which I obtain from most sites but I am able to
view the site's Source Code by using the View\Source menu of my Web
browser, IE8. I can then select that digital text, and I can copy
that text either by using the "Copy" option in the context menu of
those source code pages, or by using the pages' "Edit" menus.

I'm able to paste that code into my local Word 2007 files as a way of
working around the disablement of the usual copy capability at the
site, but this leaves me with large amounts of unwanted HTML code in
the text which I paste to my local files from the blog.

I have three questions:

1. Is Word 2007 able to remove all HTML code from text of this type?

2. Is Word 2007 able to "convert" such code to "Rich Text Format"?

3. Can I only process this code in this way by using a separate
utility which is intended for this purpose?

I've tried to find guidance in Microsoft's own online Help pages, but
I wasn't successful. I think Microsoft offers some excellent Word
2007 Help content, but the Bing search interface doesn't seem to me to
be helpful.

I found that third party utilities can be obtained for the removal of
all HTML code from text (i.e. for the extraction of Plain Text from
HTML code) and for the conversion of HTML code to formatted text but I
wonder if Word 2007 can do this itself. I've come to think that this
excellent application can do almost anything I want it to do but I've
found that some of these impressive capabilities aren't see at the
surface level where I tend to operate in this application.


  #3   Report Post  
Posted to microsoft.public.word.docmanagement
Jeff H. Jeff H. is offline
external usenet poster
 
Posts: 1
Default Removing or Converting HTML code when Web text is pasted intoWord 2007


Suzanne, thanks so much for your **EXCELLENT** assistance for so many
years in this group! Wow!

1. You said "if you'll provide a link to the blog, I might be able to
figure out some other workaround."

Here's the blog's top page:

http://blog.kimblechartingsolutions.com/

but hang on. A trip to the blog may only be optional because it looks
like you've already solved my problem! I'll provide more information
for the benefit of other users who are experiencing this same problem:

2. You asked, "Can you select text in the blog at all? If so, is Ctrl
+C to copy also disabled?:

a. I can't select ANY of the digital text on any of the blog pages
while they're normally displayed as fully-formatted Web pages on the
tabs of my Web browser, IE8. Normal selection methods (clicking the
left mouse button, holding it down, and swiping the mouse cursor;
left-clicking to position the insertion point, then pressing and
holding the keyboard Shift key, then left-clicking the mouse again at
the point at which I wish the selection to end) produce no effect
whatsoever. I stress that I cant select the blogs digital text when
its shown as the content of €śnormal Web pages€ť because I am able to
select the same digital text when its shown as the blogs source
code, was Ill explain below.

b. ANY right-click on ANY of the blog's normal Web pages produces
only an obnoxious "yellow triangle with black exclamation point" small
rectangular error message which appears at the center of the screen to
the accompaniment of the equally-obnoxious default Windows XP error
"sounder." The error message is shown in my chosen "slate blue"
Windows XP Home system color, it's captioned "Message from webpage,"
and it fairly chortles "Content Protected ! " (The exclamation point
is included...)

The context menu has clearly been disabled completely.

This linked text appears at the absolute bottom of the content area of
each page at the blog:

Content Protected Using Blog Protector By: PcDrome.

The text is linked to the software's "product page" at:

http://www.pcdrome.com/blog-protector

The developer says, "Blog Protector is a wordpress plugin which can
Protect your valuable blog content as from getting copied. Many
Bloggers simply copy-paste the content from your blog without giving
credit to the original source. This plugin basically disable the text
selection & right click which will make the bloggers difficult to copy
the content.

Features

–*Disable right click on your blog.
–*Disable selection of text on your blog."

(Funny... €śBlog Protector€ť hasn't been used on the developers own
"product page" for the software€¦ I was able to copy that text and to
paste it to this field without any difficulty...)

c. Since I'm not able to select text in any way on the Web/blog
pages, I can't use the Ctrl+C keystroke combination from the keyboard.

d. As I explained, none of this is a problem with respect to my
access to the source code, which I can reach by using IE8's View
\Source menu. The code is displayed on a separate tab with an http:
prefix. The text is "formatted Plain Text," so to speak, as would be
shown by a code editor, such as Dreamweaver. Text appears in
different colors as numbered lines of code. I can select that text by
any method, the context menu is fully functional on those browser tabs
of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm
able to select that code without difficulty.

3. You said, "Are you able to print the page? If so, can you print it
to a PDF printer or "Generic (Text Only)" printer? It's possible that
either of those would leave you with just as much of a mess as the raw
HTML, but it might be worth a try."

With that suggestion you may have solved my problem! I so rarely
print any content to paper that I know next to nil about printing.
I've seen references to "printing to files" rather than to paper, and
I tried it a couple of times, purely out of curiosity, but I was never
able to achieve any success. This time I succeeded!

Because I'm not able to select the text I wished to save on the blog
pages I was forced to go directly to my browser's File\Print menu,
which I assumed would either print all of each Web page, or none of
it. The Print Dialog was displayed. The "Select Printer" field of
the dialog only offered my minimal monochromatic laser printer and the
"Microsoft XPS Document Writer" as printer options. I selected the
XPS Writer and I left-clicked the "Print" button on the Print dialog.
After a brief delay, I was shown the €śSave file as€ť dialog. I saved
the output as an XPS file. I was then able to open the files by
double-left-clicking them in Windows Explorer's Contents Pane in the
Windows Classic View in Windows XP Home Edition. I found these files
were crude RTF files which were displayed in my browser. My usual
method of saving digital text from Web pages is to paste the content
into my local Word 2007 files by using the home ribbon's "Paste\Paste
Special\Unformatted text" menu. I then reformat the text minimally,
but these XPS files preserve the formatting which is shown on the blog
pages, and I may gradually retain that formatting in my local files,
although Ill convert the text to my standard fonts and sizes.

I'm able to select text from the full XPS copies of the Web pages,
which are shown on multiple pages of normal word processor size in
these XPS files. Im able to copy that text as Id normally copy any
text on Web pages which havent been protected from copying.

I didn't even know what XPS files were, so I used Uniblue's File
Extension site to check the extension. The site identifies these as
Microsoft "XML Paper Specification" files at:

http://filext.com/file-extension/XPS

They link to Microsoft's own page at:

http://msdn.microsoft.com/en-us/wind.../gg463373.aspx

Suzanne, as I could have expected, you came through in my behalf! As
I said, my knowledge of printing is extremely limited and I know even
less about printing to files rather than to paper, but your suggestion
seems to be a solution to my problem!

You showed me that this method does seem to operate as an HTML to RTF
converter utility would operate, but it's quicker because it directly
converts the "protected" Web text to RTF which can then be copied
normally. I assume a converter utility would require me to copy the
source code, then paste the code into the converter, and then copy the
converter's output. This cuts out the source code step.

(It may be difficult to delete these temporary XLS files when I've
copied selected contents from them and have pasted that content to my
Log file, but I'll solve that minor problem when I've built up a
backlog of these files. I was told I couldn't delete the first few
files of this type because another application was using them, but
that problem may clear up when I close my browser &/or Word 2007.)

I don't know if this is happening in Word 2007 or not, but it works!
THANKS AGAIN!

Jeff Hook, NJ, USA

On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote:
Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped)

  #4   Report Post  
Posted to microsoft.public.word.docmanagement
Suzanne S. Barnhill Suzanne S. Barnhill is offline
external usenet poster
 
Posts: 33,624
Default Removing or Converting HTML code when Web text is pasted into Word 2007

Well, I'm glad that led you to a solution. I have the XPS printer on my
Windows 7 laptop (I think it's installed by default either with Windows or
with recent versions of Office), which is a good thing since I can't get the
laptop to see any printers on the network (or to see the network at all
except for Internet access), but I haven't had occasion to try it for
anything, and anytime I've searched for explanations of XPS online I've
ended up terminally confused!

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Jeff H." wrote in message
...

Suzanne, thanks so much for your **EXCELLENT** assistance for so many
years in this group! Wow!

1. You said "if you'll provide a link to the blog, I might be able to
figure out some other workaround."

Here's the blog's top page:

http://blog.kimblechartingsolutions.com/

but hang on. A trip to the blog may only be optional because it looks
like you've already solved my problem! I'll provide more information
for the benefit of other users who are experiencing this same problem:

2. You asked, "Can you select text in the blog at all? If so, is Ctrl
+C to copy also disabled?:

a. I can't select ANY of the digital text on any of the blog pages
while they're normally displayed as fully-formatted Web pages on the
tabs of my Web browser, IE8. Normal selection methods (clicking the
left mouse button, holding it down, and swiping the mouse cursor;
left-clicking to position the insertion point, then pressing and
holding the keyboard Shift key, then left-clicking the mouse again at
the point at which I wish the selection to end) produce no effect
whatsoever. I stress that I cant select the blogs digital text when
its shown as the content of €śnormal Web pages€ť because I am able to
select the same digital text when its shown as the blogs source
code, was Ill explain below.

b. ANY right-click on ANY of the blog's normal Web pages produces
only an obnoxious "yellow triangle with black exclamation point" small
rectangular error message which appears at the center of the screen to
the accompaniment of the equally-obnoxious default Windows XP error
"sounder." The error message is shown in my chosen "slate blue"
Windows XP Home system color, it's captioned "Message from webpage,"
and it fairly chortles "Content Protected ! " (The exclamation point
is included...)

The context menu has clearly been disabled completely.

This linked text appears at the absolute bottom of the content area of
each page at the blog:

Content Protected Using Blog Protector By: PcDrome.

The text is linked to the software's "product page" at:

http://www.pcdrome.com/blog-protector

The developer says, "Blog Protector is a wordpress plugin which can
Protect your valuable blog content as from getting copied. Many
Bloggers simply copy-paste the content from your blog without giving
credit to the original source. This plugin basically disable the text
selection & right click which will make the bloggers difficult to copy
the content.

Features

–*Disable right click on your blog.
–*Disable selection of text on your blog."

(Funny... €śBlog Protector€ť hasn't been used on the developers own
"product page" for the software€¦ I was able to copy that text and to
paste it to this field without any difficulty...)

c. Since I'm not able to select text in any way on the Web/blog
pages, I can't use the Ctrl+C keystroke combination from the keyboard.

d. As I explained, none of this is a problem with respect to my
access to the source code, which I can reach by using IE8's View
\Source menu. The code is displayed on a separate tab with an http:
prefix. The text is "formatted Plain Text," so to speak, as would be
shown by a code editor, such as Dreamweaver. Text appears in
different colors as numbered lines of code. I can select that text by
any method, the context menu is fully functional on those browser tabs
of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm
able to select that code without difficulty.

3. You said, "Are you able to print the page? If so, can you print it
to a PDF printer or "Generic (Text Only)" printer? It's possible that
either of those would leave you with just as much of a mess as the raw
HTML, but it might be worth a try."

With that suggestion you may have solved my problem! I so rarely
print any content to paper that I know next to nil about printing.
I've seen references to "printing to files" rather than to paper, and
I tried it a couple of times, purely out of curiosity, but I was never
able to achieve any success. This time I succeeded!

Because I'm not able to select the text I wished to save on the blog
pages I was forced to go directly to my browser's File\Print menu,
which I assumed would either print all of each Web page, or none of
it. The Print Dialog was displayed. The "Select Printer" field of
the dialog only offered my minimal monochromatic laser printer and the
"Microsoft XPS Document Writer" as printer options. I selected the
XPS Writer and I left-clicked the "Print" button on the Print dialog.
After a brief delay, I was shown the €śSave file as€ť dialog. I saved
the output as an XPS file. I was then able to open the files by
double-left-clicking them in Windows Explorer's Contents Pane in the
Windows Classic View in Windows XP Home Edition. I found these files
were crude RTF files which were displayed in my browser. My usual
method of saving digital text from Web pages is to paste the content
into my local Word 2007 files by using the home ribbon's "Paste\Paste
Special\Unformatted text" menu. I then reformat the text minimally,
but these XPS files preserve the formatting which is shown on the blog
pages, and I may gradually retain that formatting in my local files,
although Ill convert the text to my standard fonts and sizes.

I'm able to select text from the full XPS copies of the Web pages,
which are shown on multiple pages of normal word processor size in
these XPS files. Im able to copy that text as Id normally copy any
text on Web pages which havent been protected from copying.

I didn't even know what XPS files were, so I used Uniblue's File
Extension site to check the extension. The site identifies these as
Microsoft "XML Paper Specification" files at:

http://filext.com/file-extension/XPS

They link to Microsoft's own page at:

http://msdn.microsoft.com/en-us/wind.../gg463373.aspx

Suzanne, as I could have expected, you came through in my behalf! As
I said, my knowledge of printing is extremely limited and I know even
less about printing to files rather than to paper, but your suggestion
seems to be a solution to my problem!

You showed me that this method does seem to operate as an HTML to RTF
converter utility would operate, but it's quicker because it directly
converts the "protected" Web text to RTF which can then be copied
normally. I assume a converter utility would require me to copy the
source code, then paste the code into the converter, and then copy the
converter's output. This cuts out the source code step.

(It may be difficult to delete these temporary XLS files when I've
copied selected contents from them and have pasted that content to my
Log file, but I'll solve that minor problem when I've built up a
backlog of these files. I was told I couldn't delete the first few
files of this type because another application was using them, but
that problem may clear up when I close my browser &/or Word 2007.)

I don't know if this is happening in Word 2007 or not, but it works!
THANKS AGAIN!

Jeff Hook, NJ, USA

On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote:
Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the
blog at all? If so, is Ctrl+C to copy also disabled? (snipped)


  #5   Report Post  
Posted to microsoft.public.word.docmanagement
Stefan Blom[_3_] Stefan Blom[_3_] is offline
external usenet poster
 
Posts: 6,897
Default Removing or Converting HTML code when Web text is pasted into Word 2007

Maybe I'm missing something here, but I thought the XPS printer is simply Microsoft's equivalence of PDF?

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...

Well, I'm glad that led you to a solution. I have the XPS printer on my
Windows 7 laptop (I think it's installed by default either with Windows or
with recent versions of Office), which is a good thing since I can't get the
laptop to see any printers on the network (or to see the network at all
except for Internet access), but I haven't had occasion to try it for
anything, and anytime I've searched for explanations of XPS online I've
ended up terminally confused!

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Jeff H." wrote in message
...

Suzanne, thanks so much for your **EXCELLENT** assistance for so many
years in this group! Wow!

1. You said "if you'll provide a link to the blog, I might be able to
figure out some other workaround."

Here's the blog's top page:

http://blog.kimblechartingsolutions.com/

but hang on. A trip to the blog may only be optional because it looks
like you've already solved my problem! I'll provide more information
for the benefit of other users who are experiencing this same problem:

2. You asked, "Can you select text in the blog at all? If so, is Ctrl
+C to copy also disabled?:

a. I can't select ANY of the digital text on any of the blog pages
while they're normally displayed as fully-formatted Web pages on the
tabs of my Web browser, IE8. Normal selection methods (clicking the
left mouse button, holding it down, and swiping the mouse cursor;
left-clicking to position the insertion point, then pressing and
holding the keyboard Shift key, then left-clicking the mouse again at
the point at which I wish the selection to end) produce no effect
whatsoever. I stress that I cant select the blogs digital text when
its shown as the content of €śnormal Web pages€ť because I am able to
select the same digital text when its shown as the blogs source
code, was Ill explain below.

b. ANY right-click on ANY of the blog's normal Web pages produces
only an obnoxious "yellow triangle with black exclamation point" small
rectangular error message which appears at the center of the screen to
the accompaniment of the equally-obnoxious default Windows XP error
"sounder." The error message is shown in my chosen "slate blue"
Windows XP Home system color, it's captioned "Message from webpage,"
and it fairly chortles "Content Protected ! " (The exclamation point
is included...)

The context menu has clearly been disabled completely.

This linked text appears at the absolute bottom of the content area of
each page at the blog:

Content Protected Using Blog Protector By: PcDrome.

The text is linked to the software's "product page" at:

http://www.pcdrome.com/blog-protector

The developer says, "Blog Protector is a wordpress plugin which can
Protect your valuable blog content as from getting copied. Many
Bloggers simply copy-paste the content from your blog without giving
credit to the original source. This plugin basically disable the text
selection & right click which will make the bloggers difficult to copy
the content.

Features

–*Disable right click on your blog.
–*Disable selection of text on your blog."

(Funny... €śBlog Protector€ť hasn't been used on the developers own
"product page" for the software€¦ I was able to copy that text and to
paste it to this field without any difficulty...)

c. Since I'm not able to select text in any way on the Web/blog
pages, I can't use the Ctrl+C keystroke combination from the keyboard.

d. As I explained, none of this is a problem with respect to my
access to the source code, which I can reach by using IE8's View
\Source menu. The code is displayed on a separate tab with an http:
prefix. The text is "formatted Plain Text," so to speak, as would be
shown by a code editor, such as Dreamweaver. Text appears in
different colors as numbered lines of code. I can select that text by
any method, the context menu is fully functional on those browser tabs
of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm
able to select that code without difficulty.

3. You said, "Are you able to print the page? If so, can you print it
to a PDF printer or "Generic (Text Only)" printer? It's possible that
either of those would leave you with just as much of a mess as the raw
HTML, but it might be worth a try."

With that suggestion you may have solved my problem! I so rarely
print any content to paper that I know next to nil about printing.
I've seen references to "printing to files" rather than to paper, and
I tried it a couple of times, purely out of curiosity, but I was never
able to achieve any success. This time I succeeded!

Because I'm not able to select the text I wished to save on the blog
pages I was forced to go directly to my browser's File\Print menu,
which I assumed would either print all of each Web page, or none of
it. The Print Dialog was displayed. The "Select Printer" field of
the dialog only offered my minimal monochromatic laser printer and the
"Microsoft XPS Document Writer" as printer options. I selected the
XPS Writer and I left-clicked the "Print" button on the Print dialog.
After a brief delay, I was shown the €śSave file as€ť dialog. I saved
the output as an XPS file. I was then able to open the files by
double-left-clicking them in Windows Explorer's Contents Pane in the
Windows Classic View in Windows XP Home Edition. I found these files
were crude RTF files which were displayed in my browser. My usual
method of saving digital text from Web pages is to paste the content
into my local Word 2007 files by using the home ribbon's "Paste\Paste
Special\Unformatted text" menu. I then reformat the text minimally,
but these XPS files preserve the formatting which is shown on the blog
pages, and I may gradually retain that formatting in my local files,
although Ill convert the text to my standard fonts and sizes.

I'm able to select text from the full XPS copies of the Web pages,
which are shown on multiple pages of normal word processor size in
these XPS files. Im able to copy that text as Id normally copy any
text on Web pages which havent been protected from copying.

I didn't even know what XPS files were, so I used Uniblue's File
Extension site to check the extension. The site identifies these as
Microsoft "XML Paper Specification" files at:

http://filext.com/file-extension/XPS

They link to Microsoft's own page at:

http://msdn.microsoft.com/en-us/wind.../gg463373.aspx

Suzanne, as I could have expected, you came through in my behalf! As
I said, my knowledge of printing is extremely limited and I know even
less about printing to files rather than to paper, but your suggestion
seems to be a solution to my problem!

You showed me that this method does seem to operate as an HTML to RTF
converter utility would operate, but it's quicker because it directly
converts the "protected" Web text to RTF which can then be copied
normally. I assume a converter utility would require me to copy the
source code, then paste the code into the converter, and then copy the
converter's output. This cuts out the source code step.

(It may be difficult to delete these temporary XLS files when I've
copied selected contents from them and have pasted that content to my
Log file, but I'll solve that minor problem when I've built up a
backlog of these files. I was told I couldn't delete the first few
files of this type because another application was using them, but
that problem may clear up when I close my browser &/or Word 2007.)

I don't know if this is happening in Word 2007 or not, but it works!
THANKS AGAIN!

Jeff Hook, NJ, USA

On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote:
Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the
blog at all? If so, is Ctrl+C to copy also disabled? (snipped)



  #6   Report Post  
Posted to microsoft.public.word.docmanagement
VC[_2_] VC[_2_] is offline
external usenet poster
 
Posts: 6
Default Removing or Converting HTML code when Web text is pasted intoWord 2007

If you have MS OneNote, you could ty to take a screenshot of the blog,
and paste the image into a OneNote document.

You can then right click on the image and select "Copy Text From
Picture".

See http://www.technospot.net/blogs/copy...fice-one-note/

Might be worth a shot.
  #7   Report Post  
Posted to microsoft.public.word.docmanagement
Suzanne S. Barnhill Suzanne S. Barnhill is offline
external usenet poster
 
Posts: 33,624
Default Removing or Converting HTML code when Web text is pasted into Word 2007

Yeah, it sort of is. Except that I couldn't find any clue about an XPS
reader that's as universal as Adobe Reader, so I couldn't figure out what
you do with the .xps files after you have them, but apparently there's an
XPS reader built into Windows. I just checked Tools | Folder Options | File
Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for them,
which is interesting since my Printers folder doesn't include the XPS
Printer.

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Stefan Blom" wrote in message
...
Maybe I'm missing something here, but I thought the XPS printer is simply
Microsoft's equivalence of PDF?

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...
Well, I'm glad that led you to a solution. I have the XPS printer on my
Windows 7 laptop (I think it's installed by default either with Windows or
with recent versions of Office), which is a good thing since I can't get
the laptop to see any printers on the network (or to see the network at
all except for Internet access), but I haven't had occasion to try it for
anything, and anytime I've searched for explanations of XPS online I've
ended up terminally confused!

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Jeff H." wrote in message
...

Suzanne, thanks so much for your **EXCELLENT** assistance for so many
years in this group! Wow!

1. You said "if you'll provide a link to the blog, I might be able to
figure out some other workaround."

Here's the blog's top page:

http://blog.kimblechartingsolutions.com/

but hang on. A trip to the blog may only be optional because it looks
like you've already solved my problem! I'll provide more information
for the benefit of other users who are experiencing this same problem:

2. You asked, "Can you select text in the blog at all? If so, is Ctrl
+C to copy also disabled?:

a. I can't select ANY of the digital text on any of the blog pages
while they're normally displayed as fully-formatted Web pages on the
tabs of my Web browser, IE8. Normal selection methods (clicking the
left mouse button, holding it down, and swiping the mouse cursor;
left-clicking to position the insertion point, then pressing and
holding the keyboard Shift key, then left-clicking the mouse again at
the point at which I wish the selection to end) produce no effect
whatsoever. I stress that I cant select the blogs digital text when
its shown as the content of €śnormal Web pages€ť because I am able to
select the same digital text when its shown as the blogs source
code, was Ill explain below.

b. ANY right-click on ANY of the blog's normal Web pages produces
only an obnoxious "yellow triangle with black exclamation point" small
rectangular error message which appears at the center of the screen to
the accompaniment of the equally-obnoxious default Windows XP error
"sounder." The error message is shown in my chosen "slate blue"
Windows XP Home system color, it's captioned "Message from webpage,"
and it fairly chortles "Content Protected ! " (The exclamation point
is included...)

The context menu has clearly been disabled completely.

This linked text appears at the absolute bottom of the content area of
each page at the blog:

Content Protected Using Blog Protector By: PcDrome.

The text is linked to the software's "product page" at:

http://www.pcdrome.com/blog-protector

The developer says, "Blog Protector is a wordpress plugin which can
Protect your valuable blog content as from getting copied. Many
Bloggers simply copy-paste the content from your blog without giving
credit to the original source. This plugin basically disable the text
selection & right click which will make the bloggers difficult to copy
the content.

Features

–*Disable right click on your blog.
–*Disable selection of text on your blog."

(Funny... €śBlog Protector€ť hasn't been used on the developers own
"product page" for the software€¦ I was able to copy that text and to
paste it to this field without any difficulty...)

c. Since I'm not able to select text in any way on the Web/blog
pages, I can't use the Ctrl+C keystroke combination from the keyboard.

d. As I explained, none of this is a problem with respect to my
access to the source code, which I can reach by using IE8's View
\Source menu. The code is displayed on a separate tab with an http:
prefix. The text is "formatted Plain Text," so to speak, as would be
shown by a code editor, such as Dreamweaver. Text appears in
different colors as numbered lines of code. I can select that text by
any method, the context menu is fully functional on those browser tabs
of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm
able to select that code without difficulty.

3. You said, "Are you able to print the page? If so, can you print it
to a PDF printer or "Generic (Text Only)" printer? It's possible that
either of those would leave you with just as much of a mess as the raw
HTML, but it might be worth a try."

With that suggestion you may have solved my problem! I so rarely
print any content to paper that I know next to nil about printing.
I've seen references to "printing to files" rather than to paper, and
I tried it a couple of times, purely out of curiosity, but I was never
able to achieve any success. This time I succeeded!

Because I'm not able to select the text I wished to save on the blog
pages I was forced to go directly to my browser's File\Print menu,
which I assumed would either print all of each Web page, or none of
it. The Print Dialog was displayed. The "Select Printer" field of
the dialog only offered my minimal monochromatic laser printer and the
"Microsoft XPS Document Writer" as printer options. I selected the
XPS Writer and I left-clicked the "Print" button on the Print dialog.
After a brief delay, I was shown the €śSave file as€ť dialog. I saved
the output as an XPS file. I was then able to open the files by
double-left-clicking them in Windows Explorer's Contents Pane in the
Windows Classic View in Windows XP Home Edition. I found these files
were crude RTF files which were displayed in my browser. My usual
method of saving digital text from Web pages is to paste the content
into my local Word 2007 files by using the home ribbon's "Paste\Paste
Special\Unformatted text" menu. I then reformat the text minimally,
but these XPS files preserve the formatting which is shown on the blog
pages, and I may gradually retain that formatting in my local files,
although Ill convert the text to my standard fonts and sizes.

I'm able to select text from the full XPS copies of the Web pages,
which are shown on multiple pages of normal word processor size in
these XPS files. Im able to copy that text as Id normally copy any
text on Web pages which havent been protected from copying.

I didn't even know what XPS files were, so I used Uniblue's File
Extension site to check the extension. The site identifies these as
Microsoft "XML Paper Specification" files at:

http://filext.com/file-extension/XPS

They link to Microsoft's own page at:

http://msdn.microsoft.com/en-us/wind.../gg463373.aspx

Suzanne, as I could have expected, you came through in my behalf! As
I said, my knowledge of printing is extremely limited and I know even
less about printing to files rather than to paper, but your suggestion
seems to be a solution to my problem!

You showed me that this method does seem to operate as an HTML to RTF
converter utility would operate, but it's quicker because it directly
converts the "protected" Web text to RTF which can then be copied
normally. I assume a converter utility would require me to copy the
source code, then paste the code into the converter, and then copy the
converter's output. This cuts out the source code step.

(It may be difficult to delete these temporary XLS files when I've
copied selected contents from them and have pasted that content to my
Log file, but I'll solve that minor problem when I've built up a
backlog of these files. I was told I couldn't delete the first few
files of this type because another application was using them, but
that problem may clear up when I close my browser &/or Word 2007.)

I don't know if this is happening in Word 2007 or not, but it works!
THANKS AGAIN!

Jeff Hook, NJ, USA

On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote:
Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the
blog at all? If so, is Ctrl+C to copy also disabled? (snipped)



  #8   Report Post  
Posted to microsoft.public.word.docmanagement
Stefan Blom[_3_] Stefan Blom[_3_] is offline
external usenet poster
 
Posts: 6,897
Default Removing or Converting HTML code when Web text is pasted into Word 2007

The XPS reader is probably "less universal" than the Adobe Reader. :-)

My guess is that the XPS reader has been installed when you have updated Windows XP. To get the XPS "printer" you may have to install it manually though.

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...

Yeah, it sort of is. Except that I couldn't find any clue about an XPS
reader that's as universal as Adobe Reader, so I couldn't figure out what
you do with the .xps files after you have them, but apparently there's an
XPS reader built into Windows. I just checked Tools | Folder Options | File
Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for them,
which is interesting since my Printers folder doesn't include the XPS
Printer.

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Stefan Blom" wrote in message
...
Maybe I'm missing something here, but I thought the XPS printer is simply
Microsoft's equivalence of PDF?

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...
Well, I'm glad that led you to a solution. I have the XPS printer on my
Windows 7 laptop (I think it's installed by default either with Windows or
with recent versions of Office), which is a good thing since I can't get
the laptop to see any printers on the network (or to see the network at
all except for Internet access), but I haven't had occasion to try it for
anything, and anytime I've searched for explanations of XPS online I've
ended up terminally confused!

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Jeff H." wrote in message
...

Suzanne, thanks so much for your **EXCELLENT** assistance for so many
years in this group! Wow!

1. You said "if you'll provide a link to the blog, I might be able to
figure out some other workaround."

Here's the blog's top page:

http://blog.kimblechartingsolutions.com/

but hang on. A trip to the blog may only be optional because it looks
like you've already solved my problem! I'll provide more information
for the benefit of other users who are experiencing this same problem:

2. You asked, "Can you select text in the blog at all? If so, is Ctrl
+C to copy also disabled?:

a. I can't select ANY of the digital text on any of the blog pages
while they're normally displayed as fully-formatted Web pages on the
tabs of my Web browser, IE8. Normal selection methods (clicking the
left mouse button, holding it down, and swiping the mouse cursor;
left-clicking to position the insertion point, then pressing and
holding the keyboard Shift key, then left-clicking the mouse again at
the point at which I wish the selection to end) produce no effect
whatsoever. I stress that I cant select the blogs digital text when
its shown as the content of €śnormal Web pages€ť because I am able to
select the same digital text when its shown as the blogs source
code, was Ill explain below.

b. ANY right-click on ANY of the blog's normal Web pages produces
only an obnoxious "yellow triangle with black exclamation point" small
rectangular error message which appears at the center of the screen to
the accompaniment of the equally-obnoxious default Windows XP error
"sounder." The error message is shown in my chosen "slate blue"
Windows XP Home system color, it's captioned "Message from webpage,"
and it fairly chortles "Content Protected ! " (The exclamation point
is included...)

The context menu has clearly been disabled completely.

This linked text appears at the absolute bottom of the content area of
each page at the blog:

Content Protected Using Blog Protector By: PcDrome.

The text is linked to the software's "product page" at:

http://www.pcdrome.com/blog-protector

The developer says, "Blog Protector is a wordpress plugin which can
Protect your valuable blog content as from getting copied. Many
Bloggers simply copy-paste the content from your blog without giving
credit to the original source. This plugin basically disable the text
selection & right click which will make the bloggers difficult to copy
the content.

Features

–*Disable right click on your blog.
–*Disable selection of text on your blog."

(Funny... €śBlog Protector€ť hasn't been used on the developers own
"product page" for the software€¦ I was able to copy that text and to
paste it to this field without any difficulty...)

c. Since I'm not able to select text in any way on the Web/blog
pages, I can't use the Ctrl+C keystroke combination from the keyboard.

d. As I explained, none of this is a problem with respect to my
access to the source code, which I can reach by using IE8's View
\Source menu. The code is displayed on a separate tab with an http:
prefix. The text is "formatted Plain Text," so to speak, as would be
shown by a code editor, such as Dreamweaver. Text appears in
different colors as numbered lines of code. I can select that text by
any method, the context menu is fully functional on those browser tabs
of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm
able to select that code without difficulty.

3. You said, "Are you able to print the page? If so, can you print it
to a PDF printer or "Generic (Text Only)" printer? It's possible that
either of those would leave you with just as much of a mess as the raw
HTML, but it might be worth a try."

With that suggestion you may have solved my problem! I so rarely
print any content to paper that I know next to nil about printing.
I've seen references to "printing to files" rather than to paper, and
I tried it a couple of times, purely out of curiosity, but I was never
able to achieve any success. This time I succeeded!

Because I'm not able to select the text I wished to save on the blog
pages I was forced to go directly to my browser's File\Print menu,
which I assumed would either print all of each Web page, or none of
it. The Print Dialog was displayed. The "Select Printer" field of
the dialog only offered my minimal monochromatic laser printer and the
"Microsoft XPS Document Writer" as printer options. I selected the
XPS Writer and I left-clicked the "Print" button on the Print dialog.
After a brief delay, I was shown the €śSave file as€ť dialog. I saved
the output as an XPS file. I was then able to open the files by
double-left-clicking them in Windows Explorer's Contents Pane in the
Windows Classic View in Windows XP Home Edition. I found these files
were crude RTF files which were displayed in my browser. My usual
method of saving digital text from Web pages is to paste the content
into my local Word 2007 files by using the home ribbon's "Paste\Paste
Special\Unformatted text" menu. I then reformat the text minimally,
but these XPS files preserve the formatting which is shown on the blog
pages, and I may gradually retain that formatting in my local files,
although Ill convert the text to my standard fonts and sizes.

I'm able to select text from the full XPS copies of the Web pages,
which are shown on multiple pages of normal word processor size in
these XPS files. Im able to copy that text as Id normally copy any
text on Web pages which havent been protected from copying.

I didn't even know what XPS files were, so I used Uniblue's File
Extension site to check the extension. The site identifies these as
Microsoft "XML Paper Specification" files at:

http://filext.com/file-extension/XPS

They link to Microsoft's own page at:

http://msdn.microsoft.com/en-us/wind.../gg463373.aspx

Suzanne, as I could have expected, you came through in my behalf! As
I said, my knowledge of printing is extremely limited and I know even
less about printing to files rather than to paper, but your suggestion
seems to be a solution to my problem!

You showed me that this method does seem to operate as an HTML to RTF
converter utility would operate, but it's quicker because it directly
converts the "protected" Web text to RTF which can then be copied
normally. I assume a converter utility would require me to copy the
source code, then paste the code into the converter, and then copy the
converter's output. This cuts out the source code step.

(It may be difficult to delete these temporary XLS files when I've
copied selected contents from them and have pasted that content to my
Log file, but I'll solve that minor problem when I've built up a
backlog of these files. I was told I couldn't delete the first few
files of this type because another application was using them, but
that problem may clear up when I close my browser &/or Word 2007.)

I don't know if this is happening in Word 2007 or not, but it works!
THANKS AGAIN!

Jeff Hook, NJ, USA

On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote:
Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the
blog at all? If so, is Ctrl+C to copy also disabled? (snipped)


  #9   Report Post  
Posted to microsoft.public.word.docmanagement
Stefan Blom[_3_] Stefan Blom[_3_] is offline
external usenet poster
 
Posts: 6,897
Default Removing or Converting HTML code when Web text is pasted into Word 2007

My guess is that the XPS reader was installed when you updated Windows XP. The XPS "printer," on the other hand, may have to be installed manually.

By definition, that would make the XPS reader "less universal" than the Adobe reader. :-)

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...

Yeah, it sort of is. Except that I couldn't find any clue about an XPS
reader that's as universal as Adobe Reader, so I couldn't figure out what
you do with the .xps files after you have them, but apparently there's an
XPS reader built into Windows. I just checked Tools | Folder Options | File
Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for them,
which is interesting since my Printers folder doesn't include the XPS
Printer.

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Stefan Blom" wrote in message
...
Maybe I'm missing something here, but I thought the XPS printer is simply
Microsoft's equivalence of PDF?

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...
Well, I'm glad that led you to a solution. I have the XPS printer on my
Windows 7 laptop (I think it's installed by default either with Windows or
with recent versions of Office), which is a good thing since I can't get
the laptop to see any printers on the network (or to see the network at
all except for Internet access), but I haven't had occasion to try it for
anything, and anytime I've searched for explanations of XPS online I've
ended up terminally confused!

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Jeff H." wrote in message
...

Suzanne, thanks so much for your **EXCELLENT** assistance for so many
years in this group! Wow!

1. You said "if you'll provide a link to the blog, I might be able to
figure out some other workaround."

Here's the blog's top page:

http://blog.kimblechartingsolutions.com/

but hang on. A trip to the blog may only be optional because it looks
like you've already solved my problem! I'll provide more information
for the benefit of other users who are experiencing this same problem:

2. You asked, "Can you select text in the blog at all? If so, is Ctrl
+C to copy also disabled?:

a. I can't select ANY of the digital text on any of the blog pages
while they're normally displayed as fully-formatted Web pages on the
tabs of my Web browser, IE8. Normal selection methods (clicking the
left mouse button, holding it down, and swiping the mouse cursor;
left-clicking to position the insertion point, then pressing and
holding the keyboard Shift key, then left-clicking the mouse again at
the point at which I wish the selection to end) produce no effect
whatsoever. I stress that I cant select the blogs digital text when
its shown as the content of €śnormal Web pages€ť because I am able to
select the same digital text when its shown as the blogs source
code, was Ill explain below.

b. ANY right-click on ANY of the blog's normal Web pages produces
only an obnoxious "yellow triangle with black exclamation point" small
rectangular error message which appears at the center of the screen to
the accompaniment of the equally-obnoxious default Windows XP error
"sounder." The error message is shown in my chosen "slate blue"
Windows XP Home system color, it's captioned "Message from webpage,"
and it fairly chortles "Content Protected ! " (The exclamation point
is included...)

The context menu has clearly been disabled completely.

This linked text appears at the absolute bottom of the content area of
each page at the blog:

Content Protected Using Blog Protector By: PcDrome.

The text is linked to the software's "product page" at:

http://www.pcdrome.com/blog-protector

The developer says, "Blog Protector is a wordpress plugin which can
Protect your valuable blog content as from getting copied. Many
Bloggers simply copy-paste the content from your blog without giving
credit to the original source. This plugin basically disable the text
selection & right click which will make the bloggers difficult to copy
the content.

Features

–*Disable right click on your blog.
–*Disable selection of text on your blog."

(Funny... €śBlog Protector€ť hasn't been used on the developers own
"product page" for the software€¦ I was able to copy that text and to
paste it to this field without any difficulty...)

c. Since I'm not able to select text in any way on the Web/blog
pages, I can't use the Ctrl+C keystroke combination from the keyboard.

d. As I explained, none of this is a problem with respect to my
access to the source code, which I can reach by using IE8's View
\Source menu. The code is displayed on a separate tab with an http:
prefix. The text is "formatted Plain Text," so to speak, as would be
shown by a code editor, such as Dreamweaver. Text appears in
different colors as numbered lines of code. I can select that text by
any method, the context menu is fully functional on those browser tabs
of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm
able to select that code without difficulty.

3. You said, "Are you able to print the page? If so, can you print it
to a PDF printer or "Generic (Text Only)" printer? It's possible that
either of those would leave you with just as much of a mess as the raw
HTML, but it might be worth a try."

With that suggestion you may have solved my problem! I so rarely
print any content to paper that I know next to nil about printing.
I've seen references to "printing to files" rather than to paper, and
I tried it a couple of times, purely out of curiosity, but I was never
able to achieve any success. This time I succeeded!

Because I'm not able to select the text I wished to save on the blog
pages I was forced to go directly to my browser's File\Print menu,
which I assumed would either print all of each Web page, or none of
it. The Print Dialog was displayed. The "Select Printer" field of
the dialog only offered my minimal monochromatic laser printer and the
"Microsoft XPS Document Writer" as printer options. I selected the
XPS Writer and I left-clicked the "Print" button on the Print dialog.
After a brief delay, I was shown the €śSave file as€ť dialog. I saved
the output as an XPS file. I was then able to open the files by
double-left-clicking them in Windows Explorer's Contents Pane in the
Windows Classic View in Windows XP Home Edition. I found these files
were crude RTF files which were displayed in my browser. My usual
method of saving digital text from Web pages is to paste the content
into my local Word 2007 files by using the home ribbon's "Paste\Paste
Special\Unformatted text" menu. I then reformat the text minimally,
but these XPS files preserve the formatting which is shown on the blog
pages, and I may gradually retain that formatting in my local files,
although Ill convert the text to my standard fonts and sizes.

I'm able to select text from the full XPS copies of the Web pages,
which are shown on multiple pages of normal word processor size in
these XPS files. Im able to copy that text as Id normally copy any
text on Web pages which havent been protected from copying.

I didn't even know what XPS files were, so I used Uniblue's File
Extension site to check the extension. The site identifies these as
Microsoft "XML Paper Specification" files at:

http://filext.com/file-extension/XPS

They link to Microsoft's own page at:

http://msdn.microsoft.com/en-us/wind.../gg463373.aspx

Suzanne, as I could have expected, you came through in my behalf! As
I said, my knowledge of printing is extremely limited and I know even
less about printing to files rather than to paper, but your suggestion
seems to be a solution to my problem!

You showed me that this method does seem to operate as an HTML to RTF
converter utility would operate, but it's quicker because it directly
converts the "protected" Web text to RTF which can then be copied
normally. I assume a converter utility would require me to copy the
source code, then paste the code into the converter, and then copy the
converter's output. This cuts out the source code step.

(It may be difficult to delete these temporary XLS files when I've
copied selected contents from them and have pasted that content to my
Log file, but I'll solve that minor problem when I've built up a
backlog of these files. I was told I couldn't delete the first few
files of this type because another application was using them, but
that problem may clear up when I close my browser &/or Word 2007.)

I don't know if this is happening in Word 2007 or not, but it works!
THANKS AGAIN!

Jeff Hook, NJ, USA

On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote:
Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the
blog at all? If so, is Ctrl+C to copy also disabled? (snipped)


  #10   Report Post  
Posted to microsoft.public.word.docmanagement
Suzanne S. Barnhill Suzanne S. Barnhill is offline
external usenet poster
 
Posts: 33,624
Default Removing or Converting HTML code when Web text is pasted into Word 2007

Looked at another way, Adobe Reader doesn't include any capability of
*creating* PDFs, while the function of the XPS printer is to create XPS
files. But I won't argue that XPS is "less universal" since I still know
next to nothing about it whereas one is inundated with PDFs everywhere every
day.

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Stefan Blom" wrote in message
...
My guess is that the XPS reader was installed when you updated Windows XP.
The XPS "printer," on the other hand, may have to be installed manually.
By definition, that would make the XPS reader "less universal" than the
Adobe reader. :-)

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...
Yeah, it sort of is. Except that I couldn't find any clue about an XPS
reader that's as universal as Adobe Reader, so I couldn't figure out what
you do with the .xps files after you have them, but apparently there's an
XPS reader built into Windows. I just checked Tools | Folder Options |
File Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for
them, which is interesting since my Printers folder doesn't include the
XPS Printer.

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Stefan Blom" wrote in message
...
Maybe I'm missing something here, but I thought the XPS printer is simply
Microsoft's equivalence of PDF?

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message
...
Well, I'm glad that led you to a solution. I have the XPS printer on my
Windows 7 laptop (I think it's installed by default either with Windows
or with recent versions of Office), which is a good thing since I can't
get the laptop to see any printers on the network (or to see the network
at all except for Internet access), but I haven't had occasion to try it
for anything, and anytime I've searched for explanations of XPS online
I've ended up terminally confused!

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Jeff H." wrote in message
...

Suzanne, thanks so much for your **EXCELLENT** assistance for so many
years in this group! Wow!

1. You said "if you'll provide a link to the blog, I might be able to
figure out some other workaround."

Here's the blog's top page:

http://blog.kimblechartingsolutions.com/

but hang on. A trip to the blog may only be optional because it looks
like you've already solved my problem! I'll provide more information
for the benefit of other users who are experiencing this same problem:

2. You asked, "Can you select text in the blog at all? If so, is Ctrl
+C to copy also disabled?:

a. I can't select ANY of the digital text on any of the blog pages
while they're normally displayed as fully-formatted Web pages on the
tabs of my Web browser, IE8. Normal selection methods (clicking the
left mouse button, holding it down, and swiping the mouse cursor;
left-clicking to position the insertion point, then pressing and
holding the keyboard Shift key, then left-clicking the mouse again at
the point at which I wish the selection to end) produce no effect
whatsoever. I stress that I cant select the blogs digital text when
its shown as the content of €śnormal Web pages€ť because I am able to
select the same digital text when its shown as the blogs source
code, was Ill explain below.

b. ANY right-click on ANY of the blog's normal Web pages produces
only an obnoxious "yellow triangle with black exclamation point" small
rectangular error message which appears at the center of the screen to
the accompaniment of the equally-obnoxious default Windows XP error
"sounder." The error message is shown in my chosen "slate blue"
Windows XP Home system color, it's captioned "Message from webpage,"
and it fairly chortles "Content Protected ! " (The exclamation point
is included...)

The context menu has clearly been disabled completely.

This linked text appears at the absolute bottom of the content area of
each page at the blog:

Content Protected Using Blog Protector By: PcDrome.

The text is linked to the software's "product page" at:

http://www.pcdrome.com/blog-protector

The developer says, "Blog Protector is a wordpress plugin which can
Protect your valuable blog content as from getting copied. Many
Bloggers simply copy-paste the content from your blog without giving
credit to the original source. This plugin basically disable the text
selection & right click which will make the bloggers difficult to copy
the content.

Features

–*Disable right click on your blog.
–*Disable selection of text on your blog."

(Funny... €śBlog Protector€ť hasn't been used on the developers own
"product page" for the software€¦ I was able to copy that text and to
paste it to this field without any difficulty...)

c. Since I'm not able to select text in any way on the Web/blog
pages, I can't use the Ctrl+C keystroke combination from the keyboard.

d. As I explained, none of this is a problem with respect to my
access to the source code, which I can reach by using IE8's View
\Source menu. The code is displayed on a separate tab with an http:
prefix. The text is "formatted Plain Text," so to speak, as would be
shown by a code editor, such as Dreamweaver. Text appears in
different colors as numbered lines of code. I can select that text by
any method, the context menu is fully functional on those browser tabs
of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm
able to select that code without difficulty.

3. You said, "Are you able to print the page? If so, can you print it
to a PDF printer or "Generic (Text Only)" printer? It's possible that
either of those would leave you with just as much of a mess as the raw
HTML, but it might be worth a try."

With that suggestion you may have solved my problem! I so rarely
print any content to paper that I know next to nil about printing.
I've seen references to "printing to files" rather than to paper, and
I tried it a couple of times, purely out of curiosity, but I was never
able to achieve any success. This time I succeeded!

Because I'm not able to select the text I wished to save on the blog
pages I was forced to go directly to my browser's File\Print menu,
which I assumed would either print all of each Web page, or none of
it. The Print Dialog was displayed. The "Select Printer" field of
the dialog only offered my minimal monochromatic laser printer and the
"Microsoft XPS Document Writer" as printer options. I selected the
XPS Writer and I left-clicked the "Print" button on the Print dialog.
After a brief delay, I was shown the €śSave file as€ť dialog. I saved
the output as an XPS file. I was then able to open the files by
double-left-clicking them in Windows Explorer's Contents Pane in the
Windows Classic View in Windows XP Home Edition. I found these files
were crude RTF files which were displayed in my browser. My usual
method of saving digital text from Web pages is to paste the content
into my local Word 2007 files by using the home ribbon's "Paste\Paste
Special\Unformatted text" menu. I then reformat the text minimally,
but these XPS files preserve the formatting which is shown on the blog
pages, and I may gradually retain that formatting in my local files,
although Ill convert the text to my standard fonts and sizes.

I'm able to select text from the full XPS copies of the Web pages,
which are shown on multiple pages of normal word processor size in
these XPS files. Im able to copy that text as Id normally copy any
text on Web pages which havent been protected from copying.

I didn't even know what XPS files were, so I used Uniblue's File
Extension site to check the extension. The site identifies these as
Microsoft "XML Paper Specification" files at:

http://filext.com/file-extension/XPS

They link to Microsoft's own page at:

http://msdn.microsoft.com/en-us/wind.../gg463373.aspx

Suzanne, as I could have expected, you came through in my behalf! As
I said, my knowledge of printing is extremely limited and I know even
less about printing to files rather than to paper, but your suggestion
seems to be a solution to my problem!

You showed me that this method does seem to operate as an HTML to RTF
converter utility would operate, but it's quicker because it directly
converts the "protected" Web text to RTF which can then be copied
normally. I assume a converter utility would require me to copy the
source code, then paste the code into the converter, and then copy the
converter's output. This cuts out the source code step.

(It may be difficult to delete these temporary XLS files when I've
copied selected contents from them and have pasted that content to my
Log file, but I'll solve that minor problem when I've built up a
backlog of these files. I was told I couldn't delete the first few
files of this type because another application was using them, but
that problem may clear up when I close my browser &/or Word 2007.)

I don't know if this is happening in Word 2007 or not, but it works!
THANKS AGAIN!

Jeff Hook, NJ, USA

On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote:
Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the
blog at all? If so, is Ctrl+C to copy also disabled? (snipped)





  #11   Report Post  
Posted to microsoft.public.word.docmanagement
Stefan Blom[_3_] Stefan Blom[_3_] is offline
external usenet poster
 
Posts: 6,897
Default Removing or Converting HTML code when Web text is pasted into Word 2007

Good point.

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...

Looked at another way, Adobe Reader doesn't include any capability of
*creating* PDFs, while the function of the XPS printer is to create XPS
files. But I won't argue that XPS is "less universal" since I still know
next to nothing about it whereas one is inundated with PDFs everywhere every
day.

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Stefan Blom" wrote in message
...
My guess is that the XPS reader was installed when you updated Windows XP.
The XPS "printer," on the other hand, may have to be installed manually.
By definition, that would make the XPS reader "less universal" than the
Adobe reader. :-)

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message ...
Yeah, it sort of is. Except that I couldn't find any clue about an XPS
reader that's as universal as Adobe Reader, so I couldn't figure out what
you do with the .xps files after you have them, but apparently there's an
XPS reader built into Windows. I just checked Tools | Folder Options |
File Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for
them, which is interesting since my Printers folder doesn't include the
XPS Printer.

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Stefan Blom" wrote in message
...
Maybe I'm missing something here, but I thought the XPS printer is simply
Microsoft's equivalence of PDF?

Stefan Blom
Microsoft Word MVP



"Suzanne S. Barnhill" wrote in message
...
Well, I'm glad that led you to a solution. I have the XPS printer on my
Windows 7 laptop (I think it's installed by default either with Windows
or with recent versions of Office), which is a good thing since I can't
get the laptop to see any printers on the network (or to see the network
at all except for Internet access), but I haven't had occasion to try it
for anything, and anytime I've searched for explanations of XPS online
I've ended up terminally confused!

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA
http://word.mvps.org

"Jeff H." wrote in message
...

Suzanne, thanks so much for your **EXCELLENT** assistance for so many
years in this group! Wow!

1. You said "if you'll provide a link to the blog, I might be able to
figure out some other workaround."

Here's the blog's top page:

http://blog.kimblechartingsolutions.com/

but hang on. A trip to the blog may only be optional because it looks
like you've already solved my problem! I'll provide more information
for the benefit of other users who are experiencing this same problem:

2. You asked, "Can you select text in the blog at all? If so, is Ctrl
+C to copy also disabled?:

a. I can't select ANY of the digital text on any of the blog pages
while they're normally displayed as fully-formatted Web pages on the
tabs of my Web browser, IE8. Normal selection methods (clicking the
left mouse button, holding it down, and swiping the mouse cursor;
left-clicking to position the insertion point, then pressing and
holding the keyboard Shift key, then left-clicking the mouse again at
the point at which I wish the selection to end) produce no effect
whatsoever. I stress that I cant select the blogs digital text when
its shown as the content of €śnormal Web pages€ť because I am able to
select the same digital text when its shown as the blogs source
code, was Ill explain below.

b. ANY right-click on ANY of the blog's normal Web pages produces
only an obnoxious "yellow triangle with black exclamation point" small
rectangular error message which appears at the center of the screen to
the accompaniment of the equally-obnoxious default Windows XP error
"sounder." The error message is shown in my chosen "slate blue"
Windows XP Home system color, it's captioned "Message from webpage,"
and it fairly chortles "Content Protected ! " (The exclamation point
is included...)

The context menu has clearly been disabled completely.

This linked text appears at the absolute bottom of the content area of
each page at the blog:

Content Protected Using Blog Protector By: PcDrome.

The text is linked to the software's "product page" at:

http://www.pcdrome.com/blog-protector

The developer says, "Blog Protector is a wordpress plugin which can
Protect your valuable blog content as from getting copied. Many
Bloggers simply copy-paste the content from your blog without giving
credit to the original source. This plugin basically disable the text
selection & right click which will make the bloggers difficult to copy
the content.

Features

–*Disable right click on your blog.
–*Disable selection of text on your blog."

(Funny... €śBlog Protector€ť hasn't been used on the developers own
"product page" for the software€¦ I was able to copy that text and to
paste it to this field without any difficulty...)

c. Since I'm not able to select text in any way on the Web/blog
pages, I can't use the Ctrl+C keystroke combination from the keyboard.

d. As I explained, none of this is a problem with respect to my
access to the source code, which I can reach by using IE8's View
\Source menu. The code is displayed on a separate tab with an http:
prefix. The text is "formatted Plain Text," so to speak, as would be
shown by a code editor, such as Dreamweaver. Text appears in
different colors as numbered lines of code. I can select that text by
any method, the context menu is fully functional on those browser tabs
of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm
able to select that code without difficulty.

3. You said, "Are you able to print the page? If so, can you print it
to a PDF printer or "Generic (Text Only)" printer? It's possible that
either of those would leave you with just as much of a mess as the raw
HTML, but it might be worth a try."

With that suggestion you may have solved my problem! I so rarely
print any content to paper that I know next to nil about printing.
I've seen references to "printing to files" rather than to paper, and
I tried it a couple of times, purely out of curiosity, but I was never
able to achieve any success. This time I succeeded!

Because I'm not able to select the text I wished to save on the blog
pages I was forced to go directly to my browser's File\Print menu,
which I assumed would either print all of each Web page, or none of
it. The Print Dialog was displayed. The "Select Printer" field of
the dialog only offered my minimal monochromatic laser printer and the
"Microsoft XPS Document Writer" as printer options. I selected the
XPS Writer and I left-clicked the "Print" button on the Print dialog.
After a brief delay, I was shown the €śSave file as€ť dialog. I saved
the output as an XPS file. I was then able to open the files by
double-left-clicking them in Windows Explorer's Contents Pane in the
Windows Classic View in Windows XP Home Edition. I found these files
were crude RTF files which were displayed in my browser. My usual
method of saving digital text from Web pages is to paste the content
into my local Word 2007 files by using the home ribbon's "Paste\Paste
Special\Unformatted text" menu. I then reformat the text minimally,
but these XPS files preserve the formatting which is shown on the blog
pages, and I may gradually retain that formatting in my local files,
although Ill convert the text to my standard fonts and sizes.

I'm able to select text from the full XPS copies of the Web pages,
which are shown on multiple pages of normal word processor size in
these XPS files. Im able to copy that text as Id normally copy any
text on Web pages which havent been protected from copying.

I didn't even know what XPS files were, so I used Uniblue's File
Extension site to check the extension. The site identifies these as
Microsoft "XML Paper Specification" files at:

http://filext.com/file-extension/XPS

They link to Microsoft's own page at:

http://msdn.microsoft.com/en-us/wind.../gg463373.aspx

Suzanne, as I could have expected, you came through in my behalf! As
I said, my knowledge of printing is extremely limited and I know even
less about printing to files rather than to paper, but your suggestion
seems to be a solution to my problem!

You showed me that this method does seem to operate as an HTML to RTF
converter utility would operate, but it's quicker because it directly
converts the "protected" Web text to RTF which can then be copied
normally. I assume a converter utility would require me to copy the
source code, then paste the code into the converter, and then copy the
converter's output. This cuts out the source code step.

(It may be difficult to delete these temporary XLS files when I've
copied selected contents from them and have pasted that content to my
Log file, but I'll solve that minor problem when I've built up a
backlog of these files. I was told I couldn't delete the first few
files of this type because another application was using them, but
that problem may clear up when I close my browser &/or Word 2007.)

I don't know if this is happening in Word 2007 or not, but it works!
THANKS AGAIN!

Jeff Hook, NJ, USA

On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote:
Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the
blog at all? If so, is Ctrl+C to copy also disabled? (snipped)


  #12   Report Post  
Posted to microsoft.public.word.docmanagement
Robert2 Robert2 is offline
external usenet poster
 
Posts: 4
Default Removing or Converting HTML code when Web text is pasted into Word 2007

Microsoft .NET Framework 3.0 natively includes a dedicated XPS Document
Writer and Viewer. When people double-click a ´.xpsˇ document in the
Windows shell, the document is automatically displayed in the XPS Document
Viewer embedded in Internet Explorer v.6+.
  #13   Report Post  
Posted to microsoft.public.word.docmanagement
Robert2 Robert2 is offline
external usenet poster
 
Posts: 4
Default Removing or Converting HTML code when Web text is pasted into Word 2007

The World Browser (http://www.ioage.com/twen/) is an alternative Web
browser based on the Internet Explorer engine. It has a ´View | View In
Edit Modeˇ main menu command. When the page at
http://blog.kimblechartingsolutions.com/ is displayed in that ´Edit Modeˇ,
you can right-click within the main text, choose ´Select Allˇ, then press
´Ctrl+Cˇ. You can then paste the whole page into Word using ´Ctrl+Vˇ.
Reply
Thread Tools
Display Modes

Posting Rules

Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Removing gray background from pasted text document Bernie K Microsoft Word Help 2 May 15th 23 08:48 PM
Converting html pages into Word 2007 via Clipboard OleHoss Microsoft Word Help 1 December 23rd 09 02:55 AM
Removing spaces in pasted text. suzie marie Page Layout 1 August 22nd 07 12:22 AM
How can I see the html code in word 2007? Lotta New Users 1 May 4th 07 05:13 PM
how do i paste html code in personal web page tamikass New Users 1 April 26th 06 03:12 AM


All times are GMT +1. The time now is 10:08 PM.

Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright ©2004-2024 Microsoft Office Word Forum - WordBanter.
The comments are property of their posters.
 

About Us

"It's about Microsoft Word"