Home |
Search |
Today's Posts |
#1
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
I've just searched this group for prior threads about this topic and I
didn't find any guidance. I save large amounts of text which I copy from Web pages and which I then paste into my local Word 2007 files. I value the content which is displayed at one blog. The content is detailed and it discusses difficult topics which I prefer to study carefully by reviewing the content as digital text in my own local files. When I've saved Web text to my local files I can use Word 2007's capabilities to reformat the text, to highlight sections in different colors and fonts, to add footnotes, etc. This is all helpful in my study of the content. The author of the blog generously offers the blog's contents without restrictions. He doesn't charge a subscription fee. He offers the content to anyone who's interested and it seems clear that his intention is to help as many readers as he can by freely publishing his guidance. However, he's not directly managing the technical operation of his own blog. He hired a contractor who created the blog and who maintains the site. The contractor has disabled the blog's context menu, apparently in an attempt to protect the author's content from plagiarism. The author clearly wants to distribute his blog content widely. I'm not "stealing" his content. I'm only saving it to my own files so I can study it. I'm not republishing it in any way, let alone claiming it as my own, etc. I do think my use of copies of the content is ethical. I can't simply select text at the blog and copy it to my own files as I do with the content which I obtain from most sites but I am able to view the site's Source Code by using the View\Source menu of my Web browser, IE8. I can then select that digital text, and I can copy that text either by using the "Copy" option in the context menu of those source code pages, or by using the pages' "Edit" menus. I'm able to paste that code into my local Word 2007 files as a way of working around the disablement of the usual copy capability at the site, but this leaves me with large amounts of unwanted HTML code in the text which I paste to my local files from the blog. I have three questions: 1. Is Word 2007 able to remove all HTML code from text of this type? 2. Is Word 2007 able to "convert" such code to "Rich Text Format"? 3. Can I only process this code in this way by using a separate utility which is intended for this purpose? I've tried to find guidance in Microsoft's own online Help pages, but I wasn't successful. I think Microsoft offers some excellent Word 2007 Help content, but the Bing search interface doesn't seem to me to be helpful. I found that third party utilities can be obtained for the removal of all HTML code from text (i.e. for the extraction of Plain Text from HTML code) and for the conversion of HTML code to formatted text but I wonder if Word 2007 can do this itself. I've come to think that this excellent application can do almost anything I want it to do but I've found that some of these impressive capabilities aren't see at the surface level where I tend to operate in this application. |
#2
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
Since you seem to have a pretty good idea of what you're doing, this is
definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? Another thought: Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try. I can't offer any guidance about the conversion abilities of Word 2007 (though perhaps others can), but if you'll provide a link to the blog, I might be able to figure out some other workaround. -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Jeff Hook" wrote in message ... I've just searched this group for prior threads about this topic and I didn't find any guidance. I save large amounts of text which I copy from Web pages and which I then paste into my local Word 2007 files. I value the content which is displayed at one blog. The content is detailed and it discusses difficult topics which I prefer to study carefully by reviewing the content as digital text in my own local files. When I've saved Web text to my local files I can use Word 2007's capabilities to reformat the text, to highlight sections in different colors and fonts, to add footnotes, etc. This is all helpful in my study of the content. The author of the blog generously offers the blog's contents without restrictions. He doesn't charge a subscription fee. He offers the content to anyone who's interested and it seems clear that his intention is to help as many readers as he can by freely publishing his guidance. However, he's not directly managing the technical operation of his own blog. He hired a contractor who created the blog and who maintains the site. The contractor has disabled the blog's context menu, apparently in an attempt to protect the author's content from plagiarism. The author clearly wants to distribute his blog content widely. I'm not "stealing" his content. I'm only saving it to my own files so I can study it. I'm not republishing it in any way, let alone claiming it as my own, etc. I do think my use of copies of the content is ethical. I can't simply select text at the blog and copy it to my own files as I do with the content which I obtain from most sites but I am able to view the site's Source Code by using the View\Source menu of my Web browser, IE8. I can then select that digital text, and I can copy that text either by using the "Copy" option in the context menu of those source code pages, or by using the pages' "Edit" menus. I'm able to paste that code into my local Word 2007 files as a way of working around the disablement of the usual copy capability at the site, but this leaves me with large amounts of unwanted HTML code in the text which I paste to my local files from the blog. I have three questions: 1. Is Word 2007 able to remove all HTML code from text of this type? 2. Is Word 2007 able to "convert" such code to "Rich Text Format"? 3. Can I only process this code in this way by using a separate utility which is intended for this purpose? I've tried to find guidance in Microsoft's own online Help pages, but I wasn't successful. I think Microsoft offers some excellent Word 2007 Help content, but the Bing search interface doesn't seem to me to be helpful. I found that third party utilities can be obtained for the removal of all HTML code from text (i.e. for the extraction of Plain Text from HTML code) and for the conversion of HTML code to formatted text but I wonder if Word 2007 can do this itself. I've come to think that this excellent application can do almost anything I want it to do but I've found that some of these impressive capabilities aren't see at the surface level where I tend to operate in this application. |
#3
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted intoWord 2007
Suzanne, thanks so much for your **EXCELLENT** assistance for so many years in this group! Wow! 1. You said "if you'll provide a link to the blog, I might be able to figure out some other workaround." Here's the blog's top page: http://blog.kimblechartingsolutions.com/ but hang on. A trip to the blog may only be optional because it looks like you've already solved my problem! I'll provide more information for the benefit of other users who are experiencing this same problem: 2. You asked, "Can you select text in the blog at all? If so, is Ctrl +C to copy also disabled?: a. I can't select ANY of the digital text on any of the blog pages while they're normally displayed as fully-formatted Web pages on the tabs of my Web browser, IE8. Normal selection methods (clicking the left mouse button, holding it down, and swiping the mouse cursor; left-clicking to position the insertion point, then pressing and holding the keyboard Shift key, then left-clicking the mouse again at the point at which I wish the selection to end) produce no effect whatsoever. I stress that I cant select the blogs digital text when its shown as the content of €śnormal Web pages€ť because I am able to select the same digital text when its shown as the blogs source code, was Ill explain below. b. ANY right-click on ANY of the blog's normal Web pages produces only an obnoxious "yellow triangle with black exclamation point" small rectangular error message which appears at the center of the screen to the accompaniment of the equally-obnoxious default Windows XP error "sounder." The error message is shown in my chosen "slate blue" Windows XP Home system color, it's captioned "Message from webpage," and it fairly chortles "Content Protected ! " (The exclamation point is included...) The context menu has clearly been disabled completely. This linked text appears at the absolute bottom of the content area of each page at the blog: Content Protected Using Blog Protector By: PcDrome. The text is linked to the software's "product page" at: http://www.pcdrome.com/blog-protector The developer says, "Blog Protector is a wordpress plugin which can Protect your valuable blog content as from getting copied. Many Bloggers simply copy-paste the content from your blog without giving credit to the original source. This plugin basically disable the text selection & right click which will make the bloggers difficult to copy the content. Features –*Disable right click on your blog. –*Disable selection of text on your blog." (Funny... €śBlog Protector€ť hasn't been used on the developers own "product page" for the software€¦ I was able to copy that text and to paste it to this field without any difficulty...) c. Since I'm not able to select text in any way on the Web/blog pages, I can't use the Ctrl+C keystroke combination from the keyboard. d. As I explained, none of this is a problem with respect to my access to the source code, which I can reach by using IE8's View \Source menu. The code is displayed on a separate tab with an http: prefix. The text is "formatted Plain Text," so to speak, as would be shown by a code editor, such as Dreamweaver. Text appears in different colors as numbered lines of code. I can select that text by any method, the context menu is fully functional on those browser tabs of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm able to select that code without difficulty. 3. You said, "Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try." With that suggestion you may have solved my problem! I so rarely print any content to paper that I know next to nil about printing. I've seen references to "printing to files" rather than to paper, and I tried it a couple of times, purely out of curiosity, but I was never able to achieve any success. This time I succeeded! Because I'm not able to select the text I wished to save on the blog pages I was forced to go directly to my browser's File\Print menu, which I assumed would either print all of each Web page, or none of it. The Print Dialog was displayed. The "Select Printer" field of the dialog only offered my minimal monochromatic laser printer and the "Microsoft XPS Document Writer" as printer options. I selected the XPS Writer and I left-clicked the "Print" button on the Print dialog. After a brief delay, I was shown the €śSave file as€ť dialog. I saved the output as an XPS file. I was then able to open the files by double-left-clicking them in Windows Explorer's Contents Pane in the Windows Classic View in Windows XP Home Edition. I found these files were crude RTF files which were displayed in my browser. My usual method of saving digital text from Web pages is to paste the content into my local Word 2007 files by using the home ribbon's "Paste\Paste Special\Unformatted text" menu. I then reformat the text minimally, but these XPS files preserve the formatting which is shown on the blog pages, and I may gradually retain that formatting in my local files, although Ill convert the text to my standard fonts and sizes. I'm able to select text from the full XPS copies of the Web pages, which are shown on multiple pages of normal word processor size in these XPS files. Im able to copy that text as Id normally copy any text on Web pages which havent been protected from copying. I didn't even know what XPS files were, so I used Uniblue's File Extension site to check the extension. The site identifies these as Microsoft "XML Paper Specification" files at: http://filext.com/file-extension/XPS They link to Microsoft's own page at: http://msdn.microsoft.com/en-us/wind.../gg463373.aspx Suzanne, as I could have expected, you came through in my behalf! As I said, my knowledge of printing is extremely limited and I know even less about printing to files rather than to paper, but your suggestion seems to be a solution to my problem! You showed me that this method does seem to operate as an HTML to RTF converter utility would operate, but it's quicker because it directly converts the "protected" Web text to RTF which can then be copied normally. I assume a converter utility would require me to copy the source code, then paste the code into the converter, and then copy the converter's output. This cuts out the source code step. (It may be difficult to delete these temporary XLS files when I've copied selected contents from them and have pasted that content to my Log file, but I'll solve that minor problem when I've built up a backlog of these files. I was told I couldn't delete the first few files of this type because another application was using them, but that problem may clear up when I close my browser &/or Word 2007.) I don't know if this is happening in Word 2007 or not, but it works! THANKS AGAIN! Jeff Hook, NJ, USA On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote: Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped) |
#4
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
Well, I'm glad that led you to a solution. I have the XPS printer on my
Windows 7 laptop (I think it's installed by default either with Windows or with recent versions of Office), which is a good thing since I can't get the laptop to see any printers on the network (or to see the network at all except for Internet access), but I haven't had occasion to try it for anything, and anytime I've searched for explanations of XPS online I've ended up terminally confused! -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Jeff H." wrote in message ... Suzanne, thanks so much for your **EXCELLENT** assistance for so many years in this group! Wow! 1. You said "if you'll provide a link to the blog, I might be able to figure out some other workaround." Here's the blog's top page: http://blog.kimblechartingsolutions.com/ but hang on. A trip to the blog may only be optional because it looks like you've already solved my problem! I'll provide more information for the benefit of other users who are experiencing this same problem: 2. You asked, "Can you select text in the blog at all? If so, is Ctrl +C to copy also disabled?: a. I can't select ANY of the digital text on any of the blog pages while they're normally displayed as fully-formatted Web pages on the tabs of my Web browser, IE8. Normal selection methods (clicking the left mouse button, holding it down, and swiping the mouse cursor; left-clicking to position the insertion point, then pressing and holding the keyboard Shift key, then left-clicking the mouse again at the point at which I wish the selection to end) produce no effect whatsoever. I stress that I cant select the blogs digital text when its shown as the content of €śnormal Web pages€ť because I am able to select the same digital text when its shown as the blogs source code, was Ill explain below. b. ANY right-click on ANY of the blog's normal Web pages produces only an obnoxious "yellow triangle with black exclamation point" small rectangular error message which appears at the center of the screen to the accompaniment of the equally-obnoxious default Windows XP error "sounder." The error message is shown in my chosen "slate blue" Windows XP Home system color, it's captioned "Message from webpage," and it fairly chortles "Content Protected ! " (The exclamation point is included...) The context menu has clearly been disabled completely. This linked text appears at the absolute bottom of the content area of each page at the blog: Content Protected Using Blog Protector By: PcDrome. The text is linked to the software's "product page" at: http://www.pcdrome.com/blog-protector The developer says, "Blog Protector is a wordpress plugin which can Protect your valuable blog content as from getting copied. Many Bloggers simply copy-paste the content from your blog without giving credit to the original source. This plugin basically disable the text selection & right click which will make the bloggers difficult to copy the content. Features –*Disable right click on your blog. –*Disable selection of text on your blog." (Funny... €śBlog Protector€ť hasn't been used on the developers own "product page" for the software€¦ I was able to copy that text and to paste it to this field without any difficulty...) c. Since I'm not able to select text in any way on the Web/blog pages, I can't use the Ctrl+C keystroke combination from the keyboard. d. As I explained, none of this is a problem with respect to my access to the source code, which I can reach by using IE8's View \Source menu. The code is displayed on a separate tab with an http: prefix. The text is "formatted Plain Text," so to speak, as would be shown by a code editor, such as Dreamweaver. Text appears in different colors as numbered lines of code. I can select that text by any method, the context menu is fully functional on those browser tabs of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm able to select that code without difficulty. 3. You said, "Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try." With that suggestion you may have solved my problem! I so rarely print any content to paper that I know next to nil about printing. I've seen references to "printing to files" rather than to paper, and I tried it a couple of times, purely out of curiosity, but I was never able to achieve any success. This time I succeeded! Because I'm not able to select the text I wished to save on the blog pages I was forced to go directly to my browser's File\Print menu, which I assumed would either print all of each Web page, or none of it. The Print Dialog was displayed. The "Select Printer" field of the dialog only offered my minimal monochromatic laser printer and the "Microsoft XPS Document Writer" as printer options. I selected the XPS Writer and I left-clicked the "Print" button on the Print dialog. After a brief delay, I was shown the €śSave file as€ť dialog. I saved the output as an XPS file. I was then able to open the files by double-left-clicking them in Windows Explorer's Contents Pane in the Windows Classic View in Windows XP Home Edition. I found these files were crude RTF files which were displayed in my browser. My usual method of saving digital text from Web pages is to paste the content into my local Word 2007 files by using the home ribbon's "Paste\Paste Special\Unformatted text" menu. I then reformat the text minimally, but these XPS files preserve the formatting which is shown on the blog pages, and I may gradually retain that formatting in my local files, although Ill convert the text to my standard fonts and sizes. I'm able to select text from the full XPS copies of the Web pages, which are shown on multiple pages of normal word processor size in these XPS files. Im able to copy that text as Id normally copy any text on Web pages which havent been protected from copying. I didn't even know what XPS files were, so I used Uniblue's File Extension site to check the extension. The site identifies these as Microsoft "XML Paper Specification" files at: http://filext.com/file-extension/XPS They link to Microsoft's own page at: http://msdn.microsoft.com/en-us/wind.../gg463373.aspx Suzanne, as I could have expected, you came through in my behalf! As I said, my knowledge of printing is extremely limited and I know even less about printing to files rather than to paper, but your suggestion seems to be a solution to my problem! You showed me that this method does seem to operate as an HTML to RTF converter utility would operate, but it's quicker because it directly converts the "protected" Web text to RTF which can then be copied normally. I assume a converter utility would require me to copy the source code, then paste the code into the converter, and then copy the converter's output. This cuts out the source code step. (It may be difficult to delete these temporary XLS files when I've copied selected contents from them and have pasted that content to my Log file, but I'll solve that minor problem when I've built up a backlog of these files. I was told I couldn't delete the first few files of this type because another application was using them, but that problem may clear up when I close my browser &/or Word 2007.) I don't know if this is happening in Word 2007 or not, but it works! THANKS AGAIN! Jeff Hook, NJ, USA On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote: Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped) |
#5
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
Maybe I'm missing something here, but I thought the XPS printer is simply Microsoft's equivalence of PDF?
Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Well, I'm glad that led you to a solution. I have the XPS printer on my Windows 7 laptop (I think it's installed by default either with Windows or with recent versions of Office), which is a good thing since I can't get the laptop to see any printers on the network (or to see the network at all except for Internet access), but I haven't had occasion to try it for anything, and anytime I've searched for explanations of XPS online I've ended up terminally confused! -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Jeff H." wrote in message ... Suzanne, thanks so much for your **EXCELLENT** assistance for so many years in this group! Wow! 1. You said "if you'll provide a link to the blog, I might be able to figure out some other workaround." Here's the blog's top page: http://blog.kimblechartingsolutions.com/ but hang on. A trip to the blog may only be optional because it looks like you've already solved my problem! I'll provide more information for the benefit of other users who are experiencing this same problem: 2. You asked, "Can you select text in the blog at all? If so, is Ctrl +C to copy also disabled?: a. I can't select ANY of the digital text on any of the blog pages while they're normally displayed as fully-formatted Web pages on the tabs of my Web browser, IE8. Normal selection methods (clicking the left mouse button, holding it down, and swiping the mouse cursor; left-clicking to position the insertion point, then pressing and holding the keyboard Shift key, then left-clicking the mouse again at the point at which I wish the selection to end) produce no effect whatsoever. I stress that I cant select the blogs digital text when its shown as the content of €śnormal Web pages€ť because I am able to select the same digital text when its shown as the blogs source code, was Ill explain below. b. ANY right-click on ANY of the blog's normal Web pages produces only an obnoxious "yellow triangle with black exclamation point" small rectangular error message which appears at the center of the screen to the accompaniment of the equally-obnoxious default Windows XP error "sounder." The error message is shown in my chosen "slate blue" Windows XP Home system color, it's captioned "Message from webpage," and it fairly chortles "Content Protected ! " (The exclamation point is included...) The context menu has clearly been disabled completely. This linked text appears at the absolute bottom of the content area of each page at the blog: Content Protected Using Blog Protector By: PcDrome. The text is linked to the software's "product page" at: http://www.pcdrome.com/blog-protector The developer says, "Blog Protector is a wordpress plugin which can Protect your valuable blog content as from getting copied. Many Bloggers simply copy-paste the content from your blog without giving credit to the original source. This plugin basically disable the text selection & right click which will make the bloggers difficult to copy the content. Features –*Disable right click on your blog. –*Disable selection of text on your blog." (Funny... €śBlog Protector€ť hasn't been used on the developers own "product page" for the software€¦ I was able to copy that text and to paste it to this field without any difficulty...) c. Since I'm not able to select text in any way on the Web/blog pages, I can't use the Ctrl+C keystroke combination from the keyboard. d. As I explained, none of this is a problem with respect to my access to the source code, which I can reach by using IE8's View \Source menu. The code is displayed on a separate tab with an http: prefix. The text is "formatted Plain Text," so to speak, as would be shown by a code editor, such as Dreamweaver. Text appears in different colors as numbered lines of code. I can select that text by any method, the context menu is fully functional on those browser tabs of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm able to select that code without difficulty. 3. You said, "Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try." With that suggestion you may have solved my problem! I so rarely print any content to paper that I know next to nil about printing. I've seen references to "printing to files" rather than to paper, and I tried it a couple of times, purely out of curiosity, but I was never able to achieve any success. This time I succeeded! Because I'm not able to select the text I wished to save on the blog pages I was forced to go directly to my browser's File\Print menu, which I assumed would either print all of each Web page, or none of it. The Print Dialog was displayed. The "Select Printer" field of the dialog only offered my minimal monochromatic laser printer and the "Microsoft XPS Document Writer" as printer options. I selected the XPS Writer and I left-clicked the "Print" button on the Print dialog. After a brief delay, I was shown the €śSave file as€ť dialog. I saved the output as an XPS file. I was then able to open the files by double-left-clicking them in Windows Explorer's Contents Pane in the Windows Classic View in Windows XP Home Edition. I found these files were crude RTF files which were displayed in my browser. My usual method of saving digital text from Web pages is to paste the content into my local Word 2007 files by using the home ribbon's "Paste\Paste Special\Unformatted text" menu. I then reformat the text minimally, but these XPS files preserve the formatting which is shown on the blog pages, and I may gradually retain that formatting in my local files, although Ill convert the text to my standard fonts and sizes. I'm able to select text from the full XPS copies of the Web pages, which are shown on multiple pages of normal word processor size in these XPS files. Im able to copy that text as Id normally copy any text on Web pages which havent been protected from copying. I didn't even know what XPS files were, so I used Uniblue's File Extension site to check the extension. The site identifies these as Microsoft "XML Paper Specification" files at: http://filext.com/file-extension/XPS They link to Microsoft's own page at: http://msdn.microsoft.com/en-us/wind.../gg463373.aspx Suzanne, as I could have expected, you came through in my behalf! As I said, my knowledge of printing is extremely limited and I know even less about printing to files rather than to paper, but your suggestion seems to be a solution to my problem! You showed me that this method does seem to operate as an HTML to RTF converter utility would operate, but it's quicker because it directly converts the "protected" Web text to RTF which can then be copied normally. I assume a converter utility would require me to copy the source code, then paste the code into the converter, and then copy the converter's output. This cuts out the source code step. (It may be difficult to delete these temporary XLS files when I've copied selected contents from them and have pasted that content to my Log file, but I'll solve that minor problem when I've built up a backlog of these files. I was told I couldn't delete the first few files of this type because another application was using them, but that problem may clear up when I close my browser &/or Word 2007.) I don't know if this is happening in Word 2007 or not, but it works! THANKS AGAIN! Jeff Hook, NJ, USA On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote: Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped) |
#6
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted intoWord 2007
If you have MS OneNote, you could ty to take a screenshot of the blog,
and paste the image into a OneNote document. You can then right click on the image and select "Copy Text From Picture". See http://www.technospot.net/blogs/copy...fice-one-note/ Might be worth a shot. |
#7
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
Yeah, it sort of is. Except that I couldn't find any clue about an XPS
reader that's as universal as Adobe Reader, so I couldn't figure out what you do with the .xps files after you have them, but apparently there's an XPS reader built into Windows. I just checked Tools | Folder Options | File Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for them, which is interesting since my Printers folder doesn't include the XPS Printer. -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Stefan Blom" wrote in message ... Maybe I'm missing something here, but I thought the XPS printer is simply Microsoft's equivalence of PDF? Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Well, I'm glad that led you to a solution. I have the XPS printer on my Windows 7 laptop (I think it's installed by default either with Windows or with recent versions of Office), which is a good thing since I can't get the laptop to see any printers on the network (or to see the network at all except for Internet access), but I haven't had occasion to try it for anything, and anytime I've searched for explanations of XPS online I've ended up terminally confused! -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Jeff H." wrote in message ... Suzanne, thanks so much for your **EXCELLENT** assistance for so many years in this group! Wow! 1. You said "if you'll provide a link to the blog, I might be able to figure out some other workaround." Here's the blog's top page: http://blog.kimblechartingsolutions.com/ but hang on. A trip to the blog may only be optional because it looks like you've already solved my problem! I'll provide more information for the benefit of other users who are experiencing this same problem: 2. You asked, "Can you select text in the blog at all? If so, is Ctrl +C to copy also disabled?: a. I can't select ANY of the digital text on any of the blog pages while they're normally displayed as fully-formatted Web pages on the tabs of my Web browser, IE8. Normal selection methods (clicking the left mouse button, holding it down, and swiping the mouse cursor; left-clicking to position the insertion point, then pressing and holding the keyboard Shift key, then left-clicking the mouse again at the point at which I wish the selection to end) produce no effect whatsoever. I stress that I cant select the blogs digital text when its shown as the content of €śnormal Web pages€ť because I am able to select the same digital text when its shown as the blogs source code, was Ill explain below. b. ANY right-click on ANY of the blog's normal Web pages produces only an obnoxious "yellow triangle with black exclamation point" small rectangular error message which appears at the center of the screen to the accompaniment of the equally-obnoxious default Windows XP error "sounder." The error message is shown in my chosen "slate blue" Windows XP Home system color, it's captioned "Message from webpage," and it fairly chortles "Content Protected ! " (The exclamation point is included...) The context menu has clearly been disabled completely. This linked text appears at the absolute bottom of the content area of each page at the blog: Content Protected Using Blog Protector By: PcDrome. The text is linked to the software's "product page" at: http://www.pcdrome.com/blog-protector The developer says, "Blog Protector is a wordpress plugin which can Protect your valuable blog content as from getting copied. Many Bloggers simply copy-paste the content from your blog without giving credit to the original source. This plugin basically disable the text selection & right click which will make the bloggers difficult to copy the content. Features –*Disable right click on your blog. –*Disable selection of text on your blog." (Funny... €śBlog Protector€ť hasn't been used on the developers own "product page" for the software€¦ I was able to copy that text and to paste it to this field without any difficulty...) c. Since I'm not able to select text in any way on the Web/blog pages, I can't use the Ctrl+C keystroke combination from the keyboard. d. As I explained, none of this is a problem with respect to my access to the source code, which I can reach by using IE8's View \Source menu. The code is displayed on a separate tab with an http: prefix. The text is "formatted Plain Text," so to speak, as would be shown by a code editor, such as Dreamweaver. Text appears in different colors as numbered lines of code. I can select that text by any method, the context menu is fully functional on those browser tabs of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm able to select that code without difficulty. 3. You said, "Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try." With that suggestion you may have solved my problem! I so rarely print any content to paper that I know next to nil about printing. I've seen references to "printing to files" rather than to paper, and I tried it a couple of times, purely out of curiosity, but I was never able to achieve any success. This time I succeeded! Because I'm not able to select the text I wished to save on the blog pages I was forced to go directly to my browser's File\Print menu, which I assumed would either print all of each Web page, or none of it. The Print Dialog was displayed. The "Select Printer" field of the dialog only offered my minimal monochromatic laser printer and the "Microsoft XPS Document Writer" as printer options. I selected the XPS Writer and I left-clicked the "Print" button on the Print dialog. After a brief delay, I was shown the €śSave file as€ť dialog. I saved the output as an XPS file. I was then able to open the files by double-left-clicking them in Windows Explorer's Contents Pane in the Windows Classic View in Windows XP Home Edition. I found these files were crude RTF files which were displayed in my browser. My usual method of saving digital text from Web pages is to paste the content into my local Word 2007 files by using the home ribbon's "Paste\Paste Special\Unformatted text" menu. I then reformat the text minimally, but these XPS files preserve the formatting which is shown on the blog pages, and I may gradually retain that formatting in my local files, although Ill convert the text to my standard fonts and sizes. I'm able to select text from the full XPS copies of the Web pages, which are shown on multiple pages of normal word processor size in these XPS files. Im able to copy that text as Id normally copy any text on Web pages which havent been protected from copying. I didn't even know what XPS files were, so I used Uniblue's File Extension site to check the extension. The site identifies these as Microsoft "XML Paper Specification" files at: http://filext.com/file-extension/XPS They link to Microsoft's own page at: http://msdn.microsoft.com/en-us/wind.../gg463373.aspx Suzanne, as I could have expected, you came through in my behalf! As I said, my knowledge of printing is extremely limited and I know even less about printing to files rather than to paper, but your suggestion seems to be a solution to my problem! You showed me that this method does seem to operate as an HTML to RTF converter utility would operate, but it's quicker because it directly converts the "protected" Web text to RTF which can then be copied normally. I assume a converter utility would require me to copy the source code, then paste the code into the converter, and then copy the converter's output. This cuts out the source code step. (It may be difficult to delete these temporary XLS files when I've copied selected contents from them and have pasted that content to my Log file, but I'll solve that minor problem when I've built up a backlog of these files. I was told I couldn't delete the first few files of this type because another application was using them, but that problem may clear up when I close my browser &/or Word 2007.) I don't know if this is happening in Word 2007 or not, but it works! THANKS AGAIN! Jeff Hook, NJ, USA On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote: Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped) |
#8
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
The XPS reader is probably "less universal" than the Adobe Reader. :-)
My guess is that the XPS reader has been installed when you have updated Windows XP. To get the XPS "printer" you may have to install it manually though. Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Yeah, it sort of is. Except that I couldn't find any clue about an XPS reader that's as universal as Adobe Reader, so I couldn't figure out what you do with the .xps files after you have them, but apparently there's an XPS reader built into Windows. I just checked Tools | Folder Options | File Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for them, which is interesting since my Printers folder doesn't include the XPS Printer. -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Stefan Blom" wrote in message ... Maybe I'm missing something here, but I thought the XPS printer is simply Microsoft's equivalence of PDF? Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Well, I'm glad that led you to a solution. I have the XPS printer on my Windows 7 laptop (I think it's installed by default either with Windows or with recent versions of Office), which is a good thing since I can't get the laptop to see any printers on the network (or to see the network at all except for Internet access), but I haven't had occasion to try it for anything, and anytime I've searched for explanations of XPS online I've ended up terminally confused! -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Jeff H." wrote in message ... Suzanne, thanks so much for your **EXCELLENT** assistance for so many years in this group! Wow! 1. You said "if you'll provide a link to the blog, I might be able to figure out some other workaround." Here's the blog's top page: http://blog.kimblechartingsolutions.com/ but hang on. A trip to the blog may only be optional because it looks like you've already solved my problem! I'll provide more information for the benefit of other users who are experiencing this same problem: 2. You asked, "Can you select text in the blog at all? If so, is Ctrl +C to copy also disabled?: a. I can't select ANY of the digital text on any of the blog pages while they're normally displayed as fully-formatted Web pages on the tabs of my Web browser, IE8. Normal selection methods (clicking the left mouse button, holding it down, and swiping the mouse cursor; left-clicking to position the insertion point, then pressing and holding the keyboard Shift key, then left-clicking the mouse again at the point at which I wish the selection to end) produce no effect whatsoever. I stress that I cant select the blogs digital text when its shown as the content of €śnormal Web pages€ť because I am able to select the same digital text when its shown as the blogs source code, was Ill explain below. b. ANY right-click on ANY of the blog's normal Web pages produces only an obnoxious "yellow triangle with black exclamation point" small rectangular error message which appears at the center of the screen to the accompaniment of the equally-obnoxious default Windows XP error "sounder." The error message is shown in my chosen "slate blue" Windows XP Home system color, it's captioned "Message from webpage," and it fairly chortles "Content Protected ! " (The exclamation point is included...) The context menu has clearly been disabled completely. This linked text appears at the absolute bottom of the content area of each page at the blog: Content Protected Using Blog Protector By: PcDrome. The text is linked to the software's "product page" at: http://www.pcdrome.com/blog-protector The developer says, "Blog Protector is a wordpress plugin which can Protect your valuable blog content as from getting copied. Many Bloggers simply copy-paste the content from your blog without giving credit to the original source. This plugin basically disable the text selection & right click which will make the bloggers difficult to copy the content. Features –*Disable right click on your blog. –*Disable selection of text on your blog." (Funny... €śBlog Protector€ť hasn't been used on the developers own "product page" for the software€¦ I was able to copy that text and to paste it to this field without any difficulty...) c. Since I'm not able to select text in any way on the Web/blog pages, I can't use the Ctrl+C keystroke combination from the keyboard. d. As I explained, none of this is a problem with respect to my access to the source code, which I can reach by using IE8's View \Source menu. The code is displayed on a separate tab with an http: prefix. The text is "formatted Plain Text," so to speak, as would be shown by a code editor, such as Dreamweaver. Text appears in different colors as numbered lines of code. I can select that text by any method, the context menu is fully functional on those browser tabs of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm able to select that code without difficulty. 3. You said, "Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try." With that suggestion you may have solved my problem! I so rarely print any content to paper that I know next to nil about printing. I've seen references to "printing to files" rather than to paper, and I tried it a couple of times, purely out of curiosity, but I was never able to achieve any success. This time I succeeded! Because I'm not able to select the text I wished to save on the blog pages I was forced to go directly to my browser's File\Print menu, which I assumed would either print all of each Web page, or none of it. The Print Dialog was displayed. The "Select Printer" field of the dialog only offered my minimal monochromatic laser printer and the "Microsoft XPS Document Writer" as printer options. I selected the XPS Writer and I left-clicked the "Print" button on the Print dialog. After a brief delay, I was shown the €śSave file as€ť dialog. I saved the output as an XPS file. I was then able to open the files by double-left-clicking them in Windows Explorer's Contents Pane in the Windows Classic View in Windows XP Home Edition. I found these files were crude RTF files which were displayed in my browser. My usual method of saving digital text from Web pages is to paste the content into my local Word 2007 files by using the home ribbon's "Paste\Paste Special\Unformatted text" menu. I then reformat the text minimally, but these XPS files preserve the formatting which is shown on the blog pages, and I may gradually retain that formatting in my local files, although Ill convert the text to my standard fonts and sizes. I'm able to select text from the full XPS copies of the Web pages, which are shown on multiple pages of normal word processor size in these XPS files. Im able to copy that text as Id normally copy any text on Web pages which havent been protected from copying. I didn't even know what XPS files were, so I used Uniblue's File Extension site to check the extension. The site identifies these as Microsoft "XML Paper Specification" files at: http://filext.com/file-extension/XPS They link to Microsoft's own page at: http://msdn.microsoft.com/en-us/wind.../gg463373.aspx Suzanne, as I could have expected, you came through in my behalf! As I said, my knowledge of printing is extremely limited and I know even less about printing to files rather than to paper, but your suggestion seems to be a solution to my problem! You showed me that this method does seem to operate as an HTML to RTF converter utility would operate, but it's quicker because it directly converts the "protected" Web text to RTF which can then be copied normally. I assume a converter utility would require me to copy the source code, then paste the code into the converter, and then copy the converter's output. This cuts out the source code step. (It may be difficult to delete these temporary XLS files when I've copied selected contents from them and have pasted that content to my Log file, but I'll solve that minor problem when I've built up a backlog of these files. I was told I couldn't delete the first few files of this type because another application was using them, but that problem may clear up when I close my browser &/or Word 2007.) I don't know if this is happening in Word 2007 or not, but it works! THANKS AGAIN! Jeff Hook, NJ, USA On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote: Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped) |
#9
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
My guess is that the XPS reader was installed when you updated Windows XP. The XPS "printer," on the other hand, may have to be installed manually.
By definition, that would make the XPS reader "less universal" than the Adobe reader. :-) Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Yeah, it sort of is. Except that I couldn't find any clue about an XPS reader that's as universal as Adobe Reader, so I couldn't figure out what you do with the .xps files after you have them, but apparently there's an XPS reader built into Windows. I just checked Tools | Folder Options | File Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for them, which is interesting since my Printers folder doesn't include the XPS Printer. -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Stefan Blom" wrote in message ... Maybe I'm missing something here, but I thought the XPS printer is simply Microsoft's equivalence of PDF? Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Well, I'm glad that led you to a solution. I have the XPS printer on my Windows 7 laptop (I think it's installed by default either with Windows or with recent versions of Office), which is a good thing since I can't get the laptop to see any printers on the network (or to see the network at all except for Internet access), but I haven't had occasion to try it for anything, and anytime I've searched for explanations of XPS online I've ended up terminally confused! -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Jeff H." wrote in message ... Suzanne, thanks so much for your **EXCELLENT** assistance for so many years in this group! Wow! 1. You said "if you'll provide a link to the blog, I might be able to figure out some other workaround." Here's the blog's top page: http://blog.kimblechartingsolutions.com/ but hang on. A trip to the blog may only be optional because it looks like you've already solved my problem! I'll provide more information for the benefit of other users who are experiencing this same problem: 2. You asked, "Can you select text in the blog at all? If so, is Ctrl +C to copy also disabled?: a. I can't select ANY of the digital text on any of the blog pages while they're normally displayed as fully-formatted Web pages on the tabs of my Web browser, IE8. Normal selection methods (clicking the left mouse button, holding it down, and swiping the mouse cursor; left-clicking to position the insertion point, then pressing and holding the keyboard Shift key, then left-clicking the mouse again at the point at which I wish the selection to end) produce no effect whatsoever. I stress that I cant select the blogs digital text when its shown as the content of €śnormal Web pages€ť because I am able to select the same digital text when its shown as the blogs source code, was Ill explain below. b. ANY right-click on ANY of the blog's normal Web pages produces only an obnoxious "yellow triangle with black exclamation point" small rectangular error message which appears at the center of the screen to the accompaniment of the equally-obnoxious default Windows XP error "sounder." The error message is shown in my chosen "slate blue" Windows XP Home system color, it's captioned "Message from webpage," and it fairly chortles "Content Protected ! " (The exclamation point is included...) The context menu has clearly been disabled completely. This linked text appears at the absolute bottom of the content area of each page at the blog: Content Protected Using Blog Protector By: PcDrome. The text is linked to the software's "product page" at: http://www.pcdrome.com/blog-protector The developer says, "Blog Protector is a wordpress plugin which can Protect your valuable blog content as from getting copied. Many Bloggers simply copy-paste the content from your blog without giving credit to the original source. This plugin basically disable the text selection & right click which will make the bloggers difficult to copy the content. Features –*Disable right click on your blog. –*Disable selection of text on your blog." (Funny... €śBlog Protector€ť hasn't been used on the developers own "product page" for the software€¦ I was able to copy that text and to paste it to this field without any difficulty...) c. Since I'm not able to select text in any way on the Web/blog pages, I can't use the Ctrl+C keystroke combination from the keyboard. d. As I explained, none of this is a problem with respect to my access to the source code, which I can reach by using IE8's View \Source menu. The code is displayed on a separate tab with an http: prefix. The text is "formatted Plain Text," so to speak, as would be shown by a code editor, such as Dreamweaver. Text appears in different colors as numbered lines of code. I can select that text by any method, the context menu is fully functional on those browser tabs of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm able to select that code without difficulty. 3. You said, "Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try." With that suggestion you may have solved my problem! I so rarely print any content to paper that I know next to nil about printing. I've seen references to "printing to files" rather than to paper, and I tried it a couple of times, purely out of curiosity, but I was never able to achieve any success. This time I succeeded! Because I'm not able to select the text I wished to save on the blog pages I was forced to go directly to my browser's File\Print menu, which I assumed would either print all of each Web page, or none of it. The Print Dialog was displayed. The "Select Printer" field of the dialog only offered my minimal monochromatic laser printer and the "Microsoft XPS Document Writer" as printer options. I selected the XPS Writer and I left-clicked the "Print" button on the Print dialog. After a brief delay, I was shown the €śSave file as€ť dialog. I saved the output as an XPS file. I was then able to open the files by double-left-clicking them in Windows Explorer's Contents Pane in the Windows Classic View in Windows XP Home Edition. I found these files were crude RTF files which were displayed in my browser. My usual method of saving digital text from Web pages is to paste the content into my local Word 2007 files by using the home ribbon's "Paste\Paste Special\Unformatted text" menu. I then reformat the text minimally, but these XPS files preserve the formatting which is shown on the blog pages, and I may gradually retain that formatting in my local files, although Ill convert the text to my standard fonts and sizes. I'm able to select text from the full XPS copies of the Web pages, which are shown on multiple pages of normal word processor size in these XPS files. Im able to copy that text as Id normally copy any text on Web pages which havent been protected from copying. I didn't even know what XPS files were, so I used Uniblue's File Extension site to check the extension. The site identifies these as Microsoft "XML Paper Specification" files at: http://filext.com/file-extension/XPS They link to Microsoft's own page at: http://msdn.microsoft.com/en-us/wind.../gg463373.aspx Suzanne, as I could have expected, you came through in my behalf! As I said, my knowledge of printing is extremely limited and I know even less about printing to files rather than to paper, but your suggestion seems to be a solution to my problem! You showed me that this method does seem to operate as an HTML to RTF converter utility would operate, but it's quicker because it directly converts the "protected" Web text to RTF which can then be copied normally. I assume a converter utility would require me to copy the source code, then paste the code into the converter, and then copy the converter's output. This cuts out the source code step. (It may be difficult to delete these temporary XLS files when I've copied selected contents from them and have pasted that content to my Log file, but I'll solve that minor problem when I've built up a backlog of these files. I was told I couldn't delete the first few files of this type because another application was using them, but that problem may clear up when I close my browser &/or Word 2007.) I don't know if this is happening in Word 2007 or not, but it works! THANKS AGAIN! Jeff Hook, NJ, USA On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote: Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped) |
#10
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
Looked at another way, Adobe Reader doesn't include any capability of
*creating* PDFs, while the function of the XPS printer is to create XPS files. But I won't argue that XPS is "less universal" since I still know next to nothing about it whereas one is inundated with PDFs everywhere every day. -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Stefan Blom" wrote in message ... My guess is that the XPS reader was installed when you updated Windows XP. The XPS "printer," on the other hand, may have to be installed manually. By definition, that would make the XPS reader "less universal" than the Adobe reader. :-) Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Yeah, it sort of is. Except that I couldn't find any clue about an XPS reader that's as universal as Adobe Reader, so I couldn't figure out what you do with the .xps files after you have them, but apparently there's an XPS reader built into Windows. I just checked Tools | Folder Options | File Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for them, which is interesting since my Printers folder doesn't include the XPS Printer. -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Stefan Blom" wrote in message ... Maybe I'm missing something here, but I thought the XPS printer is simply Microsoft's equivalence of PDF? Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Well, I'm glad that led you to a solution. I have the XPS printer on my Windows 7 laptop (I think it's installed by default either with Windows or with recent versions of Office), which is a good thing since I can't get the laptop to see any printers on the network (or to see the network at all except for Internet access), but I haven't had occasion to try it for anything, and anytime I've searched for explanations of XPS online I've ended up terminally confused! -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Jeff H." wrote in message ... Suzanne, thanks so much for your **EXCELLENT** assistance for so many years in this group! Wow! 1. You said "if you'll provide a link to the blog, I might be able to figure out some other workaround." Here's the blog's top page: http://blog.kimblechartingsolutions.com/ but hang on. A trip to the blog may only be optional because it looks like you've already solved my problem! I'll provide more information for the benefit of other users who are experiencing this same problem: 2. You asked, "Can you select text in the blog at all? If so, is Ctrl +C to copy also disabled?: a. I can't select ANY of the digital text on any of the blog pages while they're normally displayed as fully-formatted Web pages on the tabs of my Web browser, IE8. Normal selection methods (clicking the left mouse button, holding it down, and swiping the mouse cursor; left-clicking to position the insertion point, then pressing and holding the keyboard Shift key, then left-clicking the mouse again at the point at which I wish the selection to end) produce no effect whatsoever. I stress that I cant select the blogs digital text when its shown as the content of €śnormal Web pages€ť because I am able to select the same digital text when its shown as the blogs source code, was Ill explain below. b. ANY right-click on ANY of the blog's normal Web pages produces only an obnoxious "yellow triangle with black exclamation point" small rectangular error message which appears at the center of the screen to the accompaniment of the equally-obnoxious default Windows XP error "sounder." The error message is shown in my chosen "slate blue" Windows XP Home system color, it's captioned "Message from webpage," and it fairly chortles "Content Protected ! " (The exclamation point is included...) The context menu has clearly been disabled completely. This linked text appears at the absolute bottom of the content area of each page at the blog: Content Protected Using Blog Protector By: PcDrome. The text is linked to the software's "product page" at: http://www.pcdrome.com/blog-protector The developer says, "Blog Protector is a wordpress plugin which can Protect your valuable blog content as from getting copied. Many Bloggers simply copy-paste the content from your blog without giving credit to the original source. This plugin basically disable the text selection & right click which will make the bloggers difficult to copy the content. Features –*Disable right click on your blog. –*Disable selection of text on your blog." (Funny... €śBlog Protector€ť hasn't been used on the developers own "product page" for the software€¦ I was able to copy that text and to paste it to this field without any difficulty...) c. Since I'm not able to select text in any way on the Web/blog pages, I can't use the Ctrl+C keystroke combination from the keyboard. d. As I explained, none of this is a problem with respect to my access to the source code, which I can reach by using IE8's View \Source menu. The code is displayed on a separate tab with an http: prefix. The text is "formatted Plain Text," so to speak, as would be shown by a code editor, such as Dreamweaver. Text appears in different colors as numbered lines of code. I can select that text by any method, the context menu is fully functional on those browser tabs of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm able to select that code without difficulty. 3. You said, "Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try." With that suggestion you may have solved my problem! I so rarely print any content to paper that I know next to nil about printing. I've seen references to "printing to files" rather than to paper, and I tried it a couple of times, purely out of curiosity, but I was never able to achieve any success. This time I succeeded! Because I'm not able to select the text I wished to save on the blog pages I was forced to go directly to my browser's File\Print menu, which I assumed would either print all of each Web page, or none of it. The Print Dialog was displayed. The "Select Printer" field of the dialog only offered my minimal monochromatic laser printer and the "Microsoft XPS Document Writer" as printer options. I selected the XPS Writer and I left-clicked the "Print" button on the Print dialog. After a brief delay, I was shown the €śSave file as€ť dialog. I saved the output as an XPS file. I was then able to open the files by double-left-clicking them in Windows Explorer's Contents Pane in the Windows Classic View in Windows XP Home Edition. I found these files were crude RTF files which were displayed in my browser. My usual method of saving digital text from Web pages is to paste the content into my local Word 2007 files by using the home ribbon's "Paste\Paste Special\Unformatted text" menu. I then reformat the text minimally, but these XPS files preserve the formatting which is shown on the blog pages, and I may gradually retain that formatting in my local files, although Ill convert the text to my standard fonts and sizes. I'm able to select text from the full XPS copies of the Web pages, which are shown on multiple pages of normal word processor size in these XPS files. Im able to copy that text as Id normally copy any text on Web pages which havent been protected from copying. I didn't even know what XPS files were, so I used Uniblue's File Extension site to check the extension. The site identifies these as Microsoft "XML Paper Specification" files at: http://filext.com/file-extension/XPS They link to Microsoft's own page at: http://msdn.microsoft.com/en-us/wind.../gg463373.aspx Suzanne, as I could have expected, you came through in my behalf! As I said, my knowledge of printing is extremely limited and I know even less about printing to files rather than to paper, but your suggestion seems to be a solution to my problem! You showed me that this method does seem to operate as an HTML to RTF converter utility would operate, but it's quicker because it directly converts the "protected" Web text to RTF which can then be copied normally. I assume a converter utility would require me to copy the source code, then paste the code into the converter, and then copy the converter's output. This cuts out the source code step. (It may be difficult to delete these temporary XLS files when I've copied selected contents from them and have pasted that content to my Log file, but I'll solve that minor problem when I've built up a backlog of these files. I was told I couldn't delete the first few files of this type because another application was using them, but that problem may clear up when I close my browser &/or Word 2007.) I don't know if this is happening in Word 2007 or not, but it works! THANKS AGAIN! Jeff Hook, NJ, USA On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote: Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped) |
#11
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
Good point.
Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Looked at another way, Adobe Reader doesn't include any capability of *creating* PDFs, while the function of the XPS printer is to create XPS files. But I won't argue that XPS is "less universal" since I still know next to nothing about it whereas one is inundated with PDFs everywhere every day. -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Stefan Blom" wrote in message ... My guess is that the XPS reader was installed when you updated Windows XP. The XPS "printer," on the other hand, may have to be installed manually. By definition, that would make the XPS reader "less universal" than the Adobe reader. :-) Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Yeah, it sort of is. Except that I couldn't find any clue about an XPS reader that's as universal as Adobe Reader, so I couldn't figure out what you do with the .xps files after you have them, but apparently there's an XPS reader built into Windows. I just checked Tools | Folder Options | File Types in Windows XP, and, sure enough, XPS Viewer.exe is listed for them, which is interesting since my Printers folder doesn't include the XPS Printer. -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Stefan Blom" wrote in message ... Maybe I'm missing something here, but I thought the XPS printer is simply Microsoft's equivalence of PDF? Stefan Blom Microsoft Word MVP "Suzanne S. Barnhill" wrote in message ... Well, I'm glad that led you to a solution. I have the XPS printer on my Windows 7 laptop (I think it's installed by default either with Windows or with recent versions of Office), which is a good thing since I can't get the laptop to see any printers on the network (or to see the network at all except for Internet access), but I haven't had occasion to try it for anything, and anytime I've searched for explanations of XPS online I've ended up terminally confused! -- Suzanne S. Barnhill Microsoft MVP (Word) Words into Type Fairhope, Alabama USA http://word.mvps.org "Jeff H." wrote in message ... Suzanne, thanks so much for your **EXCELLENT** assistance for so many years in this group! Wow! 1. You said "if you'll provide a link to the blog, I might be able to figure out some other workaround." Here's the blog's top page: http://blog.kimblechartingsolutions.com/ but hang on. A trip to the blog may only be optional because it looks like you've already solved my problem! I'll provide more information for the benefit of other users who are experiencing this same problem: 2. You asked, "Can you select text in the blog at all? If so, is Ctrl +C to copy also disabled?: a. I can't select ANY of the digital text on any of the blog pages while they're normally displayed as fully-formatted Web pages on the tabs of my Web browser, IE8. Normal selection methods (clicking the left mouse button, holding it down, and swiping the mouse cursor; left-clicking to position the insertion point, then pressing and holding the keyboard Shift key, then left-clicking the mouse again at the point at which I wish the selection to end) produce no effect whatsoever. I stress that I cant select the blogs digital text when its shown as the content of €śnormal Web pages€ť because I am able to select the same digital text when its shown as the blogs source code, was Ill explain below. b. ANY right-click on ANY of the blog's normal Web pages produces only an obnoxious "yellow triangle with black exclamation point" small rectangular error message which appears at the center of the screen to the accompaniment of the equally-obnoxious default Windows XP error "sounder." The error message is shown in my chosen "slate blue" Windows XP Home system color, it's captioned "Message from webpage," and it fairly chortles "Content Protected ! " (The exclamation point is included...) The context menu has clearly been disabled completely. This linked text appears at the absolute bottom of the content area of each page at the blog: Content Protected Using Blog Protector By: PcDrome. The text is linked to the software's "product page" at: http://www.pcdrome.com/blog-protector The developer says, "Blog Protector is a wordpress plugin which can Protect your valuable blog content as from getting copied. Many Bloggers simply copy-paste the content from your blog without giving credit to the original source. This plugin basically disable the text selection & right click which will make the bloggers difficult to copy the content. Features –*Disable right click on your blog. –*Disable selection of text on your blog." (Funny... €śBlog Protector€ť hasn't been used on the developers own "product page" for the software€¦ I was able to copy that text and to paste it to this field without any difficulty...) c. Since I'm not able to select text in any way on the Web/blog pages, I can't use the Ctrl+C keystroke combination from the keyboard. d. As I explained, none of this is a problem with respect to my access to the source code, which I can reach by using IE8's View \Source menu. The code is displayed on a separate tab with an http: prefix. The text is "formatted Plain Text," so to speak, as would be shown by a code editor, such as Dreamweaver. Text appears in different colors as numbered lines of code. I can select that text by any method, the context menu is fully functional on those browser tabs of code, and I can also use the tabs' Edit\Cop menus if I wish. I'm able to select that code without difficulty. 3. You said, "Are you able to print the page? If so, can you print it to a PDF printer or "Generic (Text Only)" printer? It's possible that either of those would leave you with just as much of a mess as the raw HTML, but it might be worth a try." With that suggestion you may have solved my problem! I so rarely print any content to paper that I know next to nil about printing. I've seen references to "printing to files" rather than to paper, and I tried it a couple of times, purely out of curiosity, but I was never able to achieve any success. This time I succeeded! Because I'm not able to select the text I wished to save on the blog pages I was forced to go directly to my browser's File\Print menu, which I assumed would either print all of each Web page, or none of it. The Print Dialog was displayed. The "Select Printer" field of the dialog only offered my minimal monochromatic laser printer and the "Microsoft XPS Document Writer" as printer options. I selected the XPS Writer and I left-clicked the "Print" button on the Print dialog. After a brief delay, I was shown the €śSave file as€ť dialog. I saved the output as an XPS file. I was then able to open the files by double-left-clicking them in Windows Explorer's Contents Pane in the Windows Classic View in Windows XP Home Edition. I found these files were crude RTF files which were displayed in my browser. My usual method of saving digital text from Web pages is to paste the content into my local Word 2007 files by using the home ribbon's "Paste\Paste Special\Unformatted text" menu. I then reformat the text minimally, but these XPS files preserve the formatting which is shown on the blog pages, and I may gradually retain that formatting in my local files, although Ill convert the text to my standard fonts and sizes. I'm able to select text from the full XPS copies of the Web pages, which are shown on multiple pages of normal word processor size in these XPS files. Im able to copy that text as Id normally copy any text on Web pages which havent been protected from copying. I didn't even know what XPS files were, so I used Uniblue's File Extension site to check the extension. The site identifies these as Microsoft "XML Paper Specification" files at: http://filext.com/file-extension/XPS They link to Microsoft's own page at: http://msdn.microsoft.com/en-us/wind.../gg463373.aspx Suzanne, as I could have expected, you came through in my behalf! As I said, my knowledge of printing is extremely limited and I know even less about printing to files rather than to paper, but your suggestion seems to be a solution to my problem! You showed me that this method does seem to operate as an HTML to RTF converter utility would operate, but it's quicker because it directly converts the "protected" Web text to RTF which can then be copied normally. I assume a converter utility would require me to copy the source code, then paste the code into the converter, and then copy the converter's output. This cuts out the source code step. (It may be difficult to delete these temporary XLS files when I've copied selected contents from them and have pasted that content to my Log file, but I'll solve that minor problem when I've built up a backlog of these files. I was told I couldn't delete the first few files of this type because another application was using them, but that problem may clear up when I close my browser &/or Word 2007.) I don't know if this is happening in Word 2007 or not, but it works! THANKS AGAIN! Jeff Hook, NJ, USA On Apr 10, 9:43 am, "Suzanne S. Barnhill" wrote: Since you seem to have a pretty good idea of what you're doing, this is definitely a long shot, but I'll ask anyway. Can you select text in the blog at all? If so, is Ctrl+C to copy also disabled? (snipped) |
#12
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
Microsoft .NET Framework 3.0 natively includes a dedicated XPS Document
Writer and Viewer. When people double-click a ´.xpsˇ document in the Windows shell, the document is automatically displayed in the XPS Document Viewer embedded in Internet Explorer v.6+. |
#13
Posted to microsoft.public.word.docmanagement
|
|||
|
|||
Removing or Converting HTML code when Web text is pasted into Word 2007
The World Browser (http://www.ioage.com/twen/) is an alternative Web
browser based on the Internet Explorer engine. It has a ´View | View In Edit Modeˇ main menu command. When the page at http://blog.kimblechartingsolutions.com/ is displayed in that ´Edit Modeˇ, you can right-click within the main text, choose ´Select Allˇ, then press ´Ctrl+Cˇ. You can then paste the whole page into Word using ´Ctrl+Vˇ. |
Reply |
Thread Tools | |
Display Modes | |
|
|
Similar Threads | ||||
Thread | Forum | |||
Removing gray background from pasted text document | Microsoft Word Help | |||
Converting html pages into Word 2007 via Clipboard | Microsoft Word Help | |||
Removing spaces in pasted text. | Page Layout | |||
How can I see the html code in word 2007? | New Users | |||
how do i paste html code in personal web page | New Users |