Managing a huge and active guest blogging community is exciting. I never stop discovering new and new things.
In fact, the scam I am going to share today isn’t new at all.
It’s been around for ages, but considering I’ve been involved in the content marketing for quite some time and have never seen this before, I’ve figured it may be new to you as well.
If you accept guest articles or outsource content creation, you’ll be as surprised to hear about this scam as I was. It’s almost impossible to catch!
The scam is not easy to explain, so let’s start with a short exercise:
Do these pieces look exactly the same? | |
“This post looks at one type of content re-packaging: turning your old content into an image…” | “This p?st l??ks at ?ne type ?f c?ntent re-packaging: turning y?ur ?ld c?ntent int? an image…” |
Well, they are not… Let’s compare the source code in the editor:
“This post looks at one type of content re-packaging: turning your old content into an image…” | “This p?st l??ks at ?ne type ?f c?ntent re-packaging: turning y?ur ?ld c?ntent int? an image…” |
Note: If you select the source code in the second column and click “View Selection Source”, you are most likely to be unable to see the wicked symbols even there (the above screenshots are made from the HTML editor).
Now try also searching Google by directly copy-pasting each text from *each* column:
“This post looks at one type of content re-packaging: turning your old content into an image…” | “This p?st l??ks at ?ne type ?f c?ntent re-packaging: turning y?ur ?ld c?ntent int? an image…” |
Wicked, huh?
Like Google, no duplicate content checker will ever recognize that line of text as a copy from the existing text.
Table of Contents
How the Heck Could This Happen?
Simple. The text from the second column contains Cyrillic character “?” which looks exactly the same as the English “o” but, taken from another language, it forms a new word.
Here a better explanation:
every word is actually a mix of Western European and Cyrillic characters. Cyrillic is made of unique characters, as well as characters that look exactly like ours. E.g. the letters “M” “o” “c” “b” etc. display the same in Russian and in English, but they actually have a different code
How Do I Detect the Scam?
>>>Search Google: as in the screenshot above, Google won’t find *anything*, even a very common word, when it has foreign characters. Or at least it’ll treat it as a mistyped word:
>>>Use UTF Converters (this one for example) to any errors:
>>>Try Google Translate: It won’t be able to either read the phrase in voice or translate it:
WordPress plugin: There’s absolutely cool new plugin that can decode your WordPress content to UTF-8. lease download it and always get alarmed of stolen content:
How Can the “Alien” Characters Be Stripped?
If you have identified the scam and want to find where the article was initially published, that’s not going to be easy. Simply put, there are hardly any…
WYSIWYG editor – Our very-much-loved WordPress Editor not only fails to strip the alien characters (which is quite natural as who knows what they should be converted into), but also fails to signal about them in any way (it turns the corresponding code into normally-looking characters). So if you are using WordPress and copy-paste the text from the second column into the “Visual” editor, you won’t notice anything. This makes the scam even more dangerous and easier to go away with. My WordPress even didn’t highlight the words with foreign characters as errors.
Notepads (or any clipping utility that is meant to strip the code): not all of them. Unicode Notepad (any modern Windows or Mac version now runs them) won’t strip the alien characters unless you save the file and choose to save ASCII (not Unicode), then close the file and re-open it. The words having alien characters might be highlighted as errors in some cases:
Conclusion
The best advice here: search Google! Even if the article you are going to publish passes Copyscape, grab a long phrase from it and search for the exact match to make sure Google understands it.
Good luck!
cc licensed ( BY ) flickr photo shared by alancleaver_2000