Money from advertising income motivates blog scrapers to steal content. It’s only a matter of time before you discover your copyright has been violated and your content is now duplicated on a site you don’t want to be associated with. So let’s consider the impact of that duplication and how to counter it.
All Rights Reserved – Any content reblogged from one cool site must adhere to the terms of © Copyright
When full copies of your content appear anywhere other than on the site it was originally published on
- your brand has been diluted;
- your traffic has been siphoned away from your site;
- backlinks from the duplicate lack value as they are usually from sites that either have no pagerank or a low pagerank and sadly your article is now associated with those sites or even with banned sites;
- too many backlinks from low quality sites can compromise your blog’s positioning in the SERPs;
- the duplicate content can outrank the original content in the SERPs (search engine page results).
These days convenience rules. Instead of composing our own new posts and including backlinks to related articles in them we click reblog buttons. We are forgetting that backlinks to our articles transfer pagerank thereby contributing to the authority our sites have within our niche.
Reblogs do pass pagerank, when there’s any to be passed. However, we’ve all witnessed the kind of blog that wouldn’t exist if it weren’t for throwaway comments and like button clicks from similar sites where content is primarily prompted posts, awards, product reviews, competitions, contests, other memes and reblogs.
What is value of a reblog backlink from a site with no pagerank or very low pagerank that’s light on original content, but rich in prompted posts, awards, product reviews, competitions, contests, other memes and reblogs?
Not much is my polite answer. Blogs with content of that nature are so close to being splogs that I cringe when my posts are reblogged on those sites, as I know that too many backlinks from low quality sites can compromise a blog’s positioning in the SERPs.
“Help! Facebook thinks my blog is spam” can be the plaintive cry of a blogger posting to the support forums. I always provide the correct answer, despite the fact I visited the blog in question and usually concur with robotic analysis and error message it gave rise to.
Have you analyzed your site to make sure you don’t link to spam sites? Frequently spam sites that are not obvious will offer widgets, “awards” and images for people to use in their blogs and there is adware/malware attached. If you’ve hotlinked from any of those sites, then you’ll have to remove it and flush your Facebook cache before Facebook will reclassify you.
1. Try flushing your FB cache but disconnect from FB first.
2. Disconnect the Publicize app and reconnect it to the correct account.Try the full reconnect procedure again detailed in the support documentation including step 3 (removing the FB app), which is probably the most important step.
3. See also publicize troubleshooting.
4. Then you can report it as a false-positive from within the error message Facebook provides.
No Google juice and no page view stat
When we click reblog, share and like buttons off-site without reading the original article where it was published, the result is that the button clicking doesn’t create a page view stat. It seems we are moving towards a model where we place a higher value on likes and shares resulting from autoposting to social media sites thereby replacing backlinks and actual blog visits. Well, that doesn’t make me happy because off site button clicking doesn’t result in visitors who locate the rest of my blog content.
Countering blog scrapers
Below are 11 steps I take to ensure that full copies of my content appear only on the site it was originally published on.
- I use a theme that is designed to display an author byline on each article that’s linked to all my other posts published on the same site.
- I have set my RSS Feeds to “Summary” rather than “Full” on this page > Settings > Reading.
- I either insert “the more tag” following an introductory paragraph into each article prior to publication, or I use a theme that automatically provides excerpts on the Front page, Archives, Categories and Tags pages.
- I insert a copyright notice with an embedded link to my copyright page into each article.
- In each new article I backlink to my earlier related articles using appropriate anchor text.
- A few hours after publication I copy and paste a sentence from the body of my article into Google search and do a duplicate content search.
- In addition I have set up daily Google Alerts for my domain names and my pseudonym.
- From time to time I also use Copyspace to search for duplicates of my content.
- I also use Plagium occasionally to track plagiarism.
- When my content has been stolen I file a DMCA take down notice.
- I report scraped content ranking above or instead of the original to Google.
- I recommend watermarking your images.
Search engines like Google aim to provide the most relevant and fresh results to users. Blog scrapers aim to make advertising income from the content they steal. The duplicate content blog scrapers create pollutes search results causing frustration and stealing time from search engine users. So it’s not surprising that Google is scraping blog scraped content from their search results.
How can I make sure that Google knows my content is original?
Can you benefit from content scraped from your site?
Google Authorship gives you the ability to tell Google that you are the author of the content. And connecting your Google+ Profile to your WordPress.com blog creates an official connection between your posts and your Google+ account.
Is your author name embedded in every article*?
Is your copyright notice is embedded in every article*?
Do you backlink to your internal related content in every article*?;
*The same applies to all digital products such as PDFs files, audio files and videos,etc.
Hat tip to aivlys who inspired this article.
Hat tip to ♡eM who inspired the Splog Off! section by prompting me to reflect on “the good old days”.