Here are a couple of tips on how to find the content stealing culprits!
Use Copyscape. Copyscape is a great little tool that auto-magically searches for content similiar to that of your website on other websites. All you need to do is mention the url of the website, after which copyscape will search the web for similar articles.
The only downside to this method is that if someone has rephrased some of the words in the article, copyscape might not be able to find it (you'll have to use the google method mentioned below). Another thing is that the free version will only show you a couple of websites that have plagarized content from your website, and not all the websites on the internet that have copied your content.Use Google. Yup, that's right, this wonderful search engine can also be utilized to search for copies of your posts. Here's how you can find out if someone else has similar posts. Take one of your post titles, or key sentences in posts that you might think are plagiarized and search for them on google. The trick here is to enclose the title or key phrases in quotations, so Google will only show those phrases and not phrases/titles related to the query. Here's an example:
Yes, in case you are wondering... that website pictured in the screenshot above stole my post =( !
Use digg.com/slashdot.com/shoutwire.com search. You'll be surprised how many websites out there copy your posts, submit it to digg, and get more visitors than you do. Many people have websites for the only one reason - making money. If a website gets dugg, it usually gets lot of visitors.
Even if it doesn't make it to the homepage of digg, there is a good chance that when someone searches for that particular phrase, they land up on the digg.com submission page and eventually visit the website (that stole your content) to see what the post is about. So, all you need to do is search for your title on digg, and if you find a website that has copied your post... bingo! You've found the content thief!
Check your web-analytics logs. Often you might have put a link to a post in your own website in one of your posts. Many you simply copy and paste also copy all the links in that post. This means that when a user clicks on one of your links, they'll be coming to your website. Most of the time all web statistic softwares will record this incoming link, and add the incoming entries to your log. So once in a while, simply browse through your logs and check for any websites that might have reposted your content.
If you use wordpress or any other CMS software, check any trackbacks, or incoming links that might help you find the website that is stealing your posts.
I'm sure, if you use these tips, it should be enough to catch anyone who is stealing your content and republishing it on their website. Once you've found out who is stealing your content... dealing with them is another long story, which I'll try to cover in some other post.
Got a question, tip or comment? Send them to firstname.lastname@example.org and we'll try to answer it in a blog post!