Spam Links

Web Spam

Web spam is mostly aimed at increasing the ranking of spammed links in search engines. To achieve this end links were first spammed in guestbooks, then blogs, referrer logs and now Wikis - anywhere that a user of a site may submit links is vulnerable to spamdexing. Web spam also wants to be viewed by people, just like email spam.

Web Spam Overviews

Fighting Spam for the Search Engines - www.engine-spam.com/
Spamdexing, from Wikipedia - en.wikipedia.org/wiki/Spamdexing
Poly's Anti-Spam-Seite gegen Suchmaschinen-Spamming - geocities.com/polyphydra/Anti_Spam.html - German
Link Spamming - www.rickconner.net/spamweb/spam_linkspam.html
Link spamming - www.spam.co.nz/linkspamming.html
A tube for spam - www.rogelsview.com/technology-and-software/a-tube-for-spam/
SEO Egghead on spam - www.seoegghead.com/blog/category/spam
Akismet web spam statistics - akismet.com/stats/
Plagiarism Today on spam - www.plagiarismtoday.com/tag/spam/
What Google Knows About Spam - www.mattcutts.com/blog/what-google-knows-about-spam/

Top Index

Web Spam Prevention

Can you distinguish a robot and a person? That's what these methods try to do. Web spam prevention is a subset of the wider problem of web fraud detection, used (among other things) to help stop phishing.

Spam2.0: Fake user accounts and spam profiles - googlewebmastercentral.blogspot.com/2009/06/spam20-fake-user-accounts-and-spam.html
Stopping spambots with hashes and honeypots - nedbatchelder.com/text/stopbots.html
KittenAuth - www.thepcspy.com/kittenauth
ModSecurity - www.modsecurity.org/
ModSecurity rules - gotroot.com/tiki-index.php?page=mod_security%20rules
Protect Web Form - www.protectwebform.com/
LinkSleeve (SLV) Spam Link Verification - www.linksleeve.org/
Strider Search Defender - research.microsoft.com/en-us/um/redmond/projects/strider/searchdefender/default.htm
Search engine SPAM detector - tool.motoricerca.info/spam-detector/
PHPrbl - phprbl.init1.nl/
Mod_Access_RBL2 - code.google.com/p/modaccessrbl2/
Babycart - apthorpe.cynistar.net/code/babycart/
Keywords Abuse and Search Engine Submission Complaints - www.seopros.org/Processing/Complaints/offenderlist.asp
SEOmoz URL Spam Detection Algorithm - www.seomoz.org/user_files/spam-detection/
Spam vs. Accessibility - www.joedolson.com/articles/2008/06/spam-vs-accessibility/
The 3-D CAPTCHA - spamfizzle.com/Captcha.aspx
aiCaptcha — Using AI to beat CAPTCHA and post comment spam - www.brains-n-brawn.com/default.aspx?vDir=aicaptcha
CAPTCHA farming - jc.ngo.org.uk/blog/2006/11/21/captcha-farming/
Honeypot Captcha - haacked.com/archive/2007/09/11/honeypot-captcha.aspx

Top Index

RSS Spam Filtering

Fed up with spam in your RSS feeds? Are blog owners unable to stem the tide on their own? You can filter the blogs you read using an RSS spam filter.

feedscrub - www.feedscrub.com/

Top Index

Web Identity Management

OpenID - openid.net/
Facebook Connect - developers.facebook.com/fbconnect.php
Identity 2.0 - identity20.com/

Top Index

Web Spam Research

Asirra - research.microsoft.com/en-us/um/redmond/projects/asirra/
Resources for Research on Web Spam - www.yr-bcn.es/webspam/
Adversarial Information Retrieval (AIR) on the Web - airweb.cse.lehigh.edu/
Web spam challenge - webspam.lip6.fr/wiki/pmwiki.php
Introducing the Webb Spam Corpus: Using Email Spam to Identify Web Spam Automatically - www.ceas.cc/2006/6.pdf
Combating Web Spam with TrustRank - ilpubs.stanford.edu:8090/770/
Carlos Castillo publications on web information retrieval - www.chato.cl/research/
Using Rank Propagation and Probabilistic Counting for Link-based Spam Detection - www.slideshare.net/ChaToX/linkbased-spam-detection-fws-2006-barcelona
Link analysis for Web spam detection - www.chato.cl/papers/becchetti_2007_link_analysis_web_spam_detection.pdf
Formalization of Link Farm Structure Using Graph Grammar - www.ieeexplore.ieee.org/xpl/freeabs_all.jsp?isnumber=4482669&arnumber=4482802&count=160&index=132
Microsoft Research Text Mining Search and Navigation Research - research.microsoft.com/en-us/groups/tmsn
Transductive Web Spam Detection - www2007.org/workshops/paper_120.pdf - slides
Robust PageRank and Locally Computable Spam Detection Features - airweb.cse.lehigh.edu/2008/submissions/andersen_2008_robust_pagerank_local_spam.pdf
Social Honeypots: Making Friends With A Spammer Near You - www.ceas.cc/2008/papers/ceas2008-paper-50.pdf
How lexical analysis can combat Web spam - www.seo-theory.com/2009/02/13/how-lexical-analysis-can-combat-web-spam/
CAPTCHAs' Effect on Conversion Rates - www.seomoz.org/blog/captchas-affect-on-conversion-rates

Top Index

Social Networking Spam

Top Index

Twitter spam

Twitter's Unintended Hidden Spam Engine — New Follower Direct Messages - blog.thebusybrain.com/twitters-unintended-hidden-spam-engine-new-follower-direct-messages/612
Social network spam - blog.wordtothewise.com/2010/02/social-network-spam/
Report Twitter spam - twitter.com/spam
Stop Twitter Spam - www.stoptwitterspam.com/blog/
Turning Up The Heat On Spam - blog.twitter.com/2008/08/turning-up-heat-on-spam.html
Twitter Spam Invades Trending Topics - mashable.com/2009/05/11/twitter-spam-trending-topics/
Commercial Twitter spamming tool hits the market - blogs.zdnet.com/security/?p=2477
Twitter DM Deleter - dcortesi.com/tools/dm-deleter/
Twitspam - twitspam.org/
TwitChuck - www.twitchuck.com/
TidyTweet - www.tidytweet.com/
Twittercism: Security & Privacy - twittercism.com/category/security/
Tweepi - tweepi.com/
TwerpScan - twerpscan.com/
Clean Tweets - blvdstatus.com/clean-tweets.html
Tidy Tweet - www.tidytweet.com/
Tweet Blocker - tweetblocker.com/
True Twit - www.truetwit.com/truetwit/signUp/index

Top of Section Top Index

Other "web 2.0" spam

Facebook: Pass it on - blog.facebook.com/blog.php?post=7830237130
Web 2.0 Cracks Start to Show - www.wired.com/science/discoveries/news/2005/10/69366
Report Spam on Google Maps - groups.google.com/group/Google-Maps-For-Business-Owners/browse_thread/thread/ea2898fa2c921792?hl=en
Are Tagged Photos on Facebook a New Source of Marketing Spam? - www.readwriteweb.com/archives/tagged_photos_on_facebook_new_source_of_marketing_spam.php
YouTube Spam Panic Emerging; Why Don't All Networks Have Spam Control? - www.readwriteweb.com/archives/youtube_spam_panic_emerging_wh.php
The Undignified Death of Social Networks - www.loosewireblog.com/2008/11/the-undignified.html
Spammers Hijack Facebook Group of 1.5 Million - www.internetnews.com/webcontent/article.php/3802326
419 Scammers Set Up Roost on Facebook - www.louisgray.com/live/2009/01/419-scammers-set-up-roost-on-facebook.html
Spam2.0: Fake user accounts and spam profiles - googlewebmastercentral.blogspot.com/2009/06/spam20-fake-user-accounts-and-spam.html
Magic Diet Product Scams Invade Freecycle And Meetup Groups - consumerist.com/2009/07/magic-diet-product-scams-invade-freecycle-and-meetup-groups.html
Twitter suspends accounts of users with infected computers - www.infoworld.com/d/security-central/twitter-suspends-accounts-users-infected-computers-878
Apple Bans Bushel Of Spam Apps - consumerist.com/2009/08/apple-bans-bushel-of-spam-apps.html

Top of Section Top Index

Blog Spam/Comment Spam

Blogs were hailed as a new, open, trusting communications method, but it wasn't long before the spammers came long and wrecked open comments, pings and trackbacks.

Top Index

About Blog Spam

Preventing comment spam - googleblog.blogspot.com/2005/01/preventing-comment-spam.html
Blog Spam DB - www.markcarey.com/spamdb/
The "Anti-Spam Feedback" Initiative - www.tamingthebeast.net/antispam/antispam.htm
Seven quick tips for a spam-free blog - cheerleader.yoz.com/2003/09/seven-quick-tips-for-a-spam-free-blog.html
Comment spam - kalsey.com/2003/09/comment_spam/
Blacklisting Comment Spam - simonwillison.net/2003/Sep/2/blacklisting/
Documented cases of spamming - spammers.chongqed.org/
Spam Huntress - spamhuntress.com/
Bloggers Declare War on Comment Spam, but Can They Win? - ojr.org/ojr/glaser/1095201311.php
Push back - www.rojisan.com/spam/
Damn Spam! - spam.tinyweb.net/
Blog Spammer Caught. Now What? - www.spam-blocker-resource.com/
Building a Better Spam Detector - www.seomoz.org/blog/building-a-better-spam-detector
Comment spam archive - www.herod.net/spam/
On Free Speech and Civil Discourse: Filtering Abuse in Blog Comments - www.ceas.cc/2008/papers/ceas2008-paper-43.pdf
WordPressDirect: Blogging Tool or Spam Engine? - www.blogherald.com/2008/11/24/wordpressdirect-blogging-tool-or-spam-engine/
Is Comment Spam Getting Smarter? - www.velvetblues.com/web-development-blog/comment-spam-getting-smarter/

Top of Section Top Index

Stop Wordpress Blog Spam

Combating Comment Spam - codex.wordpress.org/Combating_Comment_Spam
Spam Tools - wordpress.org/extend/plugins/tags/spam
Combat Comment Spam - www.tamba2.org.uk/wordpress/spam/
Spam Karma - unknowngenius.com/blog/wordpress/spam-karma/
Anti-spam Plugin - weblog.sinteur.com/2004/11/yet-another-anti-spam-measure/
Akismet - akismet.com/
SpamWords - dev.wp-plugins.org/wiki/SpamWords
Spaminator - dev.wp-plugins.org/wiki/Spaminator
Trackback Validator Plugin - seclab.cs.rice.edu/proj/trackback/trackback-validator-plugin/
WP-spamfree - wordpress.org/extend/plugins/wp-spamfree/
Did You Pass Math? - www.herod.net/dypm/
Math Comment Spam Protection Plugin - sw-guide.de/wordpress/plugins/math-comment-spam-protection/
Simple Trackback Validation Plugin - sw-guide.de/wordpress/plugins/simple-trackback-validation/
Simple Spam Filter for WordPress - tantannoodles.com/toolkit/spam-filter/
Antispam Bee - playground.ebiene.de/1137/antispam-bee-wordpress-plugin/

Top of Section Top Index

Stop Movable Type Blog Spam

Learning Movable Type: Concerning Spam - www.learningmovabletype.com/a/000246concerning_spam/
Six Apart Guide to Comment Spam - www.sixapart.com/pronet/comment_spam.html
MT-Blacklist/Comment Spam Clearinghouse - jayallen.org/comment_spam/
Keystrokes Plugin - overstated.net/projects/mt-keystrokes/
SpamLookup - bradchoate.com/projects/spamlookup/
Using mod_security to shield Movable Type from Blog Comment Spam - jeremy.zawodny.com/blog/archives/007442.html

Top of Section Top Index

Stop Other Blog Spam

Word banning in TypePad - www.sixapart.com/typepad/news/2006/05/word_banning_an.html
Akismet API and implementations - akismet.com/development/
Clean up link spam on your Plone site - plone.org/documentation/how-to/clean-up-link-spam-on-your-site
Akismet for Ruby on Rails - rubyforge.org/projects/ror-akismet/
Trac spam filter - trac.edgewall.org/wiki/SpamFilter
Drupal Spam Module - drupal.org/node/11104
Defensio - defensio.com/ (Reviews: 1)
Mollom - mollom.com/
Akismet spam graphs with PHP RRD - www.ioncannon.net/php/113/akismet-spam-graphs-with-php-rrd/
More Spam Fun (Akismet) - blog.joshuaeichorn.com/archives/2006/12/21/more-spam-fun/
Fun with comment spammers... - blogs.herod.net/steven/archives/94
Blogger: Preventing Unwanted Comments and Comment Spam - www.google.com/support/blogger/bin/answer.py?answer=42064
How to prevent spam in Rails - www.elctech.com/articles/how-to-prevent-spam-in-rails

Top of Section Top Index

Trackback Blog Spam

Trackback Spam Resources - seclab.cs.rice.edu/proj/trackback/
Confessions of a Trackback Spammer: Please… Stop Me! - www.copyblogger.com/trackback-spam/

Top of Section Top Index

Spam Blogs (Splogs)

Fight Splog - fightsplog.blogspot.com/
Flagday - flagday.pbworks.com/
Splog - wiki.chongqed.org/Splog
AntiSplog - antisplog.phpmagazine.net/
Destroy all Malware - www.destroyallmalware.com/
On Spam Removals - buzz.blogger.com/2006/04/on-spam-removals.html
Splog software from Hell - ebiquity.umbc.edu/blogger/2006/04/03/splog-software-from-hell/
Detecting Spam Blogs: A Machine Learning Approach - ebiquity.umbc.edu/paper/html/id/296/Detecting-Spam-Blogs-A-Machine-Learning-Approach
Characterizing the Splogosphere - ebiquity.umbc.edu/paper/html/id/299/Characterizing-the-Splogosphere
Spam blog - en.wikipedia.org/wiki/Spam_blog
On Auto-Bloggers - www.talkbiz.net/ramblings/archives.php?id=A2006061
Twingly.com — spam-free blog search - www.twingly.com/search?q=&spam-free=Spam-free+search+(beta)
WordPress Plugin: Digital Fingerprint — detecting content theft - www.maxpower.ca/wordpress-plugin-digital-fingerprint-detecting-content-theft/2006/09/25/
Copyscape - www.copyscape.com/

Top of Section Top Index

Spam Pings (Spings)

FeedThirsty - www.feedthirsty.com/
Pings, spings, splogs and the Splogosphere - ebiquity.umbc.edu/blogger/2007/02/01/pings-spings-splogs-and-the-splogosphere-2007-updates/
Welcome to the Splogosphere - ebiquity.umbc.edu/blogger/2005/12/15/welcome-to-the-splogosphere-75-of-new-blog-posts-are-spam/
Spings on Wikipedia - en.wikipedia.org/wiki/Sping
State of the Splogosphere, Part III - www.destroyallmalware.com/?guid=20060401093834

Top of Section Top Index

Wiki Spam/LinkSpam

Top Index

About Wiki Spam

Fight Wiki Spam - chongqed.org/fightback.html
BannedContent Discussion - www.communitywiki.org/cw/BannedContentDiscussion
SpamBusters - www.communitywiki.org/cw?SpamBusters
Wikipedia Nofollows Links - blogoscoped.com/archive/2007-01-22-n21.html
Usemod on spam - www.usemod.com/cgi-bin/mb.pl?CategorySpam
WikiSpam - www.cocoadev.com/?WikiSpam
Hiding Spam with CSS - wiki.chongqed.org/CSSHiddenSpam
MediaWiki Default Pages Spam - wiki.chongqed.org/MediaWikiDefaultPagesSpam
Discussion of antispam ideas - wiki.chongqed.org/WikiSpam
Wiki Immune System ideas - www.nooranch.com/synaesmedia/wiki/wiki.cgi?WikiImmuneSystem
Wiki Spam Solutions - c2.com/cgi/wiki?WikiSpamSolutions

Top of Section Top Index

Anti-WikiSpam Solutions and Filters

MediaWiki anti-spam features - www.mediawiki.org/wiki/Anti-spam_features
SpamBlacklist for MediaWiki - www.mediawiki.org/wiki/SpamBlacklist_extension
Blocking Spam in MediaWiki - www.umasswiki.com/wiki/UMassWiki:Blocking_Spam_in_MediaWiki
MediaWiki Spam Filter project - www.mediawiki.org/wiki/Spam_Filter
Proxy blocking discussion at Wikimedia - meta.wikimedia.org/wiki/Proxy_blocking
MoinMoin AntiSpamGlobalSolution - moinmo.in/AntiSpamGlobalSolution
Kwiki DNSBL - svn.kwiki.org/jooon/Kwiki-DNSBL/
TWiki Blacklist Plugin - twiki.org/cgi-bin/view/Plugins/BlackListPlugin
Zimbra Anti-spam - wiki.zimbra.com/index.php?title=Category:Anti-spam
Spam cleanup script - www.wikia.com/wiki/Spam_cleanup_script
DokuWiki spam logging - www.dokuwiki.org/blacklist
Oddmuse Antispam module - www.oddmuse.org/cgi-bin/oddmuse/Antispam_Module
Oddmuse SpamCatching module - www.oddmuse.org/cgi-bin/oddmuse/SpamCatching_Module
PmWiki URL Approvals - www.pmwiki.org/wiki/PmWiki/UrlApprovals
Spam Proof Wiki - sourceforge.net/projects/spamproofwiki/
Fighting spam in Wikka - wikkawiki.org/WikkaSpamFighting

Top of Section Top Index

Blog and Wiki Spam Blacklists

Much blog and wiki spam blocking is based on identifying spammy keywords, URLs or IPs, much like the blacklists of words, URLs and IPs that have been built to counter email spam. Lists designed for one wiki or blog are probably useful (with a bit of tweaking) for any other Web 2.0 application.

MoinMoin BadContent - master.moinmo.in/BadContent
Wikia Spam Blacklist - www.wikia.com/wiki/Spam_Blacklist
Mozilla wiki spam blacklist - wiki.mozilla.org/Spam_blacklist
Wikihow spam blacklist - www.wikihow.com/Spam-Blacklist
S23 spam blacklist - s23.org/wiki/Spam_blacklist
Audacity spam blacklist - wiki.audacityteam.org/index.php?title=Audacity_spam_blacklist
Perplexcity wiki spam blacklist - perplexcitywiki.com/wiki/Meta:Spam_Blacklist
Spamikaze BadContent - spamikaze.org/BadContent
Wiki Spammers - www.openwiki.com/ow.asp?WikiSpammers - IPs
Meatball Ban List - www.usemod.com/cgi-bin/mb.pl?BanList - IPs and hostnames
Wiki Vandals - rollerweblogger.org/wiki/Wiki.jsp?page=WikiVandals
WikiSpam - twiki.org/cgi-bin/view/Codev/WikiSpam
Trac bad content - trac.edgewall.org/wiki/BadContent
CommunityWiki Banned Content - www.communitywiki.org/en/BannedContent
Blog Spam Blacklist - blogspambl.com/

Top Index

Forum Spam

bbAntiSpam: phpBB antispam solution - bbantispam.com/
phpBB ree Anti-Spam Check - www.phpbb-security.com/check.php
Disable phpBB Spambots - www.phpbb.com/files/mods/disable-spambots-1.0.1.mod
ForumBan - www.sugapablo.com/forumban/
Forum Equalizer? Spam Equalizer - www.talkbiz.net/ramblings/archives.php?id=A2006111
Stop Forum Spam - www.stopforumspam.com/

Top Index

Guestbook Spam

Fight Guest Book and Public Forum Spam - www.unwantedlinks.com/GuestBookSpam.html

Top Index

Referrer Spam

Analytics spam: coming to an internet near you - www.seomoz.org/blog/analytics-spam-coming-to-an-internet-near-you
REF(errer)SPAM FUCKER 3000 - docs.g-blog.net/code/RefSpamFucker/3000/refspamfucker3000.php.txt
Adminshop - visualintensity.com/adminshop-com/
Get Rid of Referer Spammers - www.topsiteswebdirectory.com/referer_spam/

Top Index

Web Adverts

These sites list hostnames that are associated with web advertising, which some consider to be as offensive as spam. The ads may be viewed in bulk, but unless they were placed onto the site in question without the permission of the owner they are not unsolicited, so they're not strictly spam.

Hostess - accs-net.com/hostess/
KillHost - www.sillysot.com/other.htm
Using the Hosts File - www.accs-net.com/hosts/
How To Block Ads (& Web Bugs) Without Extra Software - ssmedia.com/Utilities/hosts/
List of ad servers - pgl.yoyo.org/adservers/
Blocking Unwanted Parasites with a Hosts File - www.mvps.org/winhelp2002/hosts.htm
someonewhocares hosts file - someonewhocares.org/hosts/
Hosts file updates - datadragon.com/banners/hosts.shtml

Top Index

everything you didn't want to have to know about spam

Hosted by spam.abuse.net, with help from Neil Schwartzman. Domain registration by Gregg DesElms. Logo by Art101.
Spam Links Home Creative Commons License
This work is licensed under a Creative Commons License. SPAM is a trademark of Hormel Foods.
Page last updated: 21-Mar-2010