Welcome | View My Profile | Sign Out
MediaPost Home About MediaPost Privacy/Terms Media Kit Sitemap
Publications Home News
Online Media Daily Media Daily News Marketing Daily Mobile Marketing Daily Search Marketing Daily
Daily Feed> Email Daily Feed> Video Daily Feed> Social
Online Blogs
Online Spin Email Insider Search Insider Behavioral Insider Online Publishing Insider Mobile Insider Video Insider Gaming Insider Performance Insider Metrics Insider Social Media Insider Just An Online Minute Daily Online Examiner Raw Blog
Media Blogs
Research Brief Diane Mermigas:On Media TV Watch TV Board Magazine Rack Media Creativity Notes From the Digital Frontier Digital Outsider Mad Blog Red White and Blog
Marketing Blogs
Engage:Hispanics Engage:Kids 6-11 Engage:Moms Engage:Boomers Engage:Gen Y Engage:Teens Marketing:Green Marketing:Sports
Magazines
OMMA Magazine Media Magazine
Subscribe
Feedback Loop RSS Feeds Archives Subscribe
Dec 2 Search Insider Summit (Utah) Dec 6 Email Insider Summit (Utah) Jan 11 OMMA Agency of the Year (NYC) Jan 12 MEDIA Agency of the Year (NYC) Jan 26 OMMA Social (San Francisco) Jan 27 OMMA Performance (SF) Feb 24 OMMA Metrics Measurement (NYC) Feb 25 OMMA Behavioral (NYC) Mar 15 OMMA Global (San Francisco) Apr 14 Search Insider Summit (FL) Apr 18 Email Insider Summit (FL)
Recently Concluded Events
Nov 3 OMMA Adnets (NYC) Oct 30 OMMA Video (LA) Oct 29 OMMA Mobile (LA) Oct 29 OMMA Mobile & Video (LA) Sep 23 Creative Media Awards (NYC) Sep 23 The Future Of Media (NYC) Sep 22 Online All Stars (NYC) Sep 21 OMMA Awards (NYC) Sep 21 MediaPost Live at Advertising Week All-Access (NYC) Sep 21 OMMA Global New York (NYC)
All MediaPost/OMMA Events Event Blogging Past Event Videos
Industry Events Calendar
2010 OMMA Agency of the Year 2010 MEDIA Agency of the Year
2009 Creative Media Awards 2009 OMMA Awards 2009 Digital Out-of-Home Awards 2009 Media Agency of the Year 2009 OMMA Agency of the Year
All Awards
Employment Situations Wanted Services Offered Post a Job
Briefs Reports Online
MediaPost Directories
Mobile Insiders Group
People Finder Edit My Profile View My Profile My Contacts My Calendar
HOME • MANAGE SUBSCRIPTIONS • MEDIA KIT
You've Been De-Anonymized
by Steve Smith, Friday, April 3, 2009, 1:30 PM

SHARE

TOOLS

RELATED ARTICLES
TAGS:  Behavioral Targeting

MOST READ

The supposed "anonymity" of users in behavioral tracking systems has been challenged repeatedly by privacy advocates over the years. One way or another, even when a non-personal information identifier is applied to a user, it is possible (possible, mind you) to track back at least to that person's IP address and perhaps to sensitive PII. In a new academic paper from two University of Texas scholars, Arvind Narayanan and Vitaly Shmatikov, computer science researchers are turning their eye towards defining "anonymity" and "privacy" online.

 

"De-anonymizing Social Networks" is a turgid read to be sure, and our eyes glaze over when they start throwing in equations. But this is the sort of academic scrutiny of privacy that is getting funded (partially by National Science Foundation grants in this case) and will form a bedrock of research about the technical possibilities around identity protection and theft. The outlines of this research are worth reviewing because they reveal the contours of what privacy protection policies will have to address in dealing with social networks.

The basic approach taken by these University of Texas researchers involves overlaying anonymized data from multiple social networks in a way that ultimately reveals identity. The paper does not argue that PII is available directly to third parties, advertisers or malicious hackers, but that anonymized social graphs that are available can be combined to render identities.

The researchers claim that anonymized social graphs are available from many social networks to academic and government data mining, advertising, third-party applications developers and network aggregators. For instance, the paper cites a few instances where widget makers and application developers exposed sensitive information about users on the network. Whether it is through available APIs into a social network or just profile scraping "it is important to understand what a malicious third party application can learn about members of a social network even if it obtains the data in an anonymous form," the researchers argue.

The aim of the research was to determine whether sensitive information about individuals could be extracted from "anonymous" social graphs. The reserarchers used social graph data from Twitter and Flickr, whose APIs they say include mandatory username fields and optional name and location fields. Also included in the research was a graph of friend relationships from the LiveJournal blogging service. They used a series of algorithms (which make our head hurt just thinking about them) to find common people across two networks and then mapped out from there the social graphs to find common cliques.

We wish we could say with some high degree of confidence how these guys managed the trick of de-anonymizing social network members via algorithms, but we confess to some measure of befuddlement over the equations behind the test and the opaqueness of the terminology. Once the basic matches were established, however, the researchers claimed that in 30.8% of the cases their algorithms correctly re-identified previously anonymized members of the social graph. We're still not clear from the paper what the level of identification of individuals was in this experiment.

The implications of this research are twofold, at least. First, the researchers seem to be demonstrating that simple algorithms can be applied by just about any advertiser or developer on readily available anonymous social graphs to render identities. The test claims to have successfully "re-identified" thousands of Twitter users simply by cross-referencing anonymized social graphs with Flickr. In fact, these researchers said their assumptions in making the algorithms were conservative and that in the real world overlap of massive networks like Facebook and MySpace it should be much easier to re-identify higher percentages of common users, if only because so many people readily identify themselves on one or both of the networks. As the social networks get larger and overlap their membership, the research argues, it will become much easier to thwart anonymity and to do so easily. Even anonymized data contains attributes that can make the process easy.

The researchers feel that the potential for abuse is clear and present. Any potential solution would appear to necessitate a fundamental shift in business models and practices and clearer privacy laws on the subject of Personally Identifiable Information."

Perhaps more to the point, the researchers are arguing that "anonymity" online does not equal privacy. In fact, they seem to be proving privacy can be breached on social networks at least fairly easily.

At their clearer online FAQ on the research, the team says it does not expect a technology solution to the problem. "First, the false dichotomy between personally identifiable and non-personally identifiable information should disappear from privacy policies, laws, etc." the team says in its online FAQ. "Any aspect of an individual's online personality can be used for de-anonymization, and this reality should be recognized by the relevant legislation and corporate privacy policies."

Just as the lines between PII and non-PII have themselves become blurred, the associations between anonymity and privacy growing more are complex. As Jules Polonetsky of the Future of Privacy Forum tells us in response to this new research, "It's important to understand that personal and anonymous are not black and white. There are gradients. Any time there are a number of pieces of data about one user, there is an increasing potential they can be identified. For most users it's rather unlikely that anyone will go through the trouble, but it is important to understand that as you build an electronic trail, identification becomes possible."

Hearty readers and cosine freaks can download the original paper or view an online version at http://randomwalker.info/social-networks/index.htm.

4 people recommend this article. 

2 comments on "You've Been De-Anonymized"

  1. Bruce May from Trelios DMG
    commented on: April 03, 2009 at 2:13 PM
    Wow... and I thought that no would ever find out about my secret fascination with Galactic Clusters and radical string theories... oh well, I guess you might as well know everything about me.... The bottom line is simple, if I put my real name on any network then you can find out everything there is to know about me. Since most of us are on Linkedin I guess we have already lost the war. The government can outlaw this kind of data acquisition but once the data is sold to a third party you have the problem of tracking that transaction and we know how difficult that is (especially across national boundaries). Since I don't want the government to tell me what information I can and can't share on a social network (that would be unconstitutional) it looks like I am assuming all the risk. Since I don't see any technical way to control it (and neither do the authors of this study) what can the government do anyway? It looks like a perplexing problem just got a lot more perplexing.

  2. Warren Lee from Segmint
    commented on: April 03, 2009 at 1:59 PM
    Well that's surely a surprise: Use a bunch of tax payers money for a study that seems self evident at the outset. It is obvious that given enough anonymous information, there are math wizards and algorithms that are able to match a minority, abet a large one, of identities. The issue for me is the privacy policy that lets the information out in the first place. Now is there any harm done to anyone? Now consider:

    Since there is no such thing as privacy in the offline world, why prey-tell is the online world held to a different standard? I think this is a major question that needs to be addressed. If I go to a DM firm, offline, I can select female breast cancer survivors, over 50 in the following zip codes who have a HHI of over $150k and etc. Online if I were to try that I think that it might start an inquisition! What is the justification for the disparity?

Leave a Comment

You must be signed in to comment. Sign In

Do you have strong opinions and inside knowledge about the topic of this article -- and do you want to share your insights, observations and points of view regularly with the readers of MediaPost? To be considered as a MediaPost contributing writer, please send pertinent info about your credentials, plus several column ideas and one example of your writing on the topic, to pfine@mediapost.com. Please see our editorial guidelines here first.

STEVE SMITH
  • Contributing writer Steve Smith is a lapsed academic who saw the light, bolted the University and spent the last decade as a digital media critic and consultant. He is chair and programmer of OMMA Mobile and OMMA Behavioral conferences from Mediapost and is the Digital Media Editor at Media Industry Newsletter (MIN) from Access Intelligence. Contact him here.


AUTHORS

ARCHIVES

RECENT VIDEOS
Recent Behavioral Insider Articles
Slipping Into The Peer-to-Peer Channel: Where Trends Begin   
Somewhere beneath the marketing radar, millions of users are online every day registering their desires on...
Think Of BT As A Spiritual Cleansing   
Behavioral targeting is one of those terms execs just want to stay away from because it's...
The Stuff You Don't Even Know You Want To Buy Yet    
When I go to Amazon.com -- oh, about five times a week (yes, I am that...
BT In The Nontraditional Sense   
TARGUSinfo on Tuesday reported a partnership with ad network AdMeld designed to better target consumers with...
Baking In The Privacy Issue   
Who really owns the privacy relationship with users? Increasingly, most players in the ad technology game...
Why Is Adoption Rate Low For Google's BT Service?   
It's been nearly eight months since Google announced a behavioral targeting service, which it calls "interest-based"...
CFOs Don't Click   
B2B content has always been a tough nut to crack when it comes to ad networking....
What's It Like To Be A True Marketing Geek? John Nardone Knows   
John Nardone, CEO of [x+1], describes himself as a "marketing geek" who became addicted to the...
The Value Of The New Machine   
One person who has strong views about the way ad networking environments are evolving is Rich...
BT Thrives In Economic Downturn    
Thank goodness for the economic downturn. Sounds crazy, right? But without tighter ad budgets, marketers might...
>> Behavioral Insider Archives 
ABOUT MEDIAPOST • MASTHEAD • MEDIA KIT • RSS FEEDS • PRIVACY/TERMS & CONDITIONS
©2009 MediaPost Communications. All rights reserved.
1140 Broadway, 4th Floor, New York, NY 10001
tel. 212-204-2000, fax 212-204-2038, feedback@mediapost.com