Moderator: “Can there ever be too much data for a brand?”
Industry exec: “Interesting - I have never been asked that question before.
As we are praising the opportunities to create more insights from Big Data and to make better decisions, nobody seems to question what the right amount of data is. Just because by definition you need more data to get to Big Data, does not mean it is a good idea to keep collecting.
There are a couple of more or less obvious consequences from gathering ever more data: it’s costly, it slows down decisions, and it can be outright misleading.Data collection is expensive
Together with the CFO of a major media company, we recently looked at just a couple of subscriptions to typical industry data providers like Nielsen, comScore, etc. In a heartbeat, he was able to cut nearly $40,000 a year by questioning what was really being used (let alone analyzing what is instrumental to make a decision).
Other costs for data are storage, bandwidth, and processing, which can quickly run into thousands of dollars. In one example, keeping a record of all paid and earned Facebook data that a mid-size social media provider manages will increase just the storage bill by $80,000 in one year.
Keeping data becomes a liability, too. Certain manipulation of data sets, like aggregation, might not be allowed per the original license agreements, but who is tracking that, and how? And even discarding data requires effort, as somebody needs to decide what can be removed and then actually has to get around to actually purging it. The data assessment project at a major agency holding company took four months and countless meetings, emails, conference calls and spreadsheets before somebody knew how big the problem was.Data can be overwhelming
Somehow the term information overload isn't trending anymore. Compared to 2004, just 20% of people search for answers to information overload these days (according to Google Trends). However, it is obvious that few people really know what relevance that newfound "treasure chest" of data actually has, and thus are wasting their time with useless data. That leads to further delays in making decisions, which is a behavior, spreading like a disease, once thought to be contained to large corporations but now becoming pervasive even in nimble start-ups.Additional data can be misleading
Data scientists are constantly warning about the danger of uncontrolled data visualization, where pretty correlations are mistaken for causations and lead to wrong conclusions. That spike in sales shortly after a great increase in Facebook "likes" certainly means we should do another Facebook campaign, shouldn’t we?
While all of the points above are rather straightforward, less obvious is a more fundamental psychological reaction to more data: making wrong decisions due to the active pursuit of additional data, even if it was deemed non-instrumental beforehand (‘non-instrumental’ = should not influence decision). In a more than 25-year-old study, researchers found that people felt compelled to take data into account as soon as it was offered. And when they do, they disregard their original criteria to make a decision and overweight the additional information.
Imagine your data scientists created a well-balanced media-mix-model, and just before committing budgets, somebody suggested waiting for the latest Nielsen-Twitter data to tell you about engagement from unique authors with the shows in your media plan. According to the psychological study mentioned above, you will most probably wait for the outcome, and you will be influenced in your go-ahead decision based on the relative weighting of the tweet-engagements -- even it has no relevance to your model whatsoever. "Dancing with the Stars" had just 26,000 unique authors, vs. the 34,000 of "The Voice," both at 66,000 tweets, but you don’t have "The Voice" in your media plan? Does it matter? Should it matter?
Ray of light
Kimberly-Clark CMO Clive Sirkin was recently
quoted saying, "We can't see where we are supposed to go, so we naturally go where data is.”
As a company moves from "small data" to Big Data, it makes sense get your initial batch of data sources connected to the business intelligence infrastructure and start populating first models. However, once the confidence in those models is high and the expected data is flowing in, it’s time to avoid the pitfalls of more data, and go where you have planned to go. A good data management strategy will precisely document these choices.
It’s just like the old story about the lost keys in a dark alley. At the beginning, it makes sense to search where the light (data) is. But once you are able to form a solid model retracing your steps from the bar to the car and get your own flashlight, you need to follow that search plan relentlessly. Just because somebody turned on a big flood lamp on the other side of street, doesn’t mean you should go and search there.
So always ask yourself if you really need more data. Because if you don't know what to do with that data, don't even bother with it.