Google is not a proxy for big data. Ah-ah-achoo!!
Google is a great source of data for Google. Twitter is a great source of data for Twitter. Facebook is a great source of data for Facebook. Remember more data mean more skepticism, not less. Correlation doesn’t mean causation.
Another year, another report about how inaccurate the Google Flu Trends predictions turned out to be for the previous year. And more warnings about the dangers of relying on Google, and therefore “big data” and algorithms, for important stuff.
Repeat after me: Google is not a proxy for big data. It also isn’t supposed to replace the Centers for Disease Control. Even it wouldn’t make that claim.
I made the same argument in more detail when this concern popped up last year, but here it is in a nutshell: “Big data” isn’t the enemy, it’s a friend. So are algorithms. But they must be used correctly.
Google is a great source of data for Google. Twitter is a great source of data for Twitter. Facebook is a great source of data for Facebook. For everyone else, they’re just additional sources of data of varying value depending on what’s being…
View original post 182 more words