Causality and Big Data


“You see there is only one constant. One universal. It is the only real truth: Causality. Action, reaction. Cause and effect.” The Frenchman

America Online’s 2006 crisis and Edward Snowden’s disclosures drew a great deal of public attention and prompted criticism of the parameters of surveillance and the ethics of big data usage. In this article, I want to look beyond privacy and talk instead about institutionalization.

The first thing we should ask is fundamental: how does a search engine work? Like any statistical correlation study, big data starts with sampling, but rather than drawing a random sample it collects everything possible, so that the sample becomes equal to all of the data. Interpreters then run several correlation studies, but the interpretation usually ends up as probabilistic causality (one factor increases the probability of another). This is understandable, because as the numbers get bigger, distortion becomes less visible, and the data lets us see things we had never thought about. Yet just as we have ‘smoking causes cancer’ warnings on cigarette packages, we should also have ‘big data does not mean causality’ warnings, because without any regulation things are going wrong, and we have only started to see the implications.

Causality is the relationship between one specific event and another and is generally used…

