Causality and Big Data

January 26, 2015
1.2K views
3 mins read

You see there is only one constant. One universal. It is the only real truth: Causality. Action, reaction. Cause and effect.

The Frenchman

America Online’s 2006 crisis and Edward Snowden’s disclosure got great amounts of public attention, criticizing surveillance parameters and the ethics of big data usage. In this article, I want to look beyond privacy and rather talk about institutionalization.

The first thing we should ask is fundamentally how does a search engine work? As any statistical correlation study, big data also starts with sampling but rather than random distribution it collects everything possible where sampling becomes equal to all of the data. Interpreters then make several correlation studies, but usually interpretation ends up with probabilistic causality (factors increase the probability of another). This is normal because as the numbers get bigger, distortion becomes less visible and lets us see things that we have never thought about. Though as we have ‘smoking causes cancer’ signs on cigarette packages, we also should have ‘big data does not mean causality’ because things are getting wrong without any regulation and we have only started to see its implications.

Causality is the relationship between one specific event with another and is generally used to express the cause and the effect. From a logical point it is defined as, “If x is a necessary cause of y, then the presence of y necessarily implies the presence of x. The presence of x, however, does not imply that y will occur.” In reality every cause is an effect of another, forming an infinite casual chain. Mathematical equations have the constant K as a hidden variable to represent the influence of freewill, therefore, we can disregard freewill to manipulate information. Consequently, choice becomes meaningless and people become passive actors in the world not changing anything. It seems awkward, but with big data correlations, we are facing it every day.

Internet neutrality is a principle that means service providers use all sorts of data in a similar way rather than discriminating any type. A neutrality clause in human rights also protects users from telecom companies that may try channeling the users forcefully to their own sites for economic benefits. Although digital freedoms are promising, we are still missing data and algorithm neutrality. Consequently, companies like Facebook tell us which friends we are more close to by looking to our “like” clicks. Is it a correlation or causality? What are the probable consequences?

An insurance company may fine you higher according to your unhealthy food Foursquare check-ins; a bank may give you a loan based on your Amazon consumption rate, and maybe a resume analyzer compares our wordings with successful candidates. There are many other examples out there and most of them are improving our lives in a positive way, but can they be perfectly knowledgeable about all factors involved? Or from another perspective, do we still have free will? Maybe not. A prayer gathering may be seen as a probabilistic terrorist event, or irregular social network behavior can signal a person’s criminal behavior. Don’t get me wrong, big data is not bad at all. It also shows other correlations related to the same event if the analyst is asking the right questions and properly regarding room for error in correlation. That’s why a person flying with an eyebrow-raising name (Bin Laden for instance) can enter to the US without any immigration problems.

We need to set rules and regulations to at least educate ourselves rather than inflict punishment on people unjustly because most of the time different types of information from different sources do not align perfectly. Otherwise, we could have disregarded Thomas Edison, Albert Einstein or Steve Jobs because of their childhood success signals. Were they just lucky? Luck is not a rational explanation, and it is more likely that we are missing some indicators from our analysis.

Moreover, can we say that certain groups of people owning power are exploiting the means of information? Probably not. After all, we do not know the causality of their cruel manner but there is one thing certain, we should find a way to keep neutrality. Viktor Mayer Schönberger and Kenneth Cukier’s book Big Data comes up with several solutions. They recommend usage of general topics rather than smaller relationships, accepting the messiness of data, phrasing the usage as correlation instead of causality, redefining the justice and freedom act, forming lawyers to fight data cases and lastly, reducing permanent storage memory.

At last, we are humans, not games defined by rules to be as predictable, as chess peons. We are clearly not looking for a dictatorship. Causality is an infinite chain, and YouTube will never know why I liked that video, so I just use data and not let data use us.

Erdem has recently graduated from Yonsei University's master's degree program, majoring in International Finance (minor in Global Strategy). Currently he is assisting Temka Tour's entrepreneurial drive. He has job experience in various business fields. Besides he is co-authoring a book about new management paradigms based on adoption of technological advancements.

Leave a Reply

Your email address will not be published.

Previous Story

Decentralization of Production and Customization Culture

Next Story

Technological Advances in M-health

Latest from Technology & Innovation

oil well

Energy Reforms in Latin America: What the Mexican State Needs to Learn

The growing importance of developing countries’ national oil companies to the global supply-demand balance raises questions about the emerging policies of association, objectives and regulations of these organisations. In particular, shifts in those policies will have a great impact on the future development of global oil and gas markets, not to mention the socioeconomic development of the companies’ host countries. National oil companies are expected to control a greater proportion of future oil and gas supplies over the next two decades, as these commodities in the mature producing regions of the OECD countries continue to show natural decline of supply.
Kizilgaha beacon tower

Beacons Over Mars

Lets begin with a question: How is trade created? At first an intellectual being defines a phenomenon and then that person (or maybe someone else who has somehow acquired that knowledge) generates an interest for it. Spontaneously that interest defines a need which ends up with an exchange. The difficulty of this process can vary; sometimes rocket science is required to define a phenomenon but sometimes it is as easy as lighting a beacon. For instance the Silk Road of the ancient world was not even a leveled sand road but rather several beacons lit by the locals to invite trade caravans. As

Can We Deploy THAAD Now?

South Korean defense officials let out a collective groan on May 9, 2015, as North Korea reportedly conducted a successful test-fire of a submarine-launched ballistic missile (SLBM). International news websites and security blogs were inundated with photos released by the Korean Central News Agency (KCNA) of a proud Kim Jong-Un personally overseeing the test launches. The dramatic photos also show the moment the SLBM exited the water. Bukgeuksung-1 (북극성-1), presumably the name of the missile, is seen painted on the side. Though still in its nascent stages, if the reports are accurate, an operational SLBM is alarming for a number

Iran: Why Nuclear Weapons Made Sense

The nuclear talks between Iran and the Six Powers have dominated headlines for the first week of April. After days of tense negotiations, a deadline extension, and flaring tempers, an agreed upon framework has finally emerged. And, unsurprisingly, there has been no shortage of critics. The more hawkish American and Israeli lawmakers voiced displeasure. Iranian hardliners found it inadequate. And both countries seem to be reporting different versions of the framework. And to top it all off, there still is no concrete deal in place. March 31st was only a soft deadline important to only the United States to placate

What is the Future of the Assembly Line?

Production changed drastically during the last century and is continuing doing so without loosing any momentum. The most recognized technological shift occurred with Henry Ford’s use of the assembly line and progress has continued in a variety of fashions. One of these that is often overlooked is the use of robotization, an example being KUKA robots, which have reinvigorated factories over the past decades. Through this robotization and computerization more efficient solutions have been brought to the industry and now we are heading to another techno-climax. In this decade we will look for answers to the following questions: What will we consume and how how we

Don't Miss