At Keenalytix, we love words just as much as we love data. Better, we strongly believe in the necessity for analytics as a profession to learn how to express itself (its approach, its models, its results) in plain English, as explained in our last post. Our love affair with words materializes in a passion for etymology, the study of origins and meaning of words.
This led us to today’s post: what does analytics even mean? Where does the word come from? And more importantly, can the understanding of the word’s origins and meaning help us improve as analytics professionals?
So, let’s explore the history of analytics. Like many good things in modern life, our story starts in Greece, a long, long time ago…
Loosen up!
The concept of analysis emerged in Ancient Greece and literally means “the resolution of anything complex into simple elements”. The word itself is the combination of ana (“up, back or throughout”) and lysis (“to loosen, divide, cut apart”).
In other words, one analyzes a problem by decomposing it into smaller elements to understand how they are connected to each other. These elements are typically called variables or factors in modern analytics. The connection between them is what your data scientists call a model.
This definition brings home two critical ideas taught in every statistics class and (hopefully) used by all practitioners every day:
- Analytics is, fundamentally, all about understanding relations between variables. While it is always good to start with some simple, descriptive metrics (e.g. average sales, price variance, etc.), the true value of analytics lies in building models aimed at representing how these variables impact each other. The quality of these models (their accuracy) depends on how closely they represent the real-life interactions between their variables.
- The variables in a model need to be independent from each other. Going back to the original meaning of analysis helps us understand why. To truly understand (and eventually solve) a problem, one first needs to divide it into its smaller components. The division shouldn’t stop until the core elements of the problem have been identified.
These two ideas (independence and interrelations) are fundamental in analytics. However, they are not enough. The (proposed) interrelations between variables also have to make logical sense. Enters the first data scientist in history.
We're all descendants of Aristotle
The first (known) use of the word “analytics” comes from no less than Aristotle himself, almost 2500 years ago. Interestingly, the word appeared in Aristotle’s treatise on Logic, now known as The Organon. Yes, for Aristotle, analytics belonged in logic. So much so that two of the six books in the Organon are titled “Prior Analytics” (on deductive reasoning) and “Posterior Analytics” (on demonstration and scientific knowledge).
Granted, Aristotle wasn’t building random forest models in Python but the analytical process he developed in The Organon provided the conceptual foundations for the work that modern scientists around the globe do. Critically, he emphasized the absolute necessity of using logic in any analytics endeavor.
You may be wondering: how is Aristotle’s work on Logic relevant to solving 21st-century business problems using tons and tons of data and cutting-edge technology?
Here is how: the lack of logic can lead to the wrong decisions. While this statement is certainly more applicable to politics than it is to business, companies are not immune to drawing the wrong conclusions because of a lack of logical thinking, even with (or, rather, despite) solid data and analytics to back it off. Logic is what makes the difference between simple correlations (especially the spurious ones) and truly causal relationships. Logic also takes away potential biases the business leaders could base their decision on. Finally, logic can compensate for bad or incomplete data.
Analysis vs. Synthesis
Lastly, we can also define Analytics by what it is not. The antonym of analysis is synthesis, which also comes from Ancient Greek and literally means “putting things together”. So, in essence, analyzing is the opposite of synthesizing: the analysis will break a problem into pieces and try to understand their logical interactions, whereas the synthesis will put all these pieces together and tackle the problem as a whole.
This helps understand the challenges in communication between business leaders and analytics professionals that we described in our previous post. Business leaders are, generally speaking, excellent at synthesizing problems and focusing on the big picture; it is even a core skill taught and nurtured in business schools. On the other hand, analytics professionals need to go into more details and study the various elements of a problem to solve it.
It gives us a solution to start addressing the communication gaps between the two groups: analytics professionals would greatly benefit from learning how to synthesize their work, especially their results. Presenting “big picture” findings as opposed to focusing on the details would go a long way in getting the business’ attention.
Relevance for the modern analyst
Going back to the roots of what analytics really means, with even a small touch of philosophy, provided us with surprisingly applicable findings for analytics professionals:
- Make sure your variables are truly independent from each other, i.e., did you really get to the bottom of your problem?
- Focus your modeling efforts on relations between variables, i.e., are you representing a causal relationship or a mere correlation?
- Think logically, i.e., does your model make sense outside of your lab?
- Learn to synthesize, i.e., are you communicating to your business leaders in a way they are comfortable with?