What Really Is Machine Learning?

AI & NLP

Did you ever get burned by a hot object as a child and vow never to make the same mistake again? That was a critical moment of learning. Through a variety of experiences, we develop and become better at navigating the world around us. In addition to humans, this ability is also possessed by animals, plants (yes, really), and, most recently, Machines 🤖. There is a lot of marketing buzz surrounding Machine Learning, Deep Learning, and Artificial Intelligence – and a lot of nonsense. Our aim is to help you make sense of these terms with this article. So let’s get to it! 🚀

Within the Artificial Intelligence bubble, Machine Learning and NLP intersect, Deep Learning is within Machine Learning. — This diagram is a simple portrayal of how the different AI terms interconnect. As the article advances, you can use this as a reference to look back and see their connections.

Artificial Intelligence: The Basis of Machine Learning

Let us begin with a generic term, one that has been around for a long time: Artificial Intelligence (AI).

AI describes the ability of a machine to interpret and respond to inputs in an intelligent way. This discipline is decades old, with the first research stemming from the 1950s. Over time AI had its ups and downs – ‘winters,’ where it was deemed a dead discipline, and summers where every ice-cream shop thought it needed to jump on the bandwagon 🚎. The most recent AI hype is mostly due to a specific method within the AI universe: Machine Learning.

Machine Learning Algorithms rely on data instead of specific rules to make their decisions. You have most probably applied a Machine Learning algorithm yourself already, even before it became ‘hip’. We’re talking about linear regression (or more colloquial “line-fitting”): Fitting a curve to best describe some (dirty) data. This is a two-parameter Machine Learning model.

Linear Regression

Obviously, a lot of the problems we want to solve are not describable with such a simple linear relationship. That’s where Neural Networks and Deep Learning (DL) come into play.

Neural Networks & Deep Learning

Neural Networks are a subset of Machine Learning algorithms. They have been around since the dawn of AI – some 70 years ago! But while they have lived the destiny of sleeping beauty 👸 💤 for most of their existence, in recent years they have risen to power and been at the core of the most recent AI frenzy.

Neural networks are loosely modeled after the human brain 🧠 with neurons propagating information forward, often over multiple “layers”. Stacking a lot of these layers onto each other is then called ‘Deep Learning’. The power of these massive networks, which often have 100s of millions of trainable weights (as opposed to the two weights of linear regression) is that they can model – and thus – learn almost any logic.

Machine Learning: Sentiment Analysis & Natural Language Processing

Let’s illustrate the differences with a concrete example: Sentiment Analysis .

Problem Statement: We want our software to classify the sentences we provide it with into “Positive”, “Neutral” and “Negative”.

By the way: This widely used method is part of the “Natural language processing” domain, a class of algorithms that deals with human language (either written or oral). Natural language processing is a subset of the AI family. Learn more about NLP here.

A simple rule-based system (which counts as AI as well) might have a list of words that count as “positive” or “negative”. For example, it might say that sentences containing the words “Good”, Great” and “Happy,” should be considered to be “Positive”. However, just adding the word “Not” before one of these positive indicators changes the sentiment entirely. So you would have to implement some more logic that if any of the “Positive” words are preceded by “Not” the sentiment is inversed. But what about the sentence “Great service. Not.” Further rules would be needed to cover this case as well. As you can see it is a daunting and never-ending task to dissect language based on fixed rules.

That’s why most of today’s cutting-edge sentiment analysis tools rely on Machine Learning: Instead of defining which words mean what, you show the algorithm a large number of examples, from which it will deduct the rules on what is positive and negative itself. How does that work concretely? Let’s look at one training step:

You pass the software an input example: “I liked the movie.”
You ask the software to predict the sentiment of this sample. As it’s not trained yet, the output is basically random – it might answer “Negative”.
You correct it, providing the correct answer “Positive”. Then you tell the system to adjust its internal rules, such that its prediction for this specific example would have been correct.
Repeat. A lot.

By repeating this over and over again, showing the system 100’000+ examples of its internal rules will become a better and better model of our language. The goal is to get the model to generalize well, meaning its internal rules actually capture the essence of what we want it to understand and not capture some spurious correlation.

How Does Caplena Employ Machine Learning?

Caplena uses advanced Machine Learning models like Transformers to detect topics in your data, do sentiment analysis and classify text comments into various categories. But how is it that you can customize the topics you want without, or, only very few training examples, as opposed to the 100,000+ examples mentioned above? The answer to this includes Pre-Training, Transfer Learning, and Augmented Intelligence. But those are topics for a next time 😊

PSST…. Would you like a FREE trial?

Caplena conducts sentiment analysis using AI to understand how your customers or employees genuinely feel so you can spend less time on the analysis and more time on the results! 🏆️

Let's Begin!

Useful Links:

Are you a developer beginning Machine Learning? Click here for a useful beginner-level blog.

Does AutoML work for Diverse Tasks?

Who is responsible when AI is irresponsible?