Learning Library

← Back to Library

Machine Learning Basics: Supervised Learning

Key Points

  • AI is the broad concept of machines mimicking human problem‑solving, with machine learning (ML) as a data‑driven subset that learns from examples, and deep learning as a further subset that automates feature extraction for massive datasets.
  • The talk focuses on ML, specifically its two main supervised learning approaches: classification (grouping data into predefined categories) and regression (modeling relationships with weighted input variables).
  • A real‑world classification example is customer churn prediction, where labeled historical customer behavior trains a model to flag likely churners so businesses can intervene and retain them.
  • Regression techniques build mathematical equations that assign weights to input features to predict continuous outcomes, enabling more nuanced forecasting beyond simple category assignment.

Full Transcript

# Machine Learning Basics: Supervised Learning **Source:** [https://www.youtube.com/watch?v=9gGnTQTYNaE](https://www.youtube.com/watch?v=9gGnTQTYNaE) **Duration:** 00:08:22 ## Summary - AI is the broad concept of machines mimicking human problem‑solving, with machine learning (ML) as a data‑driven subset that learns from examples, and deep learning as a further subset that automates feature extraction for massive datasets. - The talk focuses on ML, specifically its two main supervised learning approaches: classification (grouping data into predefined categories) and regression (modeling relationships with weighted input variables). - A real‑world classification example is customer churn prediction, where labeled historical customer behavior trains a model to flag likely churners so businesses can intervene and retain them. - Regression techniques build mathematical equations that assign weights to input features to predict continuous outcomes, enabling more nuanced forecasting beyond simple category assignment. ## Sections - [00:00:00](https://www.youtube.com/watch?v=9gGnTQTYNaE&t=0s) **AI vs. ML vs. Deep Learning** - An IBM Data Platform Solution Engineer clarifies the differences among artificial intelligence, machine learning, and deep learning before introducing supervised learning as a primary machine‑learning approach. - [00:03:20](https://www.youtube.com/watch?v=9gGnTQTYNaE&t=200s) **Regression and Clustering in ML** - The passage explains regression for predicting airline ticket prices and then introduces unsupervised clustering for customer segmentation. - [00:06:35](https://www.youtube.com/watch?v=9gGnTQTYNaE&t=395s) **Reinforcement Learning for Autonomous Driving** - The speaker explains reinforcement learning as a semi‑supervised method where an agent learns via rewards and penalties in an environment, illustrating it with self‑driving cars and urging viewers to explore machine‑learning concepts further. ## Full Transcript
0:00Hey, what's up everyone? 0:01My name is Luv Aggarwal, and I’m a Data Platform Solution engineer for IBM. 0:06Machine Learning. 0:07There's no doubt that this is an incredibly hot topic with significant interest from both 0:11business professionals as well as technologists. So let's talk about what machine learning, 0:16or ML, is. 0:17So, before we get too far into the details, I want to take a minute to talk about some 0:21terms that are often used interchangeably but have certain differences. 0:26Terms like “artificial intelligence”, “machine learning”, and even “deep learning”. 0:37So, at the highest level, AI is defined as leveraging computers or machines to mimic 0:45the problem-solving and the decision-making capabilities of the human mind. 0:50And machine learning is a subset within AI that's more focused on the use of various self-learning 0:58algorithms that derive knowledge from data in order to predict outcomes. 1:03And then, finally, deep learning is a further subset within even machine learning, and deep 1:08learning is often thought of as scalable machine learning because it automates a lot of the 1:13feature extraction process away and eliminates the some of the human intervention involved 1:19to enable the use of some really, really big data sets. 1:22But for today we'll focus just on machine learning, so we'll get rid of the other two 1:26and dive one level deeper and talk about the different types of machine learning. 1:31Ok. So, the first type that we have is called “supervised learning”. 1:42And this is when we use labeled data sets to train algorithms to classify data or predict outcomes. 1:49And when I say labeled, I mean that the rows in the data set are labeled, tagged, or classified 1:55in some interesting way that tells us something about that data. 1:58So, it could be a yes or a no, or it could be a particular category of some, you know, 2:04different attribute. 2:06OK, so how do we apply supervised machine learning techniques? 2:10Well, this really depends on your particular use-case. 2:14We could be using a classification model 2:23which recognizes and groups ideas or objects into predefined categories. 2:29An example of this in the real world is with customer retention. 2:39So, if you're in the business of managing customers, one of your goals is typically 2:43minimizing and identifying customer churn, right, which are customers that no longer 2:48buy a particular product or service, and we want to avoid churn because it's almost always 2:54more costly to acquire a new customer than it is to retain an existing one, right? 3:00So, if we have historical data for the customer, like their activity - whether they churned 3:07or not, right - we can build a classification model using supervised machine learning, and 3:12our labeled data set that will help us identify customers that are about to churn, and then 3:18allow us to take action to retain them. 3:22OK, so the other type of supervised learning is regression. 3:31Now, this is when we build an equation using various input values with their specific weights 3:36determined by the overall value of their impact on the outcome. 3:41And we use these to generate an estimate for an output value. 3:45So, let me give you another example here. 3:48So, airlines rely heavily on machine learning, and they use regression techniques to accurately 3:56predict how much they should be charging for a particular flight, right? 4:00So, they use various input factors like, you know, days before departure, the day of the week, 4:05the departure, the destination to use these to predict an accurate dollar value 4:13for how much they should be charging for a specific flight that will maximize their revenue. 4:19OK, so now let's move on to the second type of machine learning which is 4:27“unsupervised learning”. 4:36OK, so this is when we use machine learning algorithms to analyze and cluster unlabeled 4:42data sets, and this method helps us discover hidden patterns or groupings without the need 4:47for human intervention, right? 4:49So, we're using unlabeled data here. 4:53So, again, let's talk about the different techniques for unsupervised learning. 4:58One method is “clustering”. 5:05And a real-world example of this is when organizations try to do 5:10customer segmentation. 5:15Right. 5:15So, when businesses try to do effective marketing it's really critical that they really understand 5:21who their customers are, right, so that they can connect with them in the most relevant way. 5:25And, oftentimes, it's not obvious or clear how certain customers are similar to or different 5:31from one another, right, and clustering algorithms can help take into account a variety of information 5:37on the customer like their purchase history, 5:44you know, their social media activity, or website activity, 5:51could be their geography, and much more, to group similar customers 5:58into buckets so that we can send them more relevant offers, provide them better customer 6:03service, and be more targeted with our marketing efforts. 6:08Ok. 6:10And the last point I want to touch on for unsupervised learning is 6:15called “dimensionality reduction”. 6:21So, we won't discuss this in detail in this video, but this refers to techniques that 6:26reduce the number of input variables in a data set so we don't let some redundant parameters 6:32over represent the impact on the on the outcome. 6:37Ok. 6:39So the last type of machine learning I want to talk about today is called 6:43“reinforcement learning”. 6:46Now, this is a form of semi-supervised learning where we typically have an agent or system 6:56take actions 7:01in an environment. 7:05Now the environment will then either reward the agent for correct moves, 7:12or punish it for incorrect moves. Right. 7:15And, through many iterations of this, we can teach a system a particular task. 7:20Now a great example of this method in the real world is with self-driving cars. 7:25So, autonomous driving has several factors, right? 7:28There's the speed limit, there are drivable zones, there are collisions, and so on. 7:33So, we can use forms of reinforcement learning to teach a system how to drive by avoiding 7:39collisions, following the speed limit, and so on. 7:43OK, so we covered many topics today, but you know, 7:47we've barely scratched the surface of each one. 7:49If you found any one particular aspect of machine learning interesting, I encourage 7:53you to dive deeper and learn more about it. And if you want to know what are some of the 7:58common machine learning algorithms and how to leverage them in data science, please check 8:03out some of the links in the description. 8:05Thank you. 8:06If you have questions please drop us a line below, and if you want to see more 8:10videos like this in the future, please like and subscribe. 8:14And don't forget, you can grow your skills and earn a badge with IBM Cloud Labs, 8:18which are free browser-based interactive Kubernetes labs.