Learning Library

← Back to Library

Supervised vs Unsupervised Learning Explained

Key Points

  • Supervised learning trains models on labeled data, enabling them to predict known output categories (classification) or continuous values (regression) and to measure accuracy during training.
  • Unsupervised learning works without labels, discovering hidden structures through tasks such as clustering (e.g., customer segmentation), association rule mining (e.g., market‑basket analysis), and dimensionality reduction (e.g., noise‑removing autoencoders).
  • The main distinction is that supervised models require explicit input‑output pairs to learn and generalize, whereas unsupervised models infer patterns purely from the input data.
  • Choosing between them depends on whether you have annotated data and a specific prediction goal (supervised) or need to explore and extract insights from raw data without predefined targets (unsupervised).

Full Transcript

# Supervised vs Unsupervised Learning Explained **Source:** [https://www.youtube.com/watch?v=W01tIRP_Rqs](https://www.youtube.com/watch?v=W01tIRP_Rqs) **Duration:** 00:07:02 ## Summary - Supervised learning trains models on labeled data, enabling them to predict known output categories (classification) or continuous values (regression) and to measure accuracy during training. - Unsupervised learning works without labels, discovering hidden structures through tasks such as clustering (e.g., customer segmentation), association rule mining (e.g., market‑basket analysis), and dimensionality reduction (e.g., noise‑removing autoencoders). - The main distinction is that supervised models require explicit input‑output pairs to learn and generalize, whereas unsupervised models infer patterns purely from the input data. - Choosing between them depends on whether you have annotated data and a specific prediction goal (supervised) or need to explore and extract insights from raw data without predefined targets (unsupervised). ## Sections - [00:00:00](https://www.youtube.com/watch?v=W01tIRP_Rqs&t=0s) **Understanding Supervised and Unsupervised Learning** - The passage explains how supervised learning uses labeled data for tasks like classification and regression, while unsupervised learning operates without labels, outlining their differences and common algorithm examples. - [00:03:06](https://www.youtube.com/watch?v=W01tIRP_Rqs&t=186s) **Supervised vs Unsupervised Learning** - The passage contrasts supervised learning, which uses labeled data to train predictive models, with unsupervised learning, which autonomously finds structure in unlabeled data, illustrated by commute‑time predictions and image‑clustering examples, and mentions autoencoders for noise reduction. - [00:06:16](https://www.youtube.com/watch?v=W01tIRP_Rqs&t=376s) **Machine Learning in Medical Imaging** - The segment explains how radiologists can label a small sample of CT scans to train machine learning models that predict which patients need further attention, outlines choosing appropriate models and the supervised‑vs‑unsupervised decision, and ends with a call to action for viewer engagement. ## Full Transcript
0:01Supervised and unsupervised learning are two core components in building machine learning models. 0:07So what's the difference? 0:09Well, just to cut to the chase: 0:11supervised learning, that uses labeled input  and output data, 0:15while an unsupervised learning model doesn't. 0:19But what does that really mean? 0:20Well, let's better define both learning models, 0:24go deeper into the differences between them 0:28and then answer the question of which is best for you. 0:32Now, in supervised learning, the machine learning algorithm is  trained on a labeled dataset. 0:38So this means that each example in the training dataset, the algorithm knows what the correct output is. 0:44And the algorithm uses this knowledge to try to generalize to new examples that it's never seen before. 0:50Now, using labeled inputs and outputs, the model can measure its accuracy and learn over time. 0:56Supervised learning can be actually divided into a couple of subcategories. 0:59Firstly, there is a category of classification. 1:06And classification talks about whether the output is a discrete class label 1:11such as "spam" and "not spam". 1:14Linear classifiers, support vector machines, or SPMs, decision trees, random forests - 1:19they're all common examples of classification algorithms. 1:24The other example is regression. 1:30The output here is a continuous value, such as price or probability. 1:36Linear regression and logistic regression are two common types of regression algorithms. 1:43Now, unsupervised learning is where the machine learning algorithm is not really given any labels  at all. 1:49And these algorithms discover hidden patterns in data without the need for human intervention. 1:56They're unsupervised. 1:57Unsupervised learning models are used for three main tasks, such as clustering, association and dimensionality reduction. 2:05So let's take a look at each one of those, starting with clustering. 2:12Now clustering is where the algorithm groups similar experiences together. 2:17So a common application of clustering is customer segmentation, 2:20where businesses might group customers together based on similarities like, 2:24I don't know, age or location or spending habits, something like that. 2:28Then you have association. 2:32And association is where the algorithm looks for relationships between variables in the data. 2:38Now association rules are often used in market basket analysis, 2:41where businesses want to know  which items are often bought together. 2:45You know, something along the lines of, "customers who bought  this item also bought ", that sort of thing. 2:52The final one to talk about is dimensional ... 2:58dimensional reduction. 2:59And this is where the algorithm reduces the number of variables in the data, 3:03while still preserving as much of the information as possible. 3:06Now, often this technique is used in the pre-processing data stage, 3:10such as when autoencoders remove noise from visual images to improve picture quality. 3:15Okay, so let's talk about the differences between these two types of learning. 3:19In supervised learning, the algorithm learns from training datasets by iteratively making predictions on the data 3:26and then adjusting for the correct answer. 3:29While supervised learning models tend to be more accurate than unsupervised learning models, 3:33they do require all of this up-front human intervention to label the data appropriately. 3:38For example, a supervised learning model can predict how long your commute will be 3:43on the time of day and thinking about the weather conditions and so forth. 3:48But first you'll have to train it to know things like rainy weather extends the driving time. 3:55By contrast, unsupervised learning models work on their own to discover the inherent structure of unlabeled data. 4:03These models don't need humans to intervene. 4:05They can automatically find patterns in data and group them together. 4:08So, for example, an unsupervised learning model can cluster images by the objects they contain 4:13- things like people and animals and buildings - 4:18without being told what those objects were ahead of time. 4:22Now, an important distinction to make is that unsupervised learning models don't make predictions. 4:28They only group data together. 4:29So if you were to use an unsupervised learning model on that same commute dataset, 4:34it would group together commutes with similar conditions  like the time of day and the weather, 4:38but it wouldn't be able to predict how long each commute would take. 4:42Okay, so which of these two  options is right for you? 4:46In general, supervised learning is more commonly used than unsupervised learning, 4:52and that's really because it's more accurate and efficient. 4:54But that being said, unsupervised learning has its own advantages. There's two that I can think of. 4:59Firstly, unsupervised learning can be used on data that is not labeled, 5:04which is often the case in real world datasets. 5:06And then secondly, unsupervised learning can be used to find hidden patterns in data that supervised learning models just wouldn't find. 5:13Classifying big data can be a real challenge in supervised learning, but the results are highly accurate and trustworthy. 5:20And in contrast, unsupervised learning can handle large volumes of data in real time. 5:26But there's a lack of transparency into how that data is clustered and a high risk given accurate results. 5:31But wait, it is not an "either/or" choice. 5:36May I present to you the middle ground known as semi-supervised learning. 5:44This is, well, a happy medium where you use a training data set with both labeled and unlabeled data. 5:53And it's particularly useful when it's difficult to extract relevant features from data when you have a high volume of data. 5:59So, for example, you could use a semi-supervised learning algorithm on a data set with millions of images 6:05where only a few thousand of those images are actually labeled. 6:09Semi-supervised learning is ideal for medical images, where a small amount of training data could lead to a significant improvement in accuracy. 6:17For example, a radiologist can look at and label some small subset of CT scans for tumors or diseases, 6:24and then the machine can more accurately predict which patients might require more medical attention 6:29without going through and labeling the entire set. 6:32Machine learning models are a powerful way to gain the data insights that improve our world. 6:38The right model for your data depends on the type of data that you have and what you want to do with it. 6:45And the choice between supervised and unsupervised learning is only the first step. 6:53If you have any questions, please drop us a line below. 6:55And if you want to see more videos like this in the future, please like and subscribe. 7:00Thanks for watching!