Learning Library

← Back to Library

Data Science: Definition, Types, and Lifecycle

Key Points

  • Data science is defined as extracting knowledge and insights from noisy data and converting those insights into actionable business decisions.
  • It sits at the intersection of computer science, mathematics, and business expertise, requiring collaboration across all three domains for true data‑science initiatives.
  • Different analytics methods serve varying business questions: descriptive (what is happening), diagnostic (why it happened), predictive (what will happen), and prescriptive (what should be done).
  • The data‑science lifecycle begins with solid business understanding to ensure the right problem is tackled, highlighting the critical role of domain knowledge.
  • After defining the problem, the process moves through data mining, data cleaning, and subsequent analytical steps to generate insights and recommendations.

Full Transcript

# Data Science: Definition, Types, and Lifecycle **Source:** [https://www.youtube.com/watch?v=RBSUwFGa6Fk](https://www.youtube.com/watch?v=RBSUwFGa6Fk) **Duration:** 00:07:51 ## Summary - Data science is defined as extracting knowledge and insights from noisy data and converting those insights into actionable business decisions. - It sits at the intersection of computer science, mathematics, and business expertise, requiring collaboration across all three domains for true data‑science initiatives. - Different analytics methods serve varying business questions: descriptive (what is happening), diagnostic (why it happened), predictive (what will happen), and prescriptive (what should be done). - The data‑science lifecycle begins with solid business understanding to ensure the right problem is tackled, highlighting the critical role of domain knowledge. - After defining the problem, the process moves through data mining, data cleaning, and subsequent analytical steps to generate insights and recommendations. ## Sections - [00:00:00](https://www.youtube.com/watch?v=RBSUwFGa6Fk&t=0s) **Defining Data Science and Its Scope** - The speaker defines data science, outlines its three‑discipline foundation of computer science, mathematics, and business, and introduces the hierarchy of analytics methods, beginning with descriptive analytics. ## Full Transcript
0:00let's talk about data science and some 0:01of the other related terms you may have 0:03heard such as predictive analytics 0:04machine learning 0:06advanced analytics and others 0:08so let's start with the textbook 0:10definition of data science 0:11so data science is the field of study 0:14that that involves extracting 0:18knowledge 0:20and 0:21insights 0:24from 0:26noisy 0:27data 0:31and then turning those insights 0:33into 0:35actions 0:36that our business or organization can 0:39take 0:40okay so let's dig into it a little bit 0:42more and discuss what are the different 0:45areas that are covered by data science 0:47so really data science is the 0:49intersection between three different 0:51disciplines 0:53we start with 0:55computer science 0:58then 0:59we also cover 1:02the area of mathematics 1:06and then what i think is the most 1:08important 1:10is business 1:14expertise 1:17so the intersection of these three 1:18disciplines is data science 1:21and true data science initiatives 1:23involve collaboration across all these 1:25three different areas 1:29okay so now let's touch on the different 1:31types of data science that you can do 1:33now what we need to understand here is 1:35that we have different data science 1:37methods 1:38for different questions that we might 1:40ask in an organization 1:42and these questions can vary by 1:44complexity and the value that we get out 1:46of them so let's chart them here 1:52by complexity 1:55and 1:57value 2:00okay so the first one that we have here 2:02is 2:04descriptive 2:06analytics 2:08so this is really about what is 2:11happening in my business right and it 2:13involves having accurate data collection 2:15to make sure that we know what's 2:16happening so a a good question we could 2:18ask here is 2:19well did sales go up or down 2:22the next level is 2:26diagnostic 2:28analytics 2:31and this is more about why did something 2:34happen so why did sales go up or down 2:36and it involves 2:38drilling down to the root cause of our 2:40problem 2:42now the next one that we have is 2:47predictive 2:48analytics 2:50so this is about what is likely to 2:52happen next 2:53right so what will our sales performance 2:56be next quarter 2:58and it involves using historical 2:59patterns in our uh in our data to 3:01predict outcomes in the future 3:05and then finally 3:07we have 3:09prescriptive 3:11analytics 3:14so this is about what do i need to do 3:16next what is the recommended best action 3:19for a particular outcome so question we 3:21could ask here is what do i need to do 3:23to improve sales by 10 3:25right 3:27okay 3:28so now we can talk about how data 3:29science is done and who actually does it 3:33so let's look at the data science life 3:36cycle and the first thing that we always 3:38must start with is 3:42business 3:45understanding 3:49so this is really critical to make sure 3:51that we're asking the right question 3:53before we go down a lengthy data science 3:57initiative 3:58and this is where you can see the having 4:00the business expertise and the domain 4:02expertise 4:03can be incredibly critical to make sure 4:05that we're asking the right questions 4:07okay so once we've defined that 4:10we can move on to 4:13data mining 4:16so this is this is the process of 4:18actually going out into our data 4:20landscape and procuring the data that we 4:22need for our analysis 4:24so once we've done that 4:26we can move on to 4:29data 4:31cleaning 4:34so 4:34the the reality of the marketplace is 4:37that 4:38once we when we find data it's probably 4:41not in the best format that we need it 4:43in and it probably has uh some some 4:46issues with it right it might have rows 4:48that have missing values it might have 4:50duplicates in it so there's some 4:52preparation and cleaning that we have to 4:54do before it's ready for our analysis 4:56so once we've done that cleansing 5:00we can move on to 5:03exploration 5:08okay so this is the part of the process 5:11that allows us to use different 5:13analytical tools that can start helping 5:15us answer some of 5:16the the types of questions that i 5:18mentioned here earlier 5:20and if we actually want to get into some 5:22of these higher value questions like 5:24predictive and prescriptive 5:26then we must start using advanced 5:28analytical tools such as 5:30machine learning tools 5:32that leverage massive amounts of 5:34computing power and massive amounts of 5:36high quality data 5:38to make predictions and prescribe 5:40actions for the future 5:43now once we've done our exploration and 5:45perhaps our advanced 5:47analytics 5:48what do we do next well we need to 5:51visualize 5:54our insights and outcomes of our 5:56analysis 5:58okay 5:59now i want to quickly touch on who does 6:01what 6:02in this life cycle 6:04so in an organization you may have roles 6:06like 6:07a business analyst 6:10you might have 6:11data engineers 6:13and then you might have 6:15data scientists 6:18so 6:19business analysts 6:20are obviously involved in formulating 6:23the questions they have the domain 6:24expertise they can help with the 6:26business understanding but they're also 6:28involved with 6:30visualizing our insights in a way that's 6:33useful for the business right 6:35and then we have folks like data 6:36engineering folks so these are the 6:38people that can help us find the data 6:42clean the data 6:44and then also help with some of the 6:45exploration 6:48we move on to our data scientists so 6:50these are the people that will really 6:52help us with the exploration they'll 6:54help us with the advanced machine 6:56learning techniques 6:57and they'll also assist in the 6:59visualization 7:01so you can see there's there's some 7:03overlap between the roles and that's why 7:05it's critical 7:07to have collaboration 7:09across 7:10these roles 7:11and what you also start seeing nowadays 7:13in the marketplace is that sometimes 7:16business analysts have to do some 7:17machine learning they have to help out 7:19with exploration 7:20data scientists sometimes need to go and 7:22find the data on their own so there's a 7:24lot of overlap and 7:26these different roles must collaborate 7:28with each other 7:30okay so i hope you can see now how the 7:32data science life cycle can help us take 7:35noisy data turn it into knowledge and 7:37insights and then turn it into 7:39meaningful action for our business 7:41thank you 7:43if you have questions please drop us a 7:44line below and if you want to see more 7:46videos like this in the future please 7:48like and subscribe