What is Data Science?
Data Science is the field of study that combines the technical capacities of computer science, mathematics (especially statistics and linear algebra), and domain-specific expertise to extract useful insights from noisy data sets.
The value data science brings to organizations comes in the form of enabling better decisions by being better informed. Through discovering and clearly communicating insights and the story that they tell, data scientists help leaders improve their business operations. The effectiveness of data science is due to the combination of advanced software, robust methods and the massive compute power available today. Data science applications can be found in everything from small data sets to large, complex, high velocity data streams.
Python is a popular programming language for data science tasks due to the ease of writing Python code and the numerous libraries that have been developed to make data science tasks easier. These libraries include NumPy, Pandas, Polars, Matplotlib, SciPy, Scikit-Learn, Seaborn, and Folium. SQL (Structured Query Language) is a useful language to learn as it is commonly used to interact with databases.
Data Science Methodology
Adapted from the HackerNoon Data Science Heirarchy of Needs and the excellent YouTube video explanation by Joma Tech 'What REALLY is Data Science? Told by a Data Scientist'.