What is Scikit-Learn?
In this course, we will learn how to construct various machine learning algorithms from scratch. In the real world, however, we don’t want to recreate a complex algorithm every time we want to use it. Writing an algorithm from scratch is a great way to understand the fundamental principles of why it works, but we may not get the efficiency or reliability we need.
Scikit-learn is a library in Python that provides many unsupervised and supervised learning algorithms. It’s built upon some of the technology you might already be familiar with, like NumPy, pandas, and Matplotlib!
The functionality that scikit-learn provides include:
- Regression, including Linear and Logistic Regression
- Classification, including K-Nearest Neighbors
- Clustering, including K-Means and K-Means++
- Model selection
- Preprocessing, including Min-Max Normalization
As you move through Codecademy’s Machine Learning content, you will become familiar with many of these terms. You will also see scikit-learn (in Python, sklearn
) modules being used. For example:
sklearn.linear_model.LinearRegression()
is a Linear Regression model inside the linear_model
module of sklearn
.
The power of scikit-learn will greatly aid your creation of robust Machine Learning programs.
Happy Coding!
Author
'The Codecademy Team, composed of experienced educators and tech experts, is dedicated to making tech skills accessible to all. We empower learners worldwide with expert-reviewed content that develops and enhances the technical skills needed to advance and succeed in their careers.'
Meet the full team