Advanced Data Analytics Using Python_ With Machine Learning, Deep Learning and NLP Examples ( 2023)
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
CHAPTER 3
Supervised Learning
Using Python
In this chapter, I will introduce the three most essential components of
machine learning.
• Dimensionality reduction tells how to choose the most
important features from a set of features.
• Classification tells how to categorize data to a set of
target categories with the help of a given training/
example data set.
• Regression tells how to realize a variable as a linear or
nonlinear polynomial of a set of independent variables.
Dimensionality Reduction with Python
Dimensionality reduction is an important aspect of data analysis. It is
required for both numerical and categorical data. Survey or factor analysis
is one of the most popular applications of dimensionality reduction. As an
example, suppose that an organization wants to find out which factors are
most important in influencing or bringing about changes in its operations.
It takes opinions from different employees in the organization and, based
on this survey data, does a factor analysis to derive a smaller set of factors
in conclusion.
© Sayan Mukhopadhyay 2018
S. Mukhopadhyay, Advanced Data Analytics Using Python,
https://doi.org/10.1007/978-1-4842-3450-1_3
49