09.10.2023 Views

Advanced Data Analytics Using Python_ With Machine Learning, Deep Learning and NLP Examples ( 2023)

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

CHAPTER 3

Supervised Learning

Using Python

In this chapter, I will introduce the three most essential components of

machine learning.

• Dimensionality reduction tells how to choose the most

important features from a set of features.

• Classification tells how to categorize data to a set of

target categories with the help of a given training/

example data set.

• Regression tells how to realize a variable as a linear or

nonlinear polynomial of a set of independent variables.

Dimensionality Reduction with Python

Dimensionality reduction is an important aspect of data analysis. It is

required for both numerical and categorical data. Survey or factor analysis

is one of the most popular applications of dimensionality reduction. As an

example, suppose that an organization wants to find out which factors are

most important in influencing or bringing about changes in its operations.

It takes opinions from different employees in the organization and, based

on this survey data, does a factor analysis to derive a smaller set of factors

in conclusion.

© Sayan Mukhopadhyay 2018

S. Mukhopadhyay, Advanced Data Analytics Using Python,

https://doi.org/10.1007/978-1-4842-3450-1_3

49

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!