09.10.2023 Views

Advanced Data Analytics Using Python_ With Machine Learning, Deep Learning and NLP Examples ( 2023)

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 7

Analytics at Scale

Figure 7-2. Details of the summarization pattern

Filtering Pattern

In MapReduce filtering is done in a divide-and-conquer way (Figure 7-3).

Each mapper job filters a subset of data, and the reducer aggregates the

filtered subset and produces the final output. Generating the top N records,

searching data, and sampling data are the common use cases of the

filtering pattern.

Figure 7-3. Details of the filtering pattern

160

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!