Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems

from sklearn.compose import ColumnTransformer

full_pipeline = ColumnTransformer([
    ("num", num_pipeline, num_attribs),
    ("cat", OneHotEncoder(), cat_attribs),
])

housing_prepared = full_pipeline.fit_transform(housing)

First we import the ColumnTransformer class, next we get the list of numerical column names and the list of categorical column names, and then we construct a ColumnTransformer. The constructor requires a list of tuples, where each tuple contains a name, a transformer, and a list of names (or indices) of columns that the transformer should be applied to. In this example, we specify that the numerical columns should be transformed using the num_pipeline that we defined earlier, and the categorical columns should be transformed using a OneHotEncoder. Finally, we apply this ColumnTransformer to the housing data: it applies each transformer to the appropriate columns and concatenates the outputs along the second axis (the transformers must return the same number of rows).
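As a rough, self-contained illustration of the steps just described, the sketch below rebuilds the pipeline on a tiny stand-in DataFrame. The sample values, the two numerical column names, and the contents of num_pipeline (median imputation followed by standardization) are assumptions made for this example; only the ColumnTransformer structure comes from the text.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Toy stand-in for the housing data (hypothetical values).
housing = pd.DataFrame({
    "median_income": [8.3, 7.2, np.nan, 5.6],
    "housing_median_age": [41.0, 21.0, 52.0, 52.0],
    "ocean_proximity": ["NEAR BAY", "INLAND", "NEAR BAY", "ISLAND"],
})

# Lists of numerical and categorical column names.
num_attribs = ["median_income", "housing_median_age"]
cat_attribs = ["ocean_proximity"]

# Assumed numerical pipeline: fill missing values, then standardize.
num_pipeline = Pipeline([
    ("imputer", SimpleImputer(strategy="median")),
    ("std_scaler", StandardScaler()),
])

# Each tuple is (name, transformer, columns the transformer applies to).
full_pipeline = ColumnTransformer([
    ("num", num_pipeline, num_attribs),
    ("cat", OneHotEncoder(), cat_attribs),
])

# Outputs are concatenated along the second axis:
# 2 numerical columns plus 3 one-hot columns.
housing_prepared = full_pipeline.fit_transform(housing)
print(housing_prepared.shape)
```

Every transformer returns one row per input row, so the row count is unchanged and only the number of columns grows.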

Note that the OneHotEncoder returns a sparse matrix, while the num_pipeline returns a dense matrix. When there is such a mix of sparse and dense matrices, the ColumnTransformer estimates the density of the final matrix (i.e., the ratio of nonzero cells), and it returns a sparse matrix if the density is lower than a given threshold (by default, sparse_threshold=0.3). In this example, it returns a dense matrix. And that's it! We have a preprocessing pipeline that takes the full housing data and applies the appropriate transformations to each column.
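The effect of the threshold can be observed on toy data. In the sketch below, the DataFrame, the column names, and the build helper are hypothetical; the categorical column has ten distinct values, so the stacked output is mostly zeros and falls under the default threshold. Setting the threshold to 0 is used here to force a dense result.

```python
import pandas as pd
from scipy import sparse
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical data: one numerical column and one categorical column
# with ten distinct categories (so the one-hot block is very sparse).
df = pd.DataFrame({
    "value": [float(i) for i in range(10)],
    "category": [f"c{i}" for i in range(10)],
})


def build(threshold):
    """Hypothetical helper: same transformer, different sparse_threshold."""
    return ColumnTransformer(
        [("num", StandardScaler(), ["value"]),
         ("cat", OneHotEncoder(), ["category"])],
        sparse_threshold=threshold,
    )


default_out = build(0.3).fit_transform(df)  # low density: stays sparse
dense_out = build(0.0).fit_transform(df)    # density is never below 0: dense

print(sparse.issparse(default_out), sparse.issparse(dense_out))
```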

Instead of using a transformer, you can specify the string "drop" if you want the columns to be dropped, or you can specify "passthrough" if you want the columns to be left untouched. By default, the remaining columns (i.e., the ones that were not listed) will be dropped, but you can set the remainder hyperparameter to any transformer (or to "passthrough") if you want these columns to be handled differently.
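A small sketch of these options, using a hypothetical four-column DataFrame (the column names a to d and the step names are made up for the example):

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import StandardScaler

# Hypothetical data with four numerical columns.
df = pd.DataFrame({
    "a": [1.0, 2.0, 3.0],
    "b": [10.0, 20.0, 30.0],
    "c": [7.0, 8.0, 9.0],
    "d": [0.5, 0.6, 0.7],
})

# "a" is scaled, "b" is dropped, "c" is left untouched.
# "d" is not listed, so the default remainder="drop" removes it.
ct_default = ColumnTransformer([
    ("scale", StandardScaler(), ["a"]),
    ("skip", "drop", ["b"]),
    ("keep", "passthrough", ["c"]),
])
out_default = ct_default.fit_transform(df)

# Same steps, but unlisted columns are now passed through unchanged.
ct_keep = ColumnTransformer(
    [("scale", StandardScaler(), ["a"]),
     ("skip", "drop", ["b"]),
     ("keep", "passthrough", ["c"])],
    remainder="passthrough",
)
out_keep = ct_keep.fit_transform(df)

print(out_default.shape, out_keep.shape)
```

With the default remainder only the scaled "a" and the untouched "c" survive; with remainder="passthrough" the unlisted "d" is appended after them.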

If you are using Scikit-Learn 0.19 or earlier, you can use a third-party library such as sklearn-pandas, or you can roll out your own custom transformer to get the same functionality as the ColumnTransformer. Alternatively, you can use the FeatureUnion class, which can apply different transformers and concatenate their outputs. But you cannot specify different columns for each transformer; they all apply to the whole data. It is possible to work around this limitation using a custom transformer for column selection (see the Jupyter notebook for an example).
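One possible shape for such a workaround is sketched below. The ColumnSelector class, the toy DataFrame, and the step names are hypothetical and are not taken from the notebook; the sketch also assumes a Scikit-Learn version whose OneHotEncoder accepts string categories. Each branch of the FeatureUnion starts with a selector that keeps only its own columns.

```python
import pandas as pd
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.pipeline import FeatureUnion, Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler


class ColumnSelector(TransformerMixin, BaseEstimator):
    """Hypothetical helper: keep only the named DataFrame columns."""

    def __init__(self, columns):
        self.columns = columns

    def fit(self, X, y=None):
        return self  # nothing to learn

    def transform(self, X):
        return X[self.columns].to_numpy()


# Hypothetical data: one numerical and one categorical column.
df = pd.DataFrame({
    "income": [8.3, 7.2, 5.6],
    "proximity": ["NEAR BAY", "INLAND", "NEAR BAY"],
})

# Each branch selects its columns first, then transforms them.
num_branch = Pipeline([
    ("select", ColumnSelector(["income"])),
    ("scale", StandardScaler()),
])
cat_branch = Pipeline([
    ("select", ColumnSelector(["proximity"])),
    ("encode", OneHotEncoder()),
])

# FeatureUnion feeds the whole DataFrame to both branches
# and concatenates their outputs column-wise.
union = FeatureUnion([("num", num_branch), ("cat", cat_branch)])
prepared = union.fit_transform(df)
print(prepared.shape)
```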

Select and Train a Model

Training and Evaluating on the Training Set

Better Evaluation Using Cross-Validation

Chapter two: End-to-End Machine Learning Project

Fine-Tune Your Model

Grid Search

Randomized Search

Ensemble Methods

Analyze the Best Models and Their Errors

Evaluate Your System on the Test Set

Launch, Monitor, and Maintain Your System

Try It Out!

Exercises

CHAPTER THREE

MNIST

Training a Binary Classifier

Performance Measures

Measuring Accuracy Using Cross-Validation

Implementing Cross-Validation

Confusion Matrix

Equation 3-1. Precision

Equation 3-2. Recall

Precision and Recall

Equation 3-3.

Precision/Recall Trade-off

Chapter three: Classification

The ROC Curve

Multiclass Classification

Error Analysis

Multilabel Classification

Multioutput Classification

Exercises

CHAPTER FOUR

Linear Regression

Computational Complexity

Gradient Descent

Batch Gradient Descent

Convergence Rate

Stochastic Gradient Descent

Mini-batch Gradient Descent

Polynomial Regression

Learning Curves

The Bias/Variance Trade-off

Regularized Linear Models

Ridge Regression

Equation 4-8. Ridge Regression cost function

Equation 4-9. Ridge Regression closed-form solution

Lasso Regression

Equation 4-10. Lasso Regression cost function

Elastic Net

Early Stopping

Logistic Regression

Estimating Probabilities

Training and Cost Function

Decision Boundaries

Softmax Regression

Cross Entropy

Exercises

CHAPTER FIVE

Linear Support Vector Machine Classification

Soft Margin Classification

Nonlinear SVM Classification

Polynomial Kernel

Similarity Features

Equation 5-1. Gaussian RBF

Gaussian RBF Kernel

Computational Complexity

SVM Regression

Under the Hood

Decision Function and Predictions

Training Objective

The Dual Problem

Kernelized SVMs

Overview

An extensive resource focused on machine learning, this book provides readers with practical tools and techniques to effectively apply machine learning concepts using Scikit-Learn, Keras, and TensorFlow. It combines theory with hands-on examples to prepare individuals for real-world applications in building intelligent systems.

Key Points

1. Covers a wide range of machine learning concepts and techniques
2. Employs popular Python libraries like Scikit-Learn, Keras, and TensorFlow
3. Includes hands-on examples and projects for practical implementation
4. Discusses the theoretical foundations behind popular algorithms and practices
5. Aims to prepare readers for real-world machine learning challenges

Details

Authors: Aurélien Géron
Category: Technology and Engineering
