Machine Learning - A First Course for Engineers and Scientists

A new textbook on machine learning

When we developed the course Statistical Machine Learning for engineering students at Uppsala University, we found no appropriate textbook, so we ended up writing our own. It was published by Cambridge University Press in 2022, and you can order printed books from them or through most bookstores.

Book cover

Andreas Lindholm, Niklas Wahlström, Fredrik Lindsten, and Thomas B. Schön

A PDF draft of the book is available here. Latest draft version of the book (older versions >>)

The PDF is now updated. It is similar (although not identical) to the printed book, with the same page and equation numbering etc.

Introduction
- The machine learning problem
- Machine learning concepts via examples
- About this book
Supervised machine learning: a first approach
- Supervised machine learning
- A distance-based method: k-NN
- A rule-based method: Decision trees
Basic parametric models for regression and classification
- Linear regression
- Classification and logistic regression
- Polynomial regression and regularization
- Generalized linear models
Understanding, evaluating and improving the performance
- Expected new data error: performance in production
- Estimating the expected new data error
- The training error–generalization gap decomposition
- The bias-variance decomposition
- Additional tools for evaluating binary classifiers
Learning parametric models
- Principles pf parametric modelling
- Loss functions and likelihood-based models
- Regularization
- Parameter optimization
- Optimization with large datasets
- Hyperparameter optimization
Neural networks and deep learning
- The neural network model
- Training a neural network
- Convolutional neural networks
- Dropout
Ensemble methods: Bagging and boosting
- Bagging
- Random forests
- Boosting and AdaBoost
- Gradient boosting
Nonlinear input transformations and kernels
- Creating features by nonlinear input transformations
- Kernel ridge regdression
- Support vector regression
- Kernel theory
- Support vector classification
The Bayesian approach and Gaussian processes
- The Bayesian idea
- Bayesian linear regression
- The Gaussian process Online material: Gaussian process visualization
- Practial aspects of the Gaussian process
- Other Bayesian methods in machine learning
Generative models and learning from unlabeled data
- The Gaussian mixture model and discriminant analysis
- Cluster analysis
- Deep generative models
- Representation learning and dimensionality reduction
User aspects of machine learning
- Defining the machine learning problem
- Improving a machine learning model
- What if we cannot collect more data?
- Practical data issues
- Can I trust my machine learning model?
Ethics in machine learning (by David Sumpter)
- Fairness and error functions
- Misleading claims about performance
- Limitations of training data

Some reviews

“An authoritative treatment of modern machine learning, covering a broad range of topics, for readers who want to use and understand machine learning.” Bernhard Schölkopf, Max Planck Institute for Intelligent Systems

“This book provides the perfect introduction to modern machine learning, with an ideal balance between mathematical depth and breadth. Its outstanding clarity and many illustrations make it a perfect tool for self-learning or as a textbook for an introductory machine learning class.” Francis Bach, Inria Ecole Normale Supérieure

“Lucid and engaging, this book is a brilliant companion to anyone with a numerate background who wants to know what really goes on under the hood in supervised learning. The core theory and rich illustrative examples enable practitioners navigate the jungle of modern machine learning.” Carl Edward Rasmussen, University of Cambridge

“This book provides an excellent introduction to machine learning for engineers and scientists. It covers the main techniques in this exciting area ranging from basic approaches, such as linear regression and principal component analysis, to modern deep learning and generative modelling techniques. The authors have managed to find the right balance between academic rigor, intuition and applications. Required reading for any newcomer interested in this field!” Arnaud Doucet, University of Oxford

“This book strikes a very good balance between accessibility and rigour. It will be a very good companion for the mathematically trained who want to understand the hows and whats of machine learning.” Ole Winther, University of Copenhagen and Technical University of Denmark

If you want to cite the book, you can cite it as

@book{smlbook,
   author = {Lindholm, Andreas and Wahlstr\"om, Niklas and Lindsten, Fredrik and Sch\"on, Thomas B.},
   year = 2022,
   title = {Machine Learning - A First Course for Engineers and Scientists},
   publisher = {Cambridge University Press},
   URL={https://smlbook.org},
}

Exercise material

Will eventually be added to this page. Meanwhile you may have a look at the material for our course at Uppsala University.

Report mistakes and give feedback

Please report any mistakes or feedback using the gitHub issue tracker (A free GitHub account is required) We appreciate all help in improving the text!

Table of Contents

Some reviews

Exercise material

Report mistakes and give feedback