Machine learning

Updated 2024-03-17 (95990c1)

Notes on Machine Learning

This collection of links serves as a comprehensive guide to various aspects of machine learning, from foundational concepts to advanced techniques and frameworks.

Machine Learning Overview

The difference between AI and Machine Learning: Clarifying misconceptions and highlighting the distinctions between AI and ML.
- The difference between AI and Machine Learning

Core Concepts

Data Handling and Generation: Techniques for synthetic data creation are crucial for training models where real data is scarce.
- Synthetic Data Generation: Synthetic data with SDV and Gaussian copulas, Synthetic data with SDV and CTGAN, Synthetic data with SDV and CopulaGAN
Model Explainability: Making machine learning models understandable to humans.
- Explainability
Handling Drift: Maintaining model performance over time by addressing concept, model, and data drift.
- Introduction to Concept Drift in Machine Learning, Model drift, Data drift

Model Development and Evaluation

Optimisation Techniques: Algorithms to improve model performance, including both gradient-based and gradient-free options.
- Gradient Descent, Stochastic Gradient Descent, Stochastic Gradient descent with momentum, Mini-Batch Gradient Descent, Adagrad, RMSProp, AdaDelta, Adam, Gradient-free optimisation
Model Selection and Evaluation: Strategies for selecting the best model and assessing its performance.
- Cross-validation, Kernel functions: Interpretation and applications, Kernel functions
Error and Performance Metrics: Metrics to evaluate model errors and performance.
- Error metrics: Distance metrics
- Performance: Language performance metrics, Model performance metrics

Machine Learning Methods and Techniques

Time-Series Analysis: Analysing and predicting data that changes over time, and detecting anomalies in streaming data.
- Time-series analysis, Streaming anomaly detection
Clustering: Grouping data points based on their similarities.
- K-means clustering
Fairness: Ensuring models do not perpetuate biases.
- Fairness in Machine Learning, Model fairness
Transformations: Preprocessing steps for effective algorithm performance.
- Feature scaling
Recurrent Neural Networks (RNN): Recognising patterns in sequences of data with LSTM networks.
- LSTM

Machine Learning Applications and Frameworks

Supervised and Unsupervised Learning: Differentiating these foundational approaches with examples.
- Supervised methods: Random Forest, Regression: Gaussian Process Regression
- Unsupervised methods: Self-organising maps
Machine Learning Frameworks and Tools: Tools and frameworks for ML development, from model building to deployment.
- KServe, Cookiecutter Data Science, Scikit-learn, Model serving

Fundamental Theory and Statistics

Statistics in Machine Learning: Statistical concepts and methods crucial for ML algorithms and evaluation.
- Statistics, Streaming statistics, Thompson sampling, Statistical dependence

Rui Vieira

Contents

Machine learning

Notes on Machine Learning

Machine Learning Overview

Core Concepts

Model Development and Evaluation

Machine Learning Methods and Techniques

Machine Learning Applications and Frameworks

Fundamental Theory and Statistics