Machine learning
Notes on Machine Learning
This collection of links serves as a comprehensive guide to various aspects of machine learning, from foundational concepts to advanced techniques and frameworks.
Machine Learning Overview
- The difference between AI and Machine Learning: Clarifying misconceptions and highlighting the distinctions between AI and ML.
Core Concepts
- Data Handling and Generation: Techniques for synthetic data creation are crucial for training models where real data is scarce.
- Synthetic Data Generation: Synthetic data with SDV and Gaussian copulas, Synthetic data with SDV and CTGAN, Synthetic data with SDV and CopulaGAN
- Model Explainability: Making machine learning models understandable to humans.
- Handling Drift: Maintaining model performance over time by addressing concept, model, and data drift.
Model Development and Evaluation
- Optimisation Techniques: Algorithms to improve model performance, including both gradient-based and gradient-free options.
- Gradient Descent, Stochastic Gradient Descent, Stochastic Gradient descent with momentum, Mini-Batch Gradient Descent, Adagrad, RMSProp, AdaDelta, Adam, Gradient-free optimisation
- Model Selection and Evaluation: Strategies for selecting the best model and assessing its performance.
- Cross-validation, Kernel functions: Interpretation and applications, Kernel functions
- Error and Performance Metrics: Metrics to evaluate model errors and performance.
- Error metrics: Distance metrics
- Performance: Language performance metrics, Model performance metrics
Machine Learning Methods and Techniques
- Time-Series Analysis: Analysing and predicting data that changes over time, and detecting anomalies in streaming data.
- Clustering: Grouping data points based on their similarities.
- Fairness: Ensuring models do not perpetuate biases.
- Transformations: Preprocessing steps for effective algorithm performance.
- Recurrent Neural Networks (RNN): Recognising patterns in sequences of data with LSTM networks.
Machine Learning Applications and Frameworks
- Supervised and Unsupervised Learning: Differentiating these foundational approaches with examples.
- Supervised methods: Random Forest, Regression: Gaussian Process Regression
- Unsupervised methods: Self-organising maps
- Machine Learning Frameworks and Tools: Tools and frameworks for ML development, from model building to deployment.
- KServe, Cookiecutter Data Science, Scikit-learn, Model serving
Fundamental Theory and Statistics
- Statistics in Machine Learning: Statistical concepts and methods crucial for ML algorithms and evaluation.