Machine learning

Machine learning (ML) is a branch of computer science in which systems learn from data to improve their performance on a task, without being explicitly programmed with rules. Rather than following hand-written instructions, a machine learning model is trained on examples and learns to generalise from them.

ML encompasses a wide range of techniques and problem types:

  • Supervised learning — The model is trained on labelled examples (input/output pairs) to learn a mapping from inputs to outputs. Used for classification (e.g. spam detection, image recognition) and regression (e.g. predicting house prices).

  • Unsupervised learning — The model finds structure in unlabelled data. Used for clustering, dimensionality reduction, and anomaly detection.

  • Reinforcement learning — An agent learns by interacting with an environment and receiving rewards or penalties. Used in robotics, game-playing, and fine-tuning language models.
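
The difference between the first two categories can be made concrete with a small sketch. The Python below is an illustration only, not a standard implementation: the data are made up, and the helper names fit_line and two_means are hypothetical. It uses only the standard library. The supervised part fits a straight line to labelled input/output pairs and then predicts the output for an unseen input; the unsupervised part receives values with no labels and groups them into two clusters. Reinforcement learning is left out to keep the example short.

```python
# Toy illustration only: made-up data and hypothetical helper names.


def fit_line(xs, ys):
    """Supervised learning: least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var
    intercept = mean_y - slope * mean_x
    return slope, intercept


def two_means(values, iterations=10):
    """Unsupervised learning: split 1-D values into two clusters (k-means with k=2)."""
    # Start with one centre at each extreme of the data.
    centres = [min(values), max(values)]
    for _ in range(iterations):
        clusters = [[], []]
        for v in values:
            # Assign each value to the nearer of the two centres.
            nearest = 0 if abs(v - centres[0]) <= abs(v - centres[1]) else 1
            clusters[nearest].append(v)
        # Move each centre to the mean of its cluster (unchanged if the cluster is empty).
        centres = [sum(c) / len(c) if c else centres[i] for i, c in enumerate(clusters)]
    return centres, clusters


# Labelled examples: floor area -> price (made-up numbers).
areas = [50, 80, 100, 120]
prices = [150, 240, 300, 360]
slope, intercept = fit_line(areas, prices)
predicted = slope * 90 + intercept  # prediction for an input not seen in training

# Unlabelled data: the algorithm has to find the two groups on its own.
readings = [1.0, 1.2, 0.9, 8.0, 8.3, 7.9]
centres, clusters = two_means(readings)
```

In the supervised case the correct outputs are supplied during training, so the fitted line can be checked against them. In the unsupervised case no correct grouping is given, and the two clusters emerge only from the distances between the values.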

Generative AI is a subfield of machine learning focused on models that produce new content — text, images, audio, and video — rather than classifying or predicting from existing data. Large language models (LLMs), such as GPT and Claude, are the most prominent example. They are neural networks trained using ML techniques on vast text datasets to generate human-like language. Modern LLMs are built on the transformer architecture.
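
The idea of generating new text from patterns in training text can be sketched with a deliberately simple model. The Python below is a toy bigram model, not a neural network and not a transformer: it records which word follows which in a tiny made-up corpus, then produces a new sequence by repeatedly sampling a next word. The function names train_bigram and generate and the corpus are hypothetical and chosen for illustration. LLMs also generate text one token at a time, but they learn far richer statistics than this with a neural network.

```python
import random
from collections import defaultdict


def train_bigram(text):
    """Record, for every word, the words that followed it in the training text."""
    words = text.split()
    followers = defaultdict(list)
    for current, nxt in zip(words, words[1:]):
        followers[current].append(nxt)
    return followers


def generate(followers, start, length, rng):
    """Generate up to `length` words by repeatedly sampling a next word."""
    out = [start]
    for _ in range(length - 1):
        options = followers.get(out[-1])
        if not options:
            break  # the last word never had a successor in the training text
        out.append(rng.choice(options))
    return out


# Made-up training text; real models are trained on vastly larger datasets.
corpus = "the cat sat on the mat and the dog sat on the rug"
model = train_bigram(corpus)

# A seeded generator keeps the sample reproducible between runs.
sample = generate(model, "the", 8, random.Random(0))
print(" ".join(sample))
```

Every adjacent pair of words in the output occurred somewhere in the training text, but the sequence as a whole need not have, which is the sense in which even this minimal model produces new content.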

