Machine Brief

AI Glossary

Your guide to AI and machine learning terminology. Every term, from transformers and attention to RLHF and fine-tuning, is explained in plain language.

178 terms found

A

Activation Function

A mathematical function applied to a neuron's output that introduces non-linearity into the network.
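As a rough illustration of the definition above, here is a minimal sketch of two common activation functions applied to a few example values. The function names and the sample inputs are chosen for illustration only.

```python
import math

def relu(x: float) -> float:
    # Passes positive values through unchanged and clips negatives to zero.
    return max(0.0, x)

def sigmoid(x: float) -> float:
    # Squashes any real number into the open interval (0, 1).
    return 1.0 / (1.0 + math.exp(-x))

# Illustrative neuron outputs before the activation is applied.
pre_activations = [-2.0, 0.0, 3.0]
relu_out = [relu(v) for v in pre_activations]
sigmoid_out = [sigmoid(v) for v in pre_activations]
```

Without a non-linear step like this, stacked layers would collapse into a single linear transformation.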

Adam Optimizer

An optimization algorithm that combines the strengths of two earlier methods, AdaGrad and RMSProp.

AGI

Artificial General Intelligence.

AI Agent

An autonomous AI system that can perceive its environment, make decisions, and take actions to achieve goals.

AI Alignment

The research field focused on making sure AI systems do what humans actually want them to do.

AI Safety

The broad field studying how to build AI systems that are safe, reliable, and beneficial.

Anthropic

An AI safety company founded in 2021 by former OpenAI researchers, including Dario and Daniela Amodei.

Artificial Intelligence

The science of creating machines that can perform tasks requiring human-like intelligence, such as reasoning, learning, perception, language understanding, and decision-making.

ASI

Artificial Superintelligence.

Attention

A mechanism that lets neural networks focus on the most relevant parts of their input when producing output.

Autoencoder

A neural network trained to compress input data into a smaller representation and then reconstruct it.

Autonomous AI

AI systems capable of operating independently for extended periods without human intervention.

Autoregressive Model

A model that generates output one piece at a time, with each new piece depending on all the previous ones.

B

Backpropagation

The algorithm that makes neural network training possible by computing, via the chain rule, how much each weight contributed to the error.

Batch Normalization

A technique that normalizes the inputs to each layer in a neural network, making training faster and more stable.

Batch Size

The number of training examples processed together before the model updates its weights.

Beam Search

A decoding strategy that keeps track of multiple candidate sequences at each step instead of just picking the single best option.
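The decoding strategy described above can be sketched on a toy problem. This is a minimal sketch, assuming a hypothetical lookup table (`TOY_MODEL`) in place of a real language model, where the next-token probabilities depend only on the last token.

```python
import math

# Hypothetical toy "model": next-token probabilities keyed by the last token.
TOY_MODEL = {
    "<s>": {"a": 0.6, "b": 0.4},
    "a": {"a": 0.4, "b": 0.3, "</s>": 0.3},
    "b": {"a": 0.05, "b": 0.05, "</s>": 0.9},
}

def beam_search(beam_width: int, max_steps: int):
    # Each beam is (tokens so far, cumulative log-probability).
    beams = [(["<s>"], 0.0)]
    for _ in range(max_steps):
        candidates = []
        for tokens, score in beams:
            if tokens[-1] == "</s>":
                # Finished sequences are carried forward unchanged.
                candidates.append((tokens, score))
                continue
            for tok, p in TOY_MODEL[tokens[-1]].items():
                candidates.append((tokens + [tok], score + math.log(p)))
        # Keep only the highest-scoring candidates.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
    return beams

best_tokens, best_logp = beam_search(beam_width=2, max_steps=3)[0]
```

In this toy setup a greedy decoder would pick "a" first because it has the higher first-step probability, while a beam of width 2 keeps "b" alive long enough to find the more probable complete sequence.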

Benchmark

A standardized test used to measure and compare AI model performance.

BERT

Bidirectional Encoder Representations from Transformers.

Bias

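In AI, bias has two meanings: a learnable offset term added inside a neural network layer, and a systematic skew or unfairness in a model's data or outputs.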

BPE

Byte Pair Encoding.

C

Catastrophic Forgetting

When a neural network trained on new data suddenly loses its ability to perform well on previously learned tasks.

Chain of Thought

A prompting technique where you ask an AI model to show its reasoning step by step before giving a final answer.

Chatbot

An AI system designed to have conversations with humans through text or voice.

Chinchilla

A 2022 model and research paper from DeepMind showing that most large language models of the time were oversized and undertrained for their compute budget.

Classification

A machine learning task where the model assigns input data to predefined categories.

Claude

Anthropic's family of AI assistants, including Claude Haiku, Sonnet, and Opus.

CLIP

Contrastive Language-Image Pre-training.

CNN

Convolutional Neural Network.

Compute

The processing power needed to train and run AI models.

Computer Vision

The field of AI focused on enabling machines to interpret and understand visual information from images and video.

Constitutional AI

An approach developed by Anthropic where an AI system is trained to follow a set of principles (a 'constitution') rather than relying solely on human feedback for every decision.

Context Window

The maximum amount of text a language model can process at once, measured in tokens.

Contrastive Learning

A self-supervised learning approach where the model learns by comparing similar and dissimilar pairs of examples.

Conversational AI

AI systems designed for natural, multi-turn dialogue with humans.

Cross-Attention

An attention mechanism where one sequence attends to a different sequence.

CUDA

NVIDIA's parallel computing platform that lets developers use GPUs for general-purpose computing.

D

DALL-E

OpenAI's text-to-image generation model.

Data Augmentation

Techniques for artificially expanding training datasets by creating modified versions of existing data.

Data Poisoning

Deliberately corrupting training data to manipulate a model's behavior.

Decoder

The part of a neural network that generates output from an internal representation.

Deep Learning

A subset of machine learning that uses neural networks with many layers (hence 'deep') to learn complex patterns from large amounts of data.

Deepfake

AI-generated media that realistically depicts a person saying or doing something they never actually did.

DeepMind

A leading AI research lab, now part of Google.

Diffusion Model

A generative AI model that creates data by learning to reverse a gradual noising process.

Distillation

A technique where a smaller 'student' model learns to mimic a larger 'teacher' model.

DPO

Direct Preference Optimization.

Dropout

A regularization technique that randomly deactivates a percentage of neurons during training.
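A minimal sketch of the idea, assuming the common "inverted dropout" variant in which surviving activations are scaled up so their expected value is unchanged. The function name and the seeded random generator are illustrative choices.

```python
import random

def dropout(activations, drop_prob, rng):
    # Each activation is kept with probability (1 - drop_prob).
    # Kept values are divided by keep_prob so the expected sum stays the same.
    keep_prob = 1.0 - drop_prob
    return [a / keep_prob if rng.random() < keep_prob else 0.0 for a in activations]

rng = random.Random(0)  # seeded only to make the sketch repeatable
dropped = dropout([1.0] * 10, drop_prob=0.5, rng=rng)
```

At inference time the function is simply skipped, so every neuron contributes.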

E

Edge AI

Running AI models directly on local devices (phones, laptops, IoT devices) instead of in the cloud.

Embedding

A dense numerical representation of data (words, images, etc.).

Emergent Abilities

Capabilities that appear suddenly as language models reach certain sizes.

Emergent Behavior

Capabilities that appear in AI models at scale without being explicitly trained for.

Encoder

The part of a neural network that processes input data into an internal representation.

Encoder-Decoder

A neural network architecture with two parts: an encoder that processes the input into a representation, and a decoder that generates the output from that representation.

Epoch

One complete pass through the entire training dataset.

Ethical AI

The practice of developing AI systems that are fair, transparent, and accountable, and that respect human rights.

Evaluation

The process of measuring how well an AI model performs on its intended task.

Explainability

The ability to understand and explain why an AI model made a particular decision.

F

Feature Extraction

The process of identifying and pulling out the most important characteristics from raw data.

Federated Learning

A training approach where the model learns from data spread across many devices without that data ever leaving those devices.

Few-Shot Learning

The ability of a model to learn a new task from just a handful of examples, often provided in the prompt itself.

Fine-Tuning

The process of taking a pre-trained model and continuing to train it on a smaller, specific dataset to adapt it for a particular task or domain.

Flash Attention

An optimized attention algorithm that's mathematically equivalent to standard attention but runs much faster and uses less GPU memory.

Foundation Model

A large AI model trained on broad data that can be adapted for many different tasks.

Function Calling

A capability that lets language models interact with external tools and APIs by generating structured function calls.

G

GAN

Generative Adversarial Network.

GELU

Gaussian Error Linear Unit.

Gemini

Google's flagship multimodal AI model family, developed by Google DeepMind.

Generative AI

AI systems that create new content (text, images, audio, video, or code) rather than just analyzing or classifying existing data.

GPT

Generative Pre-trained Transformer.

GPU

Graphics Processing Unit.

Gradient Accumulation

A technique that simulates larger batch sizes by accumulating gradients over multiple forward and backward passes before updating weights.
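To see why accumulation can stand in for a larger batch, here is a minimal sketch on a one-parameter linear model with a mean-squared-error loss. The data points, the helper name `mse_grad`, and the equal-sized micro-batches are all assumptions made for illustration.

```python
def mse_grad(w, batch):
    # Gradient of mean((w * x - y) ** 2) with respect to w.
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)

# Illustrative (x, y) pairs; not taken from any real dataset.
data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.2)]
w = 0.5

# Gradient computed on the full batch in one pass.
full_batch_grad = mse_grad(w, data)

# The same gradient built up from two equal-sized micro-batches.
micro_batches = [data[:2], data[2:]]
accumulated_grad = 0.0
for micro_batch in micro_batches:
    accumulated_grad += mse_grad(w, micro_batch) / len(micro_batches)
```

The two gradients match when the micro-batches are the same size, so a single weight update after accumulation behaves like one update on the larger batch while using less memory per pass.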

Gradient Descent

The fundamental optimization algorithm used to train neural networks.
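A minimal sketch of the update rule on a one-dimensional toy problem, assuming the loss f(w) = (w - 3)^2 and a fixed learning rate of 0.1. Both are illustrative choices, not values from a real training run.

```python
def loss_gradient(w: float) -> float:
    # Derivative of the toy loss f(w) = (w - 3) ** 2.
    return 2.0 * (w - 3.0)

w = 0.0              # starting guess
learning_rate = 0.1  # step size, chosen for illustration
for _ in range(100):
    # Step in the direction that reduces the loss.
    w -= learning_rate * loss_gradient(w)
```

After the loop, `w` sits very close to 3, the value that minimizes the toy loss.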

Grounding

Connecting an AI model's outputs to verified, factual information sources.

Guardrails

Safety measures built into AI systems to prevent harmful, inappropriate, or off-topic outputs.

H

Hallucination

When an AI model generates confident-sounding but factually incorrect or completely fabricated information.

Hallucination Detection

Methods for identifying when an AI model generates false or unsupported claims.

Hugging Face

The leading platform for sharing and collaborating on AI models, datasets, and applications.

Hyperparameter

A setting you choose before training begins, as opposed to parameters the model learns during training.

I

Image Classification

The task of assigning a label to an image from a set of predefined categories.

ImageNet

A massive image dataset containing over 14 million labeled images across 20,000+ categories.

In-Context Learning

A model's ability to learn new tasks simply from examples provided in the prompt, without any weight updates.

Inference

Running a trained model to make predictions on new data.

Instruction Tuning

Fine-tuning a language model on datasets of instructions paired with appropriate responses.

J

Jailbreak

A technique for bypassing an AI model's safety restrictions and guardrails.

K

Knowledge Distillation

Training a smaller model to replicate the behavior of a larger one.

Knowledge Graph

A structured representation of information as a network of entities and their relationships.

L

Language Model

An AI model that understands and generates human language.

Large Language Model

An AI model with billions of parameters trained on massive text datasets.

Latent Space

The compressed, internal representation space where a model encodes data.

Layer Normalization

A technique that normalizes activations across the features of each training example, rather than across the batch.
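A minimal sketch of the normalization step for a single example, assuming no learned scale or shift parameters (real layers usually add both). The function name and the `eps` default are illustrative.

```python
import math

def layer_norm(features, eps=1e-5):
    # Statistics are computed over the features of ONE example,
    # not over other examples in the batch.
    mean = sum(features) / len(features)
    variance = sum((f - mean) ** 2 for f in features) / len(features)
    return [(f - mean) / math.sqrt(variance + eps) for f in features]

normalized = layer_norm([1.0, 2.0, 3.0, 4.0])
```

The output has a mean of roughly zero and a variance of roughly one, regardless of what else is in the batch.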

Learning Rate

A hyperparameter that controls how much the model's weights change in response to each update.

LLaMA

Meta's family of open-weight large language models.

LLM

Large Language Model.

LoRA

Low-Rank Adaptation.

Loss Function

A mathematical function that measures how far the model's predictions are from the correct answers.
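Two widely used examples can be sketched in a few lines: mean squared error for numeric predictions and cross-entropy for predicted class probabilities. The function names and sample values are illustrative assumptions.

```python
import math

def mean_squared_error(predictions, targets):
    # Average of squared differences; larger misses are penalized more.
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)

def cross_entropy(predicted_probs, true_index):
    # Negative log of the probability assigned to the correct class.
    return -math.log(predicted_probs[true_index])

mse_value = mean_squared_error([1.0, 2.0], [1.0, 4.0])
ce_confident = cross_entropy([0.1, 0.9], true_index=1)
ce_unsure = cross_entropy([0.5, 0.5], true_index=1)
```

In both cases a lower value means the predictions are closer to the correct answers.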

LSTM

Long Short-Term Memory.

M

Machine Learning

A branch of AI where systems learn patterns from data instead of following explicitly programmed rules.

Masked Language Modeling

A pre-training technique where random words in text are hidden (masked) and the model learns to predict them from context.

Meta-Learning

Training models that learn how to learn: after training on many tasks, they can quickly adapt to new tasks with very little data.

Midjourney

A popular AI image generation service known for its distinctive artistic style.

Mistral

A French AI company that builds efficient, high-performance language models.

Mixture of Experts

An architecture where multiple specialized sub-networks (experts) share a model, but only a few activate for each input.

MMLU

Massive Multitask Language Understanding.

Model Collapse

A degradation that happens when AI models are trained on data generated by other AI models.

Multi-Head Attention

An extension of the attention mechanism that runs multiple attention operations in parallel, each with different learned projections.

Multimodal

AI models that can understand and generate multiple types of data, such as text, images, audio, and video.

N

Narrow AI

AI systems designed for a specific task, as opposed to general intelligence.

Natural Language Processing

The field of AI focused on enabling computers to understand, interpret, and generate human language.

Neural Network

A computing system loosely inspired by biological brains, consisting of interconnected nodes (neurons) organized in layers.

Next-Token Prediction

The fundamental task that language models are trained on: given a sequence of tokens, predict what comes next.

NLP

Natural Language Processing.

NVIDIA

The dominant provider of AI hardware.

O

Object Detection

A computer vision task that identifies and locates objects within an image, drawing bounding boxes around each one.

Open Source AI

AI models whose weights, code, and sometimes training data are publicly released for anyone to use, modify, and build upon.

OpenAI

The AI company behind ChatGPT, GPT-4, DALL-E, and Whisper.

Optimization

The process of finding the best set of model parameters by minimizing a loss function.

Overfitting

When a model memorizes the training data so well that it performs poorly on new, unseen data.

P

Parameter

A value the model learns during training: specifically, the weights and biases in neural network layers.

Perplexity

A measurement of how well a language model predicts text.
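Perplexity is conventionally computed as the exponential of the average negative log-probability the model assigned to each actual token. A minimal sketch, assuming the per-token probabilities are already available as a list (the values here are made up):

```python
import math

def perplexity(token_probs):
    # token_probs: probability the model gave to each token that actually occurred.
    avg_neg_log_likelihood = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(avg_neg_log_likelihood)

# A model that gives every correct token probability 0.25 is, on average,
# as uncertain as a uniform choice among 4 options.
ppl = perplexity([0.25, 0.25, 0.25, 0.25])
```

Lower perplexity means the model found the text less surprising.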

Positional Encoding

Information added to token embeddings to tell a transformer the order of elements in a sequence.

Pre-Training

The initial, expensive phase of training where a model learns general patterns from a massive dataset.

Prompt Engineering

The art and science of crafting inputs to AI models to get the best possible outputs.

Prompting

Giving an AI model a text input (a prompt) to direct its behavior.

PyTorch

A widely used open-source deep learning framework, originally developed by Meta.

Q

Quantization

Reducing the precision of a model's numerical values, for example from 32-bit to 4-bit numbers.
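A minimal sketch of one simple scheme, symmetric 8-bit quantization with a single scale factor. It uses 8 bits rather than the 4-bit example in the definition to keep the arithmetic easy to follow; the weights and function names are illustrative, and the sketch assumes at least one non-zero value.

```python
def quantize_int8(values):
    # Map the largest magnitude to 127 and round everything else to integers.
    scale = max(abs(v) for v in values) / 127
    quantized = [round(v / scale) for v in values]
    return quantized, scale

def dequantize(quantized, scale):
    # Recover approximate floating-point values.
    return [q * scale for q in quantized]

weights = [0.12, -0.5, 0.33, 0.9]  # made-up example weights
q_weights, scale = quantize_int8(weights)
restored = dequantize(q_weights, scale)
```

The restored values differ from the originals by at most half a quantization step, which is the precision traded away for a smaller model.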

R

RAG

Retrieval-Augmented Generation.

Reasoning

The ability of AI models to draw conclusions, solve problems logically, and work through multi-step challenges.

Recurrent Neural Network

A neural network architecture where connections form loops, letting the network maintain a form of memory across sequences.

Red Teaming

Systematically testing an AI system by trying to make it produce harmful, biased, or incorrect outputs.

Regression

A machine learning task where the model predicts a continuous numerical value.

Regularization

Techniques that prevent a model from overfitting by adding constraints during training.

Reinforcement Learning

A learning approach where an agent learns by interacting with an environment and receiving rewards or penalties.

ReLU

Rectified Linear Unit.

Representation Learning

The idea that useful AI comes from learning good internal representations of data.

Responsible AI

The practice of developing and deploying AI systems with careful attention to fairness, transparency, safety, privacy, and social impact.

Reward Model

A model trained to predict how helpful, harmless, and honest a response is, based on human preferences.

RLHF

Reinforcement Learning from Human Feedback.

RNN

Recurrent Neural Network.

RoPE

Rotary Position Embedding.

S

Sampling

The process of selecting the next token from the model's predicted probability distribution during text generation.

Scaling Laws

Mathematical relationships showing how AI model performance improves predictably with more data, compute, and parameters.

Self-Attention

An attention mechanism where a sequence attends to itself: each element looks at all other elements to understand relationships.
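A minimal sketch of scaled dot-product self-attention, with one loud simplification: the token vectors themselves serve as queries, keys, and values, whereas real transformer layers first apply separate learned projections. The token vectors are made up for illustration.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of scores.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens):
    # tokens: list of equal-length vectors, one per sequence position.
    dim = len(tokens[0])
    outputs = []
    for query in tokens:
        # Similarity of this position to every position, itself included.
        scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim) for key in tokens]
        weights = softmax(scores)
        # Output is a weighted average of all token vectors.
        outputs.append([sum(w * value[c] for w, value in zip(weights, tokens)) for c in range(dim)])
    return outputs

tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(tokens)
```

Each output vector blends information from the whole sequence, weighted by how similar each other position is to the current one.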

Self-Supervised Learning

A training approach where the model creates its own labels from the data itself.

Semantic Search

Search that understands meaning and intent rather than just matching keywords.

Sentiment Analysis

Automatically determining whether a piece of text expresses positive, negative, or neutral sentiment.

Softmax

A function that converts a vector of numbers into a probability distribution: all values lie between 0 and 1 and sum to 1.
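A minimal sketch of the function, using the usual trick of subtracting the maximum before exponentiating to avoid overflow. The input scores are made up.

```python
import math

def softmax(scores):
    # Subtracting the max leaves the result unchanged but keeps exp() stable.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax([2.0, 1.0, 0.1])
```

Larger inputs receive larger probabilities, and the ordering of the inputs is preserved.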

Speech Recognition

Converting spoken audio into written text.

Stable Diffusion

An open-source image generation model released by Stability AI.

Structured Output

Getting a language model to generate output in a specific format like JSON, XML, or a database schema.

Supervised Learning

The most common machine learning approach: training a model on labeled data where each example comes with the correct answer.

Synthetic Data

Artificially generated data used for training AI models.

System Prompt

Instructions given to an AI model that define its role, personality, constraints, and behavior rules.

T

Temperature

A parameter that controls the randomness of a language model's output.
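In most implementations, temperature works by dividing the model's raw scores (logits) by the temperature value before the softmax. A minimal sketch under that assumption, with made-up logits:

```python
import math

def softmax_with_temperature(logits, temperature):
    # Lower temperature exaggerates differences; higher temperature flattens them.
    scaled = [logit / temperature for logit in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]
cold = softmax_with_temperature(logits, temperature=0.5)
warm = softmax_with_temperature(logits, temperature=2.0)
```

With the lower temperature, the top token takes a larger share of the probability, so sampled output is more predictable. With the higher temperature, the distribution is flatter and output is more varied.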

TensorFlow

Google's open-source deep learning framework.

Text-to-Image

AI models that generate images from text descriptions.

Text-to-Speech

AI systems that convert written text into natural-sounding spoken audio.

Token

The basic unit of text that language models work with.

Tokenizer

The component that converts raw text into tokens that a language model can process.

Tool Use

The ability of AI models to interact with external tools and systems, such as browsing the web, running code, querying APIs, and reading files.

Top-P Sampling

A text generation method (also called nucleus sampling) that samples only from the smallest set of most-likely tokens whose cumulative probability reaches a threshold P.
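A minimal sketch of the filtering step in nucleus sampling: tokens are ranked by probability, the most likely ones are kept until their cumulative probability reaches P, and the survivors are renormalized. The vocabulary and probabilities are made up, and the function name is illustrative.

```python
def top_p_filter(token_probs, p):
    # Rank tokens from most to least likely.
    ranked = sorted(token_probs.items(), key=lambda item: item[1], reverse=True)
    kept, cumulative = {}, 0.0
    for token, prob in ranked:
        kept[token] = prob
        cumulative += prob
        if cumulative >= p:
            break  # the nucleus is complete
    # Renormalize so the kept probabilities sum to 1.
    total = sum(kept.values())
    return {token: prob / total for token, prob in kept.items()}

nucleus = top_p_filter({"the": 0.5, "a": 0.3, "cat": 0.15, "zebra": 0.05}, p=0.75)
```

Here only "the" and "a" survive, so the unlikely tail is never sampled. The next token would then be drawn at random from the renormalized nucleus.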

TPU

Tensor Processing Unit.

Training

The process of teaching an AI model by exposing it to data and adjusting its parameters to minimize errors.

Transfer Learning

Using knowledge learned from one task to improve performance on a different but related task.

Transformer

The neural network architecture behind virtually all modern AI language models.

Turing Test

A test proposed by Alan Turing in 1950: if a human can't reliably tell whether they're talking to a machine or another human, the machine passes.

U

Underfitting

When a model is too simple to capture the patterns in the data, performing poorly on both training and test sets.

Unsupervised Learning

Machine learning on data without labels, where the model finds patterns and structure on its own.

V

VAE

Variational Autoencoder.

Vector Database

A database optimized for storing and searching high-dimensional vectors (embeddings).

Vision Transformer

A transformer architecture adapted for image processing.

Voice Cloning

Using AI to create a synthetic copy of someone's voice from a small sample of their speech.

W

Weight

A numerical value in a neural network that determines the strength of the connection between neurons.

Whisper

OpenAI's open-source speech recognition model.

Word2Vec

One of the earliest successful word embedding models, from Google in 2013.

World Model

An AI system's internal representation of how the world works, including physics, cause and effect, and spatial relationships.

Y

YOLO

You Only Look Once.

Z

Zero-Shot Learning

A model's ability to perform a task it was never explicitly trained on, with no examples provided.

2026 Machine Brief. All rights reserved.