Quantum Machine Learning

Updated May 2026
Quantum machine learning (QML) explores whether quantum computers can learn from data faster or more effectively than classical computers. The field combines quantum computing primitives like superposition and entanglement with machine learning frameworks like kernel methods, neural networks, and generative models. While theoretical results suggest quantum advantages for specific learning tasks, no practical demonstration has yet shown a quantum machine learning model outperforming the best classical methods on a real-world dataset, making QML one of the most promising but unproven areas of quantum computing research.

Why Quantum Might Help Machine Learning

Machine learning algorithms fundamentally operate by finding patterns in high-dimensional data spaces. A dataset with 1,000 features defines a 1,000-dimensional space, and the learning algorithm must identify the boundaries, clusters, or relationships within this space that correspond to meaningful predictions. Classical algorithms navigate these spaces using linear algebra operations (matrix multiplications, eigenvalue decompositions, gradient computations) applied to data stored in classical memory. The computational cost of these operations grows polynomially with data dimensionality and dataset size, which is manageable for most practical problems using GPUs and TPUs.

Quantum computers could potentially help in two ways. First, they can represent and manipulate exponentially large vector spaces using only polynomially many qubits. A register of 50 qubits spans a Hilbert space of dimension 2^50 (roughly one quadrillion), enabling operations on vectors far larger than what classical hardware can store. If a learning task benefits from working in these exponentially large spaces, quantum computation could provide an exponential advantage. Second, quantum algorithms for specific linear algebra operations (matrix inversion, eigenvalue estimation, sampling) offer polynomial to exponential speedups over classical counterparts under certain conditions, and many ML algorithms reduce to these operations.

The critical question is whether these theoretical advantages translate to practical benefits on real learning tasks. Several factors complicate the picture. Classical ML hardware is extremely fast and highly optimized for the specific operations ML requires. Quantum input/output creates bottlenecks: loading classical data into a quantum state and reading out results both take time that can erase the quantum speedup. And the problems where quantum ML has proven theoretical advantages often involve artificial data distributions specifically designed to be hard for classical methods, with unclear relevance to real-world datasets.

Quantum Kernel Methods

Kernel methods are a class of ML algorithms that map data into a higher-dimensional feature space where linear classification becomes possible, even when the original data is not linearly separable. The kernel trick avoids explicitly computing the high-dimensional mapping by instead computing the inner product (similarity) between all pairs of data points in the feature space, represented as a kernel matrix. Classical kernels like the radial basis function (RBF) kernel map data into infinite-dimensional spaces, but the effective dimensionality of the learned features is limited by the kernel's functional form.

Quantum kernel methods use a quantum circuit to map classical data points into quantum states in the 2^N-dimensional Hilbert space of N qubits, then estimate the kernel (inner product between quantum states) by measuring the overlap between the circuit outputs for different data points. The quantum feature space is exponentially large and has a specific structure determined by the circuit architecture, potentially capturing patterns that classical kernels miss. The kernel matrix computed on the quantum processor is then fed to a classical support vector machine or other kernel-based classifier for training.

Theoretical work by Huang, Kueng, and Preskill (2021) showed that quantum kernels can provide exponential advantages for learning certain data distributions, but only when the data has specific quantum structure. For most naturally occurring datasets, classical kernels perform comparably. The practical challenge is identifying real-world problems where the data happens to have the structure that quantum kernels exploit. Experiments on current quantum hardware have demonstrated quantum kernel estimation on small datasets, but the noisy gate operations and limited qubit counts prevent scaling to datasets large enough to be practically interesting.

Variational Quantum Classifiers and Quantum Neural Networks

Variational quantum classifiers use parameterized quantum circuits as the model, analogous to how classical neural networks use parameterized layers of weights and biases. A classical data point is encoded into the initial quantum state (through rotation gates whose angles are set by the data features), the parameterized circuit transforms the state, and the output is measured to produce a classification prediction. The circuit parameters are optimized using a classical optimizer that minimizes a loss function computed from the measurement results.

These models are often called quantum neural networks (QNNs), though they differ from classical neural networks in fundamental ways. Classical neural networks gain expressivity from depth (many layers) and width (many neurons per layer). Quantum circuits gain expressivity from entanglement and the exponential dimension of the Hilbert space they operate in. A shallow quantum circuit on 10 qubits can represent states in a 1024-dimensional space, while a classical neural network with 10 neurons per layer operates in a 10-dimensional space per layer. However, the quantum circuit's output is probabilistic (requiring many measurement shots for accurate estimation), and the measurement process collapses the exponentially large state to a single bit string, creating an information bottleneck.

The barren plateau problem is a fundamental challenge for training variational quantum models. As the number of qubits grows, the cost function landscape becomes exponentially flat for randomly initialized circuits, meaning the gradients vanish and the optimizer cannot find useful directions for improvement. This is somewhat analogous to the vanishing gradient problem in deep classical networks, but more severe: in quantum circuits, the gradient magnitude decreases exponentially with qubit count for generic circuit architectures. Mitigation strategies include using problem-specific circuit architectures that avoid barren plateaus, layer-wise training, and initialization methods that start near the solution.

Quantum Speedups for Classical ML Subroutines

Several quantum algorithms accelerate specific subroutines commonly used in classical ML. The HHL algorithm (Harrow, Hassidim, Lloyd, 2009) solves systems of linear equations exponentially faster than classical methods when only global properties of the solution are needed. Since many ML tasks reduce to solving linear systems (linear regression, least squares, Gaussian processes), HHL could theoretically accelerate these tasks. However, the speedup requires the data to be loaded into a quantum state efficiently (a non-trivial requirement), and reading out the full solution vector eliminates the advantage.

Quantum principal component analysis (qPCA) can estimate the principal components of a density matrix exponentially faster than classical PCA. Quantum recommendation systems can provide recommendations in time logarithmic in the number of users and items, compared to polynomial time classically. Quantum sampling algorithms can generate samples from certain probability distributions that are hard for classical computers, potentially benefiting generative models and Bayesian inference.

A recurring pattern in these results is that the quantum speedup often comes with caveats that limit practical applicability. The data loading problem (getting classical data into quantum form) often requires time comparable to the classical computation being replaced. The quantum random access memory (qRAM) required by many algorithms is a theoretical construct that has not been built at scale. And improved classical algorithms sometimes close the gap: after the quantum recommendation algorithm was published, a classical algorithm achieving comparable performance was discovered, demonstrating that the quantum advantage was partly an artifact of comparing against a suboptimal classical baseline.

Classical ML for Quantum Computing

The most productive current intersection of quantum computing and machine learning may be the reverse direction: using classical ML to improve quantum computing itself. Machine learning models optimize quantum gate calibration, finding the precise pulse shapes and timings that maximize gate fidelity. Neural network decoders process error correction syndrome data faster and more accurately than traditional minimum-weight perfect matching decoders. Reinforcement learning agents discover optimal quantum control sequences and circuit compilation strategies.

Generative models trained on quantum measurement data can learn the probability distribution of a quantum system's output, enabling efficient estimation of quantum state properties without full state tomography. Classical neural networks predict the outcomes of quantum chemical simulations, serving as surrogates that guide where to focus expensive quantum computation. These applications are valuable now, using today's classical ML hardware to make today's quantum hardware more useful.

The long-term vision for quantum machine learning involves fault-tolerant quantum computers with thousands of logical qubits processing quantum data that is inherently quantum in origin, such as outputs from quantum sensors, quantum communication networks, or quantum chemistry simulations. In this regime, the data loading problem disappears because the data is already quantum, and the exponential advantage of quantum processing is most clearly justified. This quantum-data-in, quantum-processing, quantum-data-out paradigm may be where QML ultimately finds its strongest applications, rather than in accelerating the processing of classical datasets.

Key Takeaway

Quantum machine learning offers theoretical speedups for specific learning tasks through exponentially large feature spaces and faster linear algebra, but practical quantum advantage over highly optimized classical ML methods on real-world datasets has not yet been demonstrated, making QML a promising but unproven research direction.