What Is Transfer Learning?

Updated May 2026
Transfer learning is the practice of taking a model trained on one task and adapting it to a different but related task. Instead of training from scratch, you start with a model that already knows useful patterns, then adjust it for your specific problem. This approach reduces the data, compute, and time needed for new tasks, and it is now the default method for building AI applications.

Why Transfer Learning Works

The key insight behind transfer learning is that many tasks share common features. A neural network trained to classify photos of animals learns to detect edges in its first layers, textures in its middle layers, and body parts in its later layers. Those edge and texture detectors are useful for almost any image task, not just animal classification. A medical imaging model does not need to learn what an edge is from scratch. It can borrow that knowledge from a model that already learned it on millions of general images.

For language models, the shared features are even more universal. A model trained to predict the next word in general text learns grammar, vocabulary, reasoning patterns, world knowledge, and style. All of those features transfer to almost any language task: summarization, translation, question answering, code generation, and more. This is why GPT, BERT, and their descendants are called foundation models, they provide a foundation that transfers broadly.

The mathematical reason transfer works is that the lower and middle layers of a neural network learn representations that are general, meaning they capture structure in the data that is relevant across many tasks. Only the top layers specialize for the specific task the model was trained on. When you transfer a model, you keep the general representations and retrain only the specialized parts.

How Transfer Learning Is Done

Feature Extraction

The simplest form of transfer learning treats the pre-trained model as a fixed feature extractor. You remove the final classification layer, pass your data through the remaining layers, and use the output as input features for a new, smaller model. The pre-trained model's weights do not change, you are only using its learned representations.

This approach is fast because you only train the small model on top. It works well when your new task is similar to the original task and when you have very little data for the new task. A researcher with 500 medical images can extract features from a model pre-trained on 14 million ImageNet images and train a small classifier on top. The pre-trained features do the heavy lifting.

Fine-Tuning

Fine-tuning goes further by also adjusting the pre-trained model's weights on the new task's data. Typically, you start with a low learning rate to avoid destroying the pre-trained knowledge and gradually update all layers. Some practitioners freeze the early layers (which contain the most general features) and only fine-tune the later layers.

Fine-tuning is the dominant approach for adapting large language models. When a company builds a customer service chatbot, it does not train a language model from scratch. It takes an existing model (like GPT-4 or Claude), fine-tunes it on examples of good customer service conversations, and deploys the result. The fine-tuned model retains the general language ability of the base model while specializing in the company's domain, terminology, and policies.

Domain Adaptation

Domain adaptation is a specific form of transfer learning where the source and target domains differ significantly. A sentiment analysis model trained on movie reviews might perform poorly on restaurant reviews because the vocabulary, writing style, and sentiment cues are different. Domain adaptation techniques explicitly align the representations from the two domains so that the model treats both types of text similarly.

This is particularly important in fields like medicine and law, where the language and concepts differ substantially from general text. A language model pre-trained on internet text knows very little about rare diseases or specific legal statutes. Domain adaptation, often through continued pre-training on domain-specific text followed by task-specific fine-tuning, bridges this gap.

The Economics of Transfer Learning

Transfer learning fundamentally changed the economics of AI development. Before transfer learning, every new application required training a model from scratch, which meant collecting a large labeled dataset and spending significant compute. Only well-funded teams could build competitive AI systems.

With transfer learning, a small team with a few hundred labeled examples can build a useful model by fine-tuning a publicly available pre-trained model. The compute cost drops from millions of dollars (for pre-training) to hundreds or thousands (for fine-tuning). The data requirement drops from billions of examples to thousands or even hundreds. This democratization is why AI applications have proliferated so rapidly since 2018.

The tradeoff is dependency. If you build your product on someone else's pre-trained model, you depend on their decisions about model architecture, training data, and updates. If they discontinue the model or change its behavior, your product is affected. This creates a tension between the efficiency of transfer learning and the control of training from scratch.

When Transfer Learning Fails

Transfer learning is not guaranteed to help. Negative transfer occurs when the pre-trained knowledge actually hurts performance on the new task. This typically happens when the source and target domains are very different: a model pre-trained on natural images might not transfer well to satellite imagery, and a model pre-trained on English text might not transfer well to code in a niche programming language.

Transfer learning can also fail when the pre-trained model has learned biases that are harmful in the new domain. A language model pre-trained on internet text carries the biases of that text. Fine-tuning on a small, carefully curated dataset may not be enough to remove deeply embedded biases, especially if the fine-tuning data is too small to override the pre-trained representations.

Key Takeaway

Transfer learning reuses knowledge from one task to accelerate learning on another. It works because neural networks learn general features that apply across tasks. Fine-tuning pre-trained models is now the standard approach for building AI applications, reducing data and compute requirements by orders of magnitude. The approach fails when the source and target domains are too different or when the pre-trained model carries harmful biases.