
What Is Overfitting?

Overfitting is a common machine learning problem in which a model learns the noise and specific details of its training data too well. The result is excellent performance on training data but poor generalization to new, unseen data: the model has effectively memorized rather than learned.

Overfitting occurs when a machine learning model memorizes the training data instead of learning the underlying patterns. The model captures not only the genuine signal in the data but also the noise -- random fluctuations, outliers, and idiosyncrasies that are specific to the training set and will not appear in new data.

An overfit model performs excellently on training data but poorly on new, unseen data. This makes it essentially useless for real-world predictions, despite appearing to work perfectly during development.

A Simple Analogy

Imagine a student preparing for an exam by memorizing every answer from past exams word-for-word, rather than understanding the underlying concepts. If the new exam contains the exact same questions, they will score perfectly. But if the questions are even slightly different, they will fail -- because they memorized answers rather than learning principles.

Overfitting in ML works the same way. The model "memorized the past exam" rather than learning the concepts that generalize.

How to Recognize Overfitting

The telltale sign of overfitting is a gap between training performance and test performance:

Scenario       Training Accuracy   Test Accuracy   Diagnosis
Underfitting   60%                 58%             Model too simple
Good fit       92%                 89%             Healthy generalization
Overfitting    99%                 72%             Memorizing training data

If your model achieves near-perfect training performance but significantly worse test performance, overfitting is almost certainly the cause.
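
To make the diagnosis concrete, here is a minimal sketch in Python with scikit-learn. The synthetic dataset and the unconstrained decision tree are illustrative assumptions, not a prescription:

```python
# Illustrative sketch: diagnosing overfitting from the train/test gap.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42
)

# With no depth limit, a decision tree can memorize the training set.
model = DecisionTreeClassifier(random_state=42)
model.fit(X_train, y_train)

print(f"Train accuracy: {model.score(X_train, y_train):.2f}")  # often ~1.00
print(f"Test accuracy:  {model.score(X_test, y_test):.2f}")    # noticeably lower
```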

Other warning signs include:

  • Model complexity is very high relative to the amount of training data
  • Training loss keeps decreasing while validation loss starts increasing (see the sketch after this list)
  • The model makes confident but wrong predictions on new data
  • Small changes in input data cause large changes in predictions (instability)
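
The second warning sign can be watched directly during training. Below is a minimal sketch on synthetic data, using scikit-learn's gradient boosting, whose staged predictions stand in for training epochs; the pattern to look for is training loss falling steadily while validation loss bottoms out and turns upward:

```python
# Illustrative sketch: tracking training vs. validation loss over "epochs"
# (boosting stages). Watch for validation loss rising while training falls.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import log_loss
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=400, n_features=20, random_state=0)
X_tr, X_va, y_tr, y_va = train_test_split(X, y, test_size=0.3, random_state=0)

gb = GradientBoostingClassifier(n_estimators=300, max_depth=4, random_state=0)
gb.fit(X_tr, y_tr)

for i, (p_tr, p_va) in enumerate(
    zip(gb.staged_predict_proba(X_tr), gb.staged_predict_proba(X_va))
):
    if i % 50 == 0:
        print(i, round(log_loss(y_tr, p_tr), 3), round(log_loss(y_va, p_va), 3))
```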

What Causes Overfitting?

Several factors contribute to overfitting:

  • Too little training data -- With limited examples, the model does not have enough diversity to distinguish signal from noise.
  • Too complex a model -- A model with too many parameters relative to the training data can memorize rather than generalize. A neural network with millions of parameters trained on hundreds of examples will almost certainly overfit (a short sketch follows this list).
  • Training for too long -- The longer you train, the more the model adapts to the specific training examples. At some point, it starts fitting the noise.
  • Noisy or mislabeled data -- If training data contains errors, the model may learn those errors as patterns.
  • Irrelevant features -- Including features that are unrelated to the prediction target adds noise for the model to memorize.
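
To make the model-complexity point concrete, here is a minimal sketch assuming NumPy and scikit-learn: a degree-15 polynomial fit to ten noisy points has more coefficients than data points and ends up interpolating the noise:

```python
# Illustrative sketch: too much capacity for too little data.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = np.sort(rng.uniform(0, 1, 10)).reshape(-1, 1)
y = np.sin(2 * np.pi * X).ravel() + rng.normal(0, 0.1, 10)  # signal + noise

for degree in (3, 15):
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X, y)
    print(f"degree {degree}: train R^2 = {model.score(X, y):.3f}")
# The degree-15 fit "wins" on training data only because it chases the noise;
# between the training points it swings wildly.
```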

How to Prevent and Fix Overfitting

1. Get More Training Data

The most straightforward solution. More data provides more diverse examples, making it harder for the model to memorize. If collecting new data is expensive, consider data augmentation techniques (for images: rotation, flipping, cropping; for text: synonym replacement, back-translation).
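
As a sketch of what augmentation can look like in code (torchvision is one common choice; Keras and Albumentations offer equivalents):

```python
# Illustrative sketch: image augmentation with torchvision transforms.
# Each epoch sees a slightly different version of every image, which
# makes raw memorization harder.
from torchvision import transforms

augment = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),  # mirror half the images
    transforms.RandomRotation(degrees=15),   # small random rotations
    transforms.RandomResizedCrop(size=224),  # random crop, then resize
    transforms.ToTensor(),
])
# Pass `augment` as the dataset transform, e.g.
# torchvision.datasets.ImageFolder("data/train", transform=augment)
```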

2. Simplify the Model

Use a less complex model with fewer parameters:

  • Fewer layers and neurons in neural networks
  • Shallower trees in decision tree models
  • Fewer features in the input
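
A minimal sketch of the first two, assuming scikit-learn; the specific limits are illustrative, not recommendations:

```python
# Illustrative sketch: capping model complexity in scikit-learn.
from sklearn.neural_network import MLPClassifier
from sklearn.tree import DecisionTreeClassifier

# A shallow tree cannot carve out a leaf for every training example.
tree = DecisionTreeClassifier(max_depth=4, min_samples_leaf=20)

# A single small hidden layer bounds the network's capacity.
net = MLPClassifier(hidden_layer_sizes=(16,))
```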

3. Regularization

Add mathematical constraints that penalize complexity:

  • L1 regularization (Lasso) -- Encourages the model to use fewer features by pushing some weights to zero
  • L2 regularization (Ridge) -- Prevents any single weight from becoming too large
  • Dropout -- In neural networks, randomly disabling neurons during training to prevent co-adaptation
  • Early stopping -- Stop training when validation performance stops improving, even if training performance could still improve
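
In code, the first two are one-line choices in scikit-learn and dropout is a single layer in PyTorch; early stopping is a small loop around training, outlined here in comments with a hypothetical train_one_epoch helper:

```python
# Illustrative sketches of the four regularization techniques above.
from sklearn.linear_model import Lasso, Ridge
import torch.nn as nn

l1_model = Lasso(alpha=0.1)  # L1: pushes some weights exactly to zero
l2_model = Ridge(alpha=1.0)  # L2: shrinks all weights so none dominates

net = nn.Sequential(
    nn.Linear(64, 32),
    nn.ReLU(),
    nn.Dropout(p=0.5),  # randomly zeroes activations during training
    nn.Linear(32, 1),
)

# Early stopping, in outline (train_one_epoch is a hypothetical helper):
# best, wait, patience = float("inf"), 0, 5
# for epoch in range(max_epochs):
#     val_loss = train_one_epoch(net)
#     if val_loss < best:
#         best, wait = val_loss, 0
#     else:
#         wait += 1
#         if wait >= patience:
#             break  # validation stopped improving
```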

4. Cross-Validation

Use K-fold cross-validation to get a more reliable estimate of model performance and to tune hyperparameters without overfitting to a single validation set.
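
A minimal sketch, assuming scikit-learn and a synthetic dataset:

```python
# Illustrative sketch: 5-fold cross-validation in scikit-learn.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import KFold, cross_val_score

X, y = make_classification(n_samples=500, random_state=0)
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(RandomForestClassifier(random_state=0), X, y, cv=cv)

# Five held-out estimates are much harder to "get lucky" on than one.
print(f"accuracy: {scores.mean():.2f} +/- {scores.std():.2f}")
```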

5. Feature Selection

Remove irrelevant or redundant features. A model with fewer, more relevant features is less likely to overfit than one drowning in noise.
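
A minimal sketch, assuming scikit-learn's univariate selection (one of several reasonable approaches):

```python
# Illustrative sketch: keeping only the most informative features.
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif

# 50 features, only 5 of which actually carry signal.
X, y = make_classification(
    n_samples=500, n_features=50, n_informative=5, random_state=0
)
X_reduced = SelectKBest(f_classif, k=5).fit_transform(X, y)
print(X.shape, "->", X_reduced.shape)  # (500, 50) -> (500, 5)
```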

6. Ensemble Methods

Combine multiple models to reduce overfitting:

  • Random Forests -- Average predictions from many decision trees, each trained on a random subset of data and features
  • Bagging -- Train multiple models on different bootstrap samples and average their predictions
  • Gradient Boosting -- Build models sequentially, each correcting errors of the previous one (with regularization)
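
Minimal sketches of all three, assuming scikit-learn; the hyperparameters shown are illustrative:

```python
# Illustrative sketches of the three ensemble strategies above.
from sklearn.ensemble import (
    BaggingClassifier,
    GradientBoostingClassifier,
    RandomForestClassifier,
)
from sklearn.tree import DecisionTreeClassifier

forest = RandomForestClassifier(n_estimators=200, max_features="sqrt")
bagged = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50)
boosted = GradientBoostingClassifier(
    n_estimators=200,
    learning_rate=0.05,  # shrinkage acts as a built-in regularizer
    max_depth=3,         # shallow trees keep each learner weak
)
```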

Business Impact of Overfitting

For business leaders, overfitting is not just a technical curiosity -- it has real financial consequences:

  • Failed deployments -- A model that appeared accurate in testing performs poorly in production, wasting development investment
  • Wrong decisions -- An overfit demand forecasting model might recommend inventory levels that are wildly off, leading to stockouts or excess inventory
  • Lost trust -- When an AI system makes confident but wrong predictions, stakeholders lose confidence in AI initiatives, making future projects harder to fund
  • Wasted resources -- Teams spend months building a model that cannot generalize, requiring a restart

Overfitting in Southeast Asian Business Contexts

Several regional factors increase overfitting risk:

  • Limited historical data -- Rapidly growing businesses in emerging ASEAN markets may have only months or a few years of data, increasing overfitting risk
  • Market volatility -- Fast-changing consumer behavior in markets like Indonesia and Vietnam means training data may not represent future conditions
  • Regional diversity -- A model trained on data from one market (e.g., Singapore) may overfit to that market's patterns and fail when applied to another (e.g., Thailand or the Philippines)
  • Seasonal events -- Models may overfit to specific seasonal patterns if training data covers only one or two cycles of major holidays

The Bottom Line

Overfitting is the most common failure mode in machine learning projects. Understanding it is essential for anyone involved in AI initiatives -- not just data scientists, but also the business leaders who evaluate model performance and make deployment decisions. When a vendor or team reports near-perfect accuracy, the first question should always be: "Is this the training accuracy or the test accuracy?"

Why It Matters for Business

Overfitting is arguably the most important concept for non-technical business leaders to understand about machine learning. It explains why a model that looked perfect during development can fail completely in production -- a scenario that wastes investment, damages stakeholder confidence, and delays AI adoption. For CEOs and CTOs approving AI projects, understanding overfitting helps you ask the right questions and avoid costly mistakes.

The financial impact is direct. Industry data suggests that 60-80% of ML models that perform well in development fail to deliver expected value in production, and overfitting is a primary cause. A model that appeared to predict customer churn with 95% accuracy in testing but only achieves 65% in production has cost your organization the full development budget with a fraction of the expected return. Multiply this across multiple AI initiatives and the cost of not understanding overfitting becomes substantial.

For Southeast Asian businesses, overfitting risk is elevated due to shorter data histories in rapidly growing markets, significant market diversity across ASEAN countries, and fast-changing consumer behavior. This makes it critical to insist on rigorous validation practices, out-of-sample testing, and realistic performance expectations. When evaluating AI projects or vendor deliverables, always ask to see test set performance (not training performance) and understand how the model was validated against data it has never seen.

Key Considerations
  • Always ask for test set or out-of-sample performance metrics, not training metrics -- high training accuracy alone means nothing
  • Ensure your data science team uses proper train-validation-test splits and never evaluates final performance on data used during training or tuning (a minimal split sketch follows this list)
  • Be skeptical of reported accuracies above 95% for complex business problems; this often indicates overfitting or data leakage rather than a genuinely excellent model
  • Budget for regularization and validation as part of every ML project -- these are not optional steps but essential quality controls
  • For markets with limited historical data, consider simpler models that are inherently less prone to overfitting
  • When deploying models across multiple ASEAN markets, validate performance separately in each market rather than relying on aggregate metrics
  • Monitor model performance after deployment; a sudden drop in accuracy compared to test results may indicate overfitting was not fully addressed
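
For the train-validation-test point above, here is a minimal sketch of a three-way split, assuming scikit-learn and synthetic data:

```python
# Illustrative sketch: a proper train / validation / test split.
# The test set is touched exactly once, at the very end.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)

# Carve off a final test set (20%)...
X_tmp, X_test, y_tmp, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)
# ...then split the rest into train (60% overall) and validation (20%).
X_train, X_val, y_train, y_val = train_test_split(
    X_tmp, y_tmp, test_size=0.25, random_state=0
)
```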

Frequently Asked Questions

How can I tell if a model is overfitting without being a data scientist?

Ask two questions: what was the model's accuracy on training data, and what was it on test data (data the model never saw during development)? If training accuracy is significantly higher than test accuracy -- for example, 98% vs. 75% -- the model is likely overfitting. Also, after deployment, compare the performance reported during development with actual real-world performance; a significant gap is a strong indicator of overfitting.

Is overfitting always bad?

Yes, overfitting is always a problem because it means the model will not perform as expected on new data. That said, a small gap (training accuracy a few percentage points above test accuracy) is normal and expected; the concern is when the gap is large. In some specialized applications such as anomaly detection, a very tight fit to the majority of "normal" examples can be acceptable, provided the model still identifies rare events correctly.

Can a model overfit even when trained on a large dataset?

Yes. A sufficiently complex model can overfit even large datasets, though the risk is lower. Overfitting with large data typically occurs when the model has too many parameters, when the data contains systematic biases, when there is data leakage (information from the future or the test set inadvertently included in training), or when the training data is not representative of the production environment. Large data reduces but does not eliminate overfitting risk.

Need help preventing overfitting?

Pertama Partners helps businesses across Southeast Asia adopt AI strategically. Let's discuss how to guard against overfitting across your AI roadmap.