This article was automatically translated from the original Turkish version.

AdamW (Adam with decoupled Weight Decay) is a variant of the Adam optimization algorithm that improves how regularization is applied. It aims to enhance Adam’s generalization ability by applying weight decay directly to the weights rather than folding an L2 penalty into the gradient. In traditional Adam, an L2 penalty added to the loss flows through the gradient and is therefore rescaled by the adaptive moment estimates; AdamW instead applies the decay term independently of the gradient-based update step, enabling more effective regularization.
AdamW retains the core structure of the Adam algorithm but changes where weight decay enters the update. Constraining the magnitude of the model’s weights helps prevent overfitting; however, when Adam absorbs this penalty into the gradient updates, the decay is distorted by the per-parameter adaptive learning rates. AdamW avoids this by applying the decay as a separate step.
AdamW thus follows the same overall procedure as Adam but separates the weight decay term from the adaptive update. The update steps of the AdamW algorithm are defined as follows:

Key Concepts
Mathematical Formulation of AdamW
Calculation of Moments: