L1/L2 Regularization in PyTorch

Introduction

Regularization is a technique used in machine learning to prevent overfitting. Overfitting occurs when a model fits the training data too closely, including its noise, resulting in poor performance on unseen data. Regularization addresses this by adding a penalty term to the model’s loss function, discouraging the model from becoming overly complex.

L1 and L2 regularization are two common types of regularization used in deep learning.

L1 Regularization (Lasso)

L1 regularization adds a penalty to the loss function that is proportional to the sum of the absolute values of the weights (the L1 norm).

How it Works:

  • L1 regularization encourages sparsity in the model by driving some of the weights to zero.
  • This can be helpful in feature selection, as it effectively removes features that are not contributing significantly to the model’s predictions.
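In equation form, with data loss L_data, weights w_i, and regularization strength λ (l1_lambda in the code below), the penalized loss is:

L_{\mathrm{total}} = L_{\mathrm{data}} + \lambda \sum_{i} |w_i|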

Implementation in PyTorch:


import torch
import torch.nn as nn

# Define a linear layer with L1 regularization
class LinearWithL1(nn.Module):
    def __init__(self, in_features, out_features, l1_lambda):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)
        self.l1_lambda = l1_lambda

    def forward(self, x):
        return self.linear(x)

    def regularize(self):
        # L1 penalty: lambda times the sum of absolute weight values
        return self.l1_lambda * torch.sum(torch.abs(self.linear.weight))

# Example usage with dummy data (batch of 32, 10 features in, 5 out)
model = LinearWithL1(10, 5, l1_lambda=0.01)
loss_fn = nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters())

x = torch.randn(32, 10)  # dummy inputs
y = torch.randn(32, 5)   # dummy targets

# Training loop: add the L1 penalty to the data loss before backpropagating
for epoch in range(10):
    loss = loss_fn(model(x), y) + model.regularize()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
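The class above penalizes a single layer. For a model with several layers, one common pattern is to sum the penalty over every weight matrix. The helper below is a minimal sketch of that idea (the name l1_penalty and the choice to skip 1-D parameters such as biases are illustrative, not part of the original example):

# Hypothetical helper: L1 penalty summed over all weight matrices in a model
def l1_penalty(model, l1_lambda):
    # Skipping 1-D parameters (biases) is a common, but optional, convention
    return l1_lambda * sum(p.abs().sum() for p in model.parameters() if p.dim() > 1)

# Usage inside the training loop:
# loss = loss_fn(model(x), y) + l1_penalty(model, 0.01)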

L2 Regularization (Ridge)

L2 regularization adds a penalty to the loss function that is proportional to the sum of the squared weights (the squared L2 norm).

How it Works:

  • L2 regularization prevents weights from becoming too large, reducing the model’s sensitivity to individual data points.
  • This helps to prevent overfitting and improve the model’s generalization ability.
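In equation form, with regularization strength λ (l2_lambda in the code below):

L_{\mathrm{total}} = L_{\mathrm{data}} + \lambda \sum_{i} w_i^2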

Implementation in PyTorch:


import torch
import torch.nn as nn

# Define a linear layer with L2 regularization
class LinearWithL2(nn.Module):
    def __init__(self, in_features, out_features, l2_lambda):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)
        self.l2_lambda = l2_lambda

    def forward(self, x):
        return self.linear(x)

    def regularize(self):
        # L2 penalty: lambda times the sum of squared weight values
        return self.l2_lambda * torch.sum(torch.square(self.linear.weight))

# Example usage with dummy data (batch of 32, 10 features in, 5 out)
model = LinearWithL2(10, 5, l2_lambda=0.01)
loss_fn = nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters())

x = torch.randn(32, 10)  # dummy inputs
y = torch.randn(32, 5)   # dummy targets

# Training loop: add the L2 penalty to the data loss before backpropagating
for epoch in range(10):
    loss = loss_fn(model(x), y) + model.regularize()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
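Worth noting: for L2 specifically, PyTorch optimizers build this penalty in through the weight_decay argument, so the manual penalty above is mainly illustrative. Keep in mind that weight_decay applies to all parameters, biases included:

# Built-in L2-style regularization via the optimizer's weight_decay argument
optimizer = torch.optim.Adam(model.parameters(), weight_decay=0.01)

# AdamW applies decoupled weight decay, often preferred with Adam-style optimizers
optimizer = torch.optim.AdamW(model.parameters(), weight_decay=0.01)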

Choosing L1 or L2 Regularization

The choice between L1 and L2 regularization depends on the specific problem and dataset. Here’s a summary:

Regularization | Effect                                    | When to Use
L1 (Lasso)     | Sparse weights, feature selection         | High-dimensional datasets where feature selection is important
L2 (Ridge)     | Shrinks weights, improves generalization  | Most situations; reduces overfitting

Conclusion

Regularization is an essential technique for improving how machine learning models generalize to unseen data. L1 and L2 regularization are effective ways to prevent overfitting, and by understanding the differences between them, you can choose the appropriate technique for your specific problem.
