Pytorch Torch Nn Huberloss

## PyTorch torch.nn.HuberLoss `torch.nn.HuberLoss` is a loss function in PyTorch that calculates the Huber loss. It acts as a hybrid between Mean Squared Error (MSE) and Mean Absolute Error (MAE). It is highly robust to outliers while maintaining stable gradients during optimization. --- ### Mathematical Definition Huber loss behaves like MSE when the absolute error is small, and like MAE when the absolute error is large. This prevents large errors (outliers) from dominating the gradient updates. For each element, the loss is calculated as: $$ L(y, \hat{y}) = \begin{cases} 0.5 \times (y - \hat{y})^2, & \text{if } |y - \hat{y}| \le \delta \\ \delta \times (|y - \hat{y}| - 0.5 \times \delta), & \text{if } |y - \hat{y}| > \delta \end{cases} $$ Where: * $y$ is the target value. * $\hat{y}$ is the predicted value. * $\delta$ (delta) is the threshold parameter that controls the transition point between the quadratic (MSE) and linear (MAE) regimes. --- ### Syntax and Parameters ```python torch.nn.HuberLoss(delta=1.0, reduction='mean') ``` #### Parameters: * **`delta`** *(float, optional)*: The threshold at which to scale the loss from quadratic to linear. Must be positive. Default: `1.0`. * **`reduction`** *(string, optional)*: Specifies the reduction to apply to the output: * `'none'`: No reduction will be applied; returns a tensor of the same shape as the input. * `'mean'`: The sum of the output will be divided by the number of elements in the output. Default: `'mean'`. * `'sum'`: The output will be summed. --- ## Code Examples ### Example 1: Basic Usage and Comparison with MSE This example demonstrates how to initialize and compute `HuberLoss`, and compares its output with standard `MSELoss` when an outlier is present. ```python import torch import torch.nn as nn # Initialize Huber Loss with delta = 1.0 criterion = nn.HuberLoss(delta=1.0) # Define predictions and targets (with an outlier at the last index) pred = torch.tensor([1.0, 2.0, 3.0, 10.0]) target = torch.tensor([1.5, 2.5, 3.5, 8.0]) # Calculate Huber Loss loss = criterion(pred, target) print("Huber Loss:", loss.item()) # Compare with MSE Loss mse_criterion = nn.MSELoss() mse_loss = mse_criterion(pred, target) print("MSE Loss: ", mse_loss.item()) ``` ### Example 2: Using Huber Loss in a Regression Training Loop This example shows how to integrate `HuberLoss` into a standard PyTorch training step for a linear regression model. ```python import torch import torch.nn as nn # Define a simple linear model model = nn.Linear(10, 1) # Define the loss function and optimizer criterion = nn.HuberLoss(delta=1.0) optimizer = torch.optim.Adam(model.parameters(), lr=0.01) # Generate dummy training data X = torch.randn(100, 10) y = torch.randn(100, 1) # Single training step model.train() optimizer.zero_grad() # Forward pass output = model(X) # Compute loss loss = criterion(output, y) # Backward pass and optimization step loss.backward() optimizer.step() print("Training Loss:", loss.item()) ``` --- ## Common Use Cases * **Regression Tasks with Outliers**: Standard MSE loss can penalize outliers too heavily, causing the model to skew its predictions. Huber loss limits the influence of these extreme values. * **Object Detection (Bounding Box Regression)**: Popularized by architectures like Fast R-CNN, Huber loss (often referred to as Smooth L1 loss) is widely used to regress bounding box coordinates. * **Robust Deep Learning**: Training models on noisy datasets where some target labels might be corrupted or incorrect.

YouTip

Pytorch Torch Nn Huberloss

📂 Categories