Commit 61f1815

Merge pull request #24 from djdameln/update-post-processing-guide
update post-processor guide
2 parents 24a675a + 196b29d commit 61f1815

File tree: 2 files changed, +116 −164 lines changed
docs/source/conf.py (+2)

@@ -50,6 +50,8 @@
     "tasklist",
     "deflist",
     "fieldlist",
+    "amsmath",
+    "dollarmath",
 ]

 # Add separate setting for eval-rst
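
The two new MyST extensions enable the math rendering used by the rewritten guide below: `dollarmath` adds `$...$` and `$$...$$` delimiters, and `amsmath` adds LaTeX environments such as `cases`. For reference, the kind of block the guide can now render:

```latex
$$
\hat{y} =
\begin{cases}
1 & \text{if } s \geq \theta^* \\
0 & \text{otherwise}
\end{cases}
$$
```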

docs/source/markdown/guides/how_to/models/post_processor.md (+114 −164)

@@ -8,180 +8,132 @@ This guide explains how post-processing works in Anomalib, its integration with
 
 ## Overview
 
-Post-processing in Anomalib is designed to handle model outputs and convert them into meaningful predictions. The post-processor:
+Post-processing in Anomalib refers to any additional operations that are applied after the model generates its raw predictions. Most anomaly detection models do not generate hard classification labels directly. Instead, they generate an anomaly score, which can be seen as an estimate of the distance between the sample and the learned representation of normality. The raw anomaly scores may consist of a single score per image for anomaly classification, or a pixel-level anomaly map for anomaly localization/segmentation. Raw anomaly scores can be hard to interpret, as they are unbounded and their range of values differs between models. To convert the raw anomaly scores into usable predictions, we need to apply a threshold that maps the raw scores to binary (normal vs. anomalous) classification labels. In addition, we may want to normalize the raw scores to the [0, 1] range for interpretability and visualization.
 
-- Computes anomaly scores from raw model outputs
-- Determines optimal thresholds for anomaly detection
-- Generates segmentation masks for pixel-level detection
-- Gets exported with the model for consistent inference
+The thresholding and normalization steps described above are typical post-processing steps in an anomaly detection workflow. The module responsible for these operations in Anomalib is the `PostProcessor`. The `PostProcessor` applies a set of post-processing operations to the raw predictions returned by the model. Similar to the {doc}`PreProcessor <./pre_processor>`, the `PostProcessor` also infuses its operations into the model graph during export. This ensures that during deployment:
 
-The `PostProcessor` class is an abstract base class that serves two roles:
+- Post-processing is part of the exported model (ONNX, OpenVINO)
+- Users don't need to manually apply post-processing steps such as thresholding and normalization
+- Edge deployment is simplified with automatic post-processing
+
+To achieve this, the `PostProcessor` class implements the following components:
 
 1. A PyTorch Module for processing model outputs that gets exported with the model
 2. A Lightning Callback for managing thresholds during training
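
In other words, a post-processor is simultaneously part of the model graph and a hook into the training loop. A minimal sketch of this dual role (hypothetical class, for illustration only; the actual base class is shown later in this guide):

```python
import torch
from lightning.pytorch import Callback


class DualRolePostProcessor(torch.nn.Module, Callback):
    """Sketch: a module that travels with the exported model graph (forward)
    and updates its state from the Lightning validation loop (callback hooks)."""

    def __init__(self) -> None:
        super().__init__()
        # Buffers are part of the state dict, so they are exported with the model.
        self.register_buffer("threshold", torch.tensor(0.5))

    def forward(self, scores: torch.Tensor) -> torch.Tensor:
        # Runs inside the exported graph: raw scores -> binary labels.
        return (scores >= self.threshold).to(scores.dtype)

    def on_validation_epoch_end(self, trainer, pl_module) -> None:
        # Lightning hook: recompute the threshold from validation statistics.
        ...
```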

-Anomalib provides concrete implementations like `OneClassPostProcessor` for specific use cases, such as one-class anomaly detection.
-This is based on the `PostProcessor` class. For any other use case, you can create a custom post-processor by inheriting from the `PostProcessor` class.
+The `PostProcessor` is an abstract base class that can be implemented to suit different post-processing workflows. In addition, Anomalib provides a default `OneClassPostProcessor` implementation, which suits most one-class learning algorithms. Other learning types, such as zero-shot learning or VLM-based models, may require different post-processing steps.
 
-## Basic Usage
+## OneClassPostProcessor
 
-The most common post-processor is `OneClassPostProcessor`:
+The `OneClassPostProcessor` is Anomalib's default post-processor class, which covers the most common anomaly detection workflow. It is responsible for adaptively computing the optimal threshold value for the dataset, applying this threshold during testing/inference, and normalizing the predicted anomaly scores to the [0, 1] range for interpretability. Thresholding and normalization are applied separately to the image- and pixel-level predictions. The following descriptions focus on the image-level predictions, but the same principles apply to the pixel-level predictions.
 
-```python
-from anomalib.post_processing import OneClassPostProcessor
+**Thresholding**
 
-# Create a post-processor with sensitivity adjustments
-post_processor = OneClassPostProcessor(
-    image_sensitivity=0.5,  # Adjust image-level threshold sensitivity
-    pixel_sensitivity=0.5   # Adjust pixel-level threshold sensitivity
-)
+The post-processor adaptively computes the optimal threshold value during the validation sequence. The threshold is computed by collecting the raw anomaly scores and the corresponding ground-truth labels for all the images in the validation set, and computing the Precision-Recall (PR) curve over the range of possible threshold values $\mathbf{\theta}$.
 
-# Apply to model outputs
-predictions = post_processor(outputs)
-print(predictions.pred_score)   # Normalized anomaly scores
-print(predictions.pred_label)   # Binary predictions (0/1)
-print(predictions.pred_mask)    # Segmentation masks (if applicable)
-print(predictions.anomaly_map)  # Normalized anomaly maps
-```
+The resulting precision and recall values are then used to calculate the F1-score for each threshold value $\theta_i$ using the following formula:
 
-## Integration with Models
+$$
+F1_i = 2 \times \frac{Precision(\theta_i) \times Recall(\theta_i)}{Precision(\theta_i) + Recall(\theta_i)}
+$$
 
-The post-processor is automatically integrated into Anomalib models:
+Finally, the optimal threshold value $\theta^*$ is determined as the threshold value that yields the highest F1-score:
 
-```python
-from anomalib.models import Patchcore
-from anomalib.post_processing import OneClassPostProcessor
+$$
+\theta^* = \arg\max_{i} F1_{i}
+$$
 
-# Model creates default post-processor (OneClassPostProcessor)
-model = Patchcore()
+During testing and predicting, the post-processor computes the binary classification labels by assigning a positive label (anomalous) to all anomaly scores that are higher than the threshold, and a negative label (normal) to all anomaly scores below the threshold. Given an anomaly score $s_{\text{test},i}$, the binary classification label $\hat{y}_{\text{test},i}$ is given by:
 
-# Or specify custom post-processor
-model = Patchcore(
-    post_processor=OneClassPostProcessor(
-        image_sensitivity=0.5,
-        pixel_sensitivity=0.5
-    )
-)
-```
+$$
+\hat{y}_{\text{test},i} =
+\begin{cases}
+1 & \text{if } s_{\text{test},i} \geq \theta^* \\
+0 & \text{if } s_{\text{test},i} < \theta^*
+\end{cases}
+$$
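
As an illustration of the adaptive thresholding described above, here is a minimal sketch that selects $\theta^*$ by maximizing the F1-score over the PR curve. It uses torchmetrics' `binary_precision_recall_curve` and made-up validation scores; Anomalib's own `F1AdaptiveThreshold` metric (shown later in this guide) encapsulates this logic:

```python
import torch
from torchmetrics.functional.classification import binary_precision_recall_curve

# Raw validation scores and ground-truth labels (illustrative values).
val_scores = torch.tensor([0.10, 0.40, 0.35, 0.80, 0.95])
val_labels = torch.tensor([0, 0, 0, 1, 1])

# Precision and recall over all candidate thresholds theta_i.
precision, recall, thresholds = binary_precision_recall_curve(val_scores, val_labels)

# F1_i = 2 * P_i * R_i / (P_i + R_i); the curve has one more
# precision/recall entry than thresholds, so drop the last point.
f1 = 2 * precision * recall / (precision + recall + 1e-10)
theta_star = thresholds[torch.argmax(f1[:-1])]  # threshold with the highest F1
```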

-## Creating Custom Post-processors
+**Normalization**
 
-To create a custom post-processor, inherit from the abstract base class `PostProcessor`:
+During the validation sequence, the post-processor iterates over the raw anomaly score predictions for the validation set, $\mathbf{s}_{\text{val}}$, and keeps track of the lowest and highest observed values, $\min\mathbf{s}_{\text{val}}$ and $\max\mathbf{s}_{\text{val}}$.
 
-```python
-from anomalib.post_processing import PostProcessor
-from anomalib.data import InferenceBatch
-import torch
-
-class CustomPostProcessor(PostProcessor):
-    """Custom post-processor implementation."""
-
-    def forward(self, predictions: InferenceBatch) -> InferenceBatch:
-        """Post-process predictions.
+During testing and predicting, the post-processor uses the stored min and max values, together with the optimal threshold value, to normalize the values to the [0, 1] range. For a raw anomaly score $s_{\text{test},i}$, the normalized score $\tilde{s}_{\text{test},i}$ is given by:
 
-        This method must be implemented by all subclasses.
-        """
-        # Implement your post-processing logic here
-        raise NotImplementedError
-```
+$$
+\tilde{s}_{\text{test},i} = \frac{s_{\text{test},i} - \theta^*}{\max\mathbf{s}_{\text{val}} - \min\mathbf{s}_{\text{val}}} + 0.5
+$$
 
-### Example: One-Class Post-processor
+As a last step, the normalized scores are capped between 0 and 1.
 
-Here's a simplified version of how `OneClassPostProcessor` is implemented:
-
-```python
-from anomalib.post_processing import PostProcessor
-from anomalib.data import InferenceBatch
-from anomalib.metrics import F1AdaptiveThreshold, MinMax
+The $\theta^*$ term in the formula above ensures that the normalized values are centered around the threshold, such that a value of 0.5 in the normalized domain corresponds to the value of the threshold in the un-normalized domain. This helps with the interpretability of the results, as it asserts that normalized values of 0.5 and higher are labeled anomalous, while values below 0.5 are labeled normal.
 
-class CustomOneClassPostProcessor(PostProcessor):
-    """Custom one-class post-processor."""
+Centering the threshold around 0.5 has the additional advantage that it allows us to add a sensitivity parameter $\alpha$ that changes the sensitivity of the anomaly detector. In the normalized domain, the binary classification label is given by:
 
-    def __init__(
-        self,
-        image_sensitivity: float | None = None,
-        pixel_sensitivity: float | None = None,
-    ):
-        super().__init__()
-        self._image_threshold = F1AdaptiveThreshold()
-        self._pixel_threshold = F1AdaptiveThreshold()
-        self._image_normalization = MinMax()
-        self._pixel_normalization = MinMax()
+$$
+\hat{y}_{\text{test},i} =
+\begin{cases}
+1 & \text{if } \tilde{s}_{\text{test},i} \geq 1 - \alpha \\
+0 & \text{if } \tilde{s}_{\text{test},i} < 1 - \alpha
+\end{cases}
+$$
 
-        self.image_sensitivity = image_sensitivity
-        self.pixel_sensitivity = pixel_sensitivity
+where $\alpha$ is a sensitivity parameter that can be varied between 0 and 1, such that a higher sensitivity value lowers the effective anomaly score threshold. The sensitivity can be tuned depending on the use case. For example, use cases in which false positives should be avoided may benefit from a reduced sensitivity.
114-
def forward(self, predictions: InferenceBatch) -> InferenceBatch:
115-
"""Post-process predictions."""
116-
if predictions.pred_score is None and predictions.anomaly_map is None:
117-
raise ValueError("At least one of pred_score or anomaly_map must be provided")
118-
119-
# Normalize scores
120-
if predictions.pred_score is not None:
121-
predictions.pred_score = self._normalize(
122-
predictions.pred_score,
123-
self._image_normalization.min,
124-
self._image_normalization.max
125-
)
126-
predictions.pred_label = predictions.pred_score > self._image_threshold.value
127-
128-
# Normalize anomaly maps
129-
if predictions.anomaly_map is not None:
130-
predictions.anomaly_map = self._normalize(
131-
predictions.anomaly_map,
132-
self._pixel_normalization.min,
133-
self._pixel_normalization.max
134-
)
135-
predictions.pred_mask = predictions.anomaly_map > self._pixel_threshold.value
136-
137-
return predictions
82+
```{note}
83+
Normalization and thresholding only works when your datamodule contains a validation set, preferably cosisting of both normal and anomalous samples. When your validation set only contains normal samples, the threshold will be set to the value of the highest observed anomaly score in your validation set.
13884
```
13985

140-
## Best Practices
141-
142-
1. **Score Normalization**:
86+
## Basic Usage
14387

144-
- Normalize scores to [0,1] range
145-
- Handle numerical stability
146-
- Consider score distributions
88+
To use the `OneClassPostProcessor`, simply add it to any Anomalib model when creating the model:
14789

148-
2. **Threshold Selection**:
90+
```python
91+
from anomalib.models import Padim
92+
from anomalib.post_processing import OneClassPostProcessor
14993

150-
- Use adaptive thresholding when possible
151-
- Validate thresholds on validation set
152-
- Consider application requirements
94+
post_processor = OneClassPostProcessor()
95+
model = Padim(post_processor=post_processor)
96+
```
15397

154-
3. **Performance**:
98+
The post-processor can be configured using its constructor arguments. In the case of the `OneClassPostProcessor`, the only configuration parameters are the sensitivity for the thresholding operation on the image- and pixel-level:
15599

156-
- Optimize computations for large outputs
157-
- Handle GPU/CPU transitions efficiently
158-
- Cache computed thresholds
100+
```python
101+
post_processor = OneClassPostProcessor(
102+
image_sensitivity=0.4,
103+
pixel_sensitivity=0.4,
104+
)
105+
model = Padim(post_processor=post_processor)
106+
```
159107

160-
4. **Validation**:
161-
- Verify prediction shapes
162-
- Check threshold computation
163-
- Test edge cases
108+
When a post-processor instance is not passed explicitly to the model, the model will automatically configure a default post-processor instance. Let's confirm this by creating a Padim model and printing the `post_processor` attribute:
164109

165-
## Common Pitfalls
110+
```python
111+
model = Padim()
112+
print(model.post_processor)
113+
# OneClassPostProcessor(
114+
# (_image_threshold): F1AdaptiveThreshold() (value=0.50)
115+
# (_pixel_threshold): F1AdaptiveThreshold() (value=0.50)
116+
# (_image_normalization_stats): MinMax()
117+
# (_pixel_normalization_stats): MinMax()
118+
# )
119+
```
166120

167-
1. **Threshold Issues**:
121+
Each model implementation in Anomalib is required to implement the `configure_post_processor` method, which defines the default post-processor for that model. We can use this method to quickly inspect the default post-processing behaviour of an Anomalib model class:
168122

169-
- Not computing thresholds during training
170-
- Incorrect threshold computation
171-
- Not handling score distributions
123+
```python
124+
print(Padim.configure_post_processor())
125+
```
172126

173-
2. **Normalization Problems**:
127+
In some cases it may be desirable to disable post-processing entirely. This is done by passing `False` to the model's `post_processor` argument:
174128

175-
- Inconsistent normalization
176-
- Numerical instability
177-
- Not handling outliers
129+
```python
130+
from anomalib.models import Padim
178131

179-
3. **Memory Issues**:
180-
- Large intermediate tensors
181-
- Unnecessary CPU-GPU transfers
182-
- Memory leaks in custom implementations
132+
model = Padim(post_processor=False)
133+
print(model.post_processor is None) # True
134+
```
183135

184-
## Edge Deployment
136+
### Exporting
185137

186138
One key advantage of Anomalib's post-processor design is that it becomes part of the model graph during export. This means:
187139

@@ -230,47 +182,45 @@ pred_labels = results[..., 2] # Already thresholded (0/1)
230182
pred_masks = results[..., 3] # Already thresholded masks (if applicable)
231183
```
232184

233-
### Benefits for Edge Deployment
185+
## Creating Custom Post-processors
234186

235-
1. **Simplified Deployment**:
187+
Advanced users may want to define their own post-processing pipeline. This can be useful when the default post-processing behaviour of the `OneClassPostProcessor` is not suitable for the model and its predictions. To create a custom post-processor, inherit from the abstract base class `PostProcessor`:
236188

237-
```python
238-
# Before: Manual post-processing needed
239-
core = Core()
240-
model = core.read_model("model_without_postprocessing.xml")
241-
compiled_model = core.compile_model(model)
242-
raw_outputs = compiled_model([image])[output_key]
243-
normalized = normalize_scores(raw_outputs)
244-
predictions = apply_threshold(normalized)
189+
```python
190+
from anomalib.post_processing import PostProcessor
191+
from anomalib.data import InferenceBatch
192+
import torch
193+
194+
class CustomPostProcessor(PostProcessor):
195+
"""Custom post-processor implementation."""
196+
197+
def forward(self, predictions: InferenceBatch) -> InferenceBatch:
198+
"""Post-process predictions.
245199
246-
# After: Everything included in OpenVINO model
247-
core = Core()
248-
model = core.read_model("model.xml")
249-
compiled_model = core.compile_model(model)
250-
results = compiled_model([image])[output_key] # Ready to use!
251-
```
200+
This method must be implemented by all subclasses.
201+
"""
202+
# Implement your post-processing logic here
203+
raise NotImplementedError
204+
```
252205

253-
2. **Consistent Results**:
206+
After defining the class, it can be used in any Anomalib workflow by passing it to the model:
254207

255-
- Same normalization across environments
256-
- Same thresholds as training
257-
- No implementation differences
208+
```python
209+
from anomalib.models import Padim
210+
211+
post_processor = CustomPostProcessor()
212+
model = Padim(post_processor=post_processor)
213+
```
258214

259-
3. **Optimized Performance**:
215+
## Best Practices
260216

261-
- Post-processing operations are optimized by OpenVINO
262-
- Hardware acceleration for all operations
263-
- Reduced memory overhead
264-
- Fewer host-device transfers
217+
**Validation**:
265218

266-
4. **Reduced Deployment Complexity**:
267-
- No need to port post-processing code
268-
- Single model file contains everything
269-
- Simpler deployment pipeline
270-
- Ready for edge devices (CPU, GPU, VPU)
219+
- Ensure that your validation set contains both normal and anomalous samples.
220+
- Ensure that your validation set contains sufficient representative samples.
271221

272222
```{seealso}
273223
For more information:
224+
- {doc}`PreProcessing guide <./pre_processing>`
274225
- {doc}`AnomalibModule Documentation <../../reference/models/base>`
275-
- {doc}`Metrics Guide <../metrics/index>`
276226
```
