Usage Guide

This page covers device selection, performance tuning, and advanced configuration options.

Spectral Channels

OmniCloudMask expects a 3-band input array in the order Red, Green, NIR with shape (3, height, width).

What if I don’t have a NIR band?

If your imagery doesn’t include a near-infrared (NIR) band, you can still use OmniCloudMask — just pass an array of zeros in place of the NIR channel. The model was trained to be robust to this and will still produce good cloud mask predictions using only the Red and Green bands.

import numpy as np
from omnicloudmask import predict_from_array

# red and green are 2D arrays of shape (height, width)
nir_placeholder = np.zeros_like(red)

input_array = np.stack([red, green, nir_placeholder], axis=0)
mask = predict_from_array(input_array)

The same approach works with predict_from_load_func — just return the zero-filled NIR band from your custom loader:

import numpy as np
import rasterio as rio
from omnicloudmask import predict_from_load_func

def load_rgb_only(input_path):
    with rio.open(input_path) as src:
        red = src.read(1)
        green = src.read(2)
        profile = src.profile
    nir_placeholder = np.zeros_like(red)
    return np.stack([red, green, nir_placeholder], axis=0), profile

pred_paths = predict_from_load_func(scene_paths, load_rgb_only)

Device Selection

OmniCloudMask automatically selects the best available device (CUDA > MPS > CPU). See the Performance page for detailed benchmarks across hardware. Override this with the inference_device parameter:

from omnicloudmask import predict_from_array

# Force CPU
mask = predict_from_array(input_array, inference_device="cpu")

# Force CUDA
mask = predict_from_array(input_array, inference_device="cuda")

# Force Apple Silicon MPS
mask = predict_from_array(input_array, inference_device="mps")

GPU Performance Optimization

For NVIDIA GPUs, increase batch size and enable reduced precision (see Performance for the impact of these settings):

from omnicloudmask import predict_from_load_func, load_s2

pred_paths = predict_from_load_func(
    scene_paths=scene_paths,
    load_func=load_s2,
    inference_dtype="bf16",
    batch_size=4,            # increase for GPUs with more VRAM
    compile_models=True,     # faster inference for 10+ scenes
)

Enable compile_models=True when processing many scenes. This adds startup overhead but reduces per-scene inference time by 10-20%.

Downscale for Higher Throughput

Since OmniCloudMask works with variable resolution imagery (10-50 m, and down to 5 m with recent models), you can downscale for higher throughput at the cost of some accuracy. Within the validated 10-50 m range, finer imagery produces better results for the same ground extent (see Spatial Context for benchmarks); below about 10 m there is little further gain. For guidance on which resolution to run at — including why running finer than 10 m rarely helps — see Choosing a Resolution. For example, to process Sentinel-2 at 20 m instead of 10 m:

from functools import partial
from omnicloudmask import predict_from_load_func, load_s2

loader = partial(load_s2, resolution=20)

pred_paths = predict_from_load_func(scene_paths, loader)

This reduces the number of pixels by 4x, significantly improving processing speed.

Low VRAM Configuration

If you run out of GPU memory, offload mosaicking to CPU:

pred_paths = predict_from_load_func(
    scene_paths=scene_paths,
    load_func=load_s2,
    inference_dtype="bf16",
    batch_size=1,
    mosaic_device="cpu",  # offload patch mosaicking to CPU
)

CPU Inference

For CPU-only systems, use full precision:

import torch

# Optional: set thread count for better CPU performance
torch.set_num_threads(4)

pred_paths = predict_from_load_func(
    scene_paths=scene_paths,
    load_func=load_s2,
    inference_dtype="fp32",   # important: avoid fp16/bf16 on CPU
    batch_size=1,
    inference_device="cpu",
    mosaic_device="cpu",
)

Confidence Maps

Export probability scores instead of class predictions:

mask = predict_from_array(
    input_array,
    export_confidence=True,
    softmax_output=True,  # normalize to probabilities
)
# Output shape: (4, height, width) - one channel per class

Output Directory

Save predictions to a specific directory:

pred_paths = predict_from_load_func(
    scene_paths=scene_paths,
    load_func=load_s2,
    output_dir="/path/to/output",
)

Skip Existing Predictions

Avoid re-processing scenes that already have predictions:

pred_paths = predict_from_load_func(
    scene_paths=scene_paths,
    load_func=load_s2,
    overwrite=False,  # skip if output file exists
)

Custom Data Loaders

Create a custom loader for other sensors. The loader must return a numpy array of shape (3, height, width) containing Red, Green, and NIR bands, plus a rasterio profile:

import numpy as np
import rasterio as rio
from omnicloudmask import predict_from_load_func

def my_loader(input_path):
    with rio.open(input_path) as src:
        # Read bands in order: Red, Green, NIR
        data = src.read([4, 3, 5])  # adjust band indices for your sensor
        profile = src.profile
    return data, profile

pred_paths = predict_from_load_func(scene_paths, my_loader)

Multiband GeoTIFF Loader

For generic multiband GeoTIFFs, use the built-in load_multiband:

from functools import partial
from omnicloudmask import predict_from_load_func, load_multiband

# Specify band order: [Red, Green, NIR] (1-indexed)
loader = partial(load_multiband, band_order=[4, 3, 5])

pred_paths = predict_from_load_func(scene_paths, loader)

Model Versions

Use older model versions if needed:

mask = predict_from_array(input_array, model_version=3.0)

Available versions: 1.0, 2.0, 3.0, 4.0 (default is latest). Versions 1-3 require the legacy extra.

Custom Models

Use your own trained models:

import torch

my_model = torch.load("my_model.pt")

mask = predict_from_array(
    input_array,
    custom_models=my_model,
    pred_classes=4,  # number of output classes
)