Composition Strategies Deep Dive¤

Metadata	Value
Level	Intermediate
Runtime	~30 min
Prerequisites	Operators Tutorial, Pipeline Tutorial
Format	Python + Jupyter

Overview¤

Master the 11 composition strategies in Datarax for combining operators. This tutorial covers sequential chaining, parallel application, ensemble reductions, and dynamic branching - all with JAX vmap/JIT compatibility.

Learning Goals¤

By the end of this tutorial, you will be able to:

Chain operators with Sequential strategies (basic, conditional, dynamic)
Apply operators in parallel with different merge modes
Use weighted combinations for learnable augmentation
Build ensemble reductions (mean, sum, max, min)
Route data through branches based on conditions
Write vmap/JIT-compatible composition patterns

Coming from PyTorch?¤

PyTorch	Datarax
`transforms.Compose([t1, t2])`	`CompositeOperatorModule(..., strategy=SEQUENTIAL)`
`transforms.RandomChoice([t1, t2])`	`CompositeOperatorModule(..., strategy=BRANCHING)`
`transforms.RandomApply([t], p=0.5)`	`CompositeOperatorModule(..., strategy=CONDITIONAL_SEQUENTIAL)`
Manual weighted ensemble	`CompositeOperatorModule(..., strategy=WEIGHTED_PARALLEL)`

Coming from TensorFlow?¤

TensorFlow	Datarax
`tf.keras.Sequential([l1, l2])`	`CompositeOperatorModule(..., strategy=SEQUENTIAL)`
`tf.keras.layers.Average([o1, o2])`	`CompositeOperatorModule(..., strategy=ENSEMBLE_MEAN)`
Custom conditional logic	`CompositeOperatorModule(..., strategy=CONDITIONAL_*)`

Files¤

Python Script: examples/core/08_composition_strategies_tutorial.py
Jupyter Notebook: examples/core/08_composition_strategies_tutorial.ipynb

Quick Start¤

Run the Python Script¤

python examples/core/08_composition_strategies_tutorial.py

Run the Jupyter Notebook¤

jupyter lab examples/core/08_composition_strategies_tutorial.ipynb

Strategy Overview¤

Datarax provides 11 composition strategies organized into 4 categories:

graph TB
    subgraph Sequential["Sequential Strategies"]
        SEQ["SEQUENTIAL<br/>Chain: op1 - op2 - op3"]
        CSEQ["CONDITIONAL_SEQUENTIAL<br/>Chain with per-op conditions"]
        DSEQ["DYNAMIC_SEQUENTIAL<br/>Runtime-modifiable chain"]
    end

    subgraph Parallel["Parallel Strategies"]
        PAR["PARALLEL<br/>Apply all, merge outputs"]
        WPAR["WEIGHTED_PARALLEL<br/>Apply all with weights"]
        CPAR["CONDITIONAL_PARALLEL<br/>Apply subset, merge"]
    end

    subgraph Ensemble["Ensemble Strategies"]
        EMEAN["ENSEMBLE_MEAN<br/>Parallel + average"]
        ESUM["ENSEMBLE_SUM<br/>Parallel + sum"]
        EMAX["ENSEMBLE_MAX<br/>Parallel + max"]
        EMIN["ENSEMBLE_MIN<br/>Parallel + min"]
    end

    subgraph Routing["Routing Strategies"]
        BRANCH["BRANCHING<br/>Route through paths"]
    end

    style Sequential fill:#e1f5fe
    style Parallel fill:#f3e5f5
    style Ensemble fill:#e8f5e9
    style Routing fill:#fff3e0

Key Concepts¤

Helper Factories¤

The tutorial builds its operators through small factory functions, so each composition example can create fresh operators with fixed parameters:

def make_brightness_op(delta: float, seed: int = 0) -> BrightnessOperator:
    """Create a brightness operator with fixed delta."""
    return BrightnessOperator(
        BrightnessOperatorConfig(
            field_key="image",
            brightness_range=(delta, delta),  # Fixed delta
            stochastic=False,
        ),
        rngs=nnx.Rngs(seed),
    )


def make_contrast_op(factor: float, seed: int = 0) -> ContrastOperator:
    """Create a contrast operator with fixed factor."""
    return ContrastOperator(
        ContrastOperatorConfig(
            field_key="image",
            contrast_range=(factor, factor),  # Fixed factor
            stochastic=False,
        ),
        rngs=nnx.Rngs(seed),
    )


def make_noise_op(std: float, seed: int = 0) -> NoiseOperator:
    """Create a noise operator."""
    return NoiseOperator(
        NoiseOperatorConfig(
            field_key="image",
            mode="gaussian",
            noise_std=std,
            stochastic=True,
            stream_name="noise",
        ),
        rngs=nnx.Rngs(noise=seed),
    )

Part 1: Sequential Strategies¤

Sequential strategies chain operators where output of one becomes input of next.

from datarax.operators.composite_operator import (
    CompositeOperatorConfig,
    CompositeOperatorModule,
    CompositionStrategy,
)

# SEQUENTIAL: Basic chaining
bright_op = make_brightness_op(0.1, seed=1)
contrast_op = make_contrast_op(1.2, seed=2)

sequential_composite = CompositeOperatorModule(
    CompositeOperatorConfig(
        strategy=CompositionStrategy.SEQUENTIAL,
        operators=[bright_op, contrast_op],
    ),
    rngs=nnx.Rngs(0),
)

Terminal Output:

SEQUENTIAL Strategy:
  Chain: Brightness(+0.1) → Contrast(×1.2)
  Input range: [0.0, 1.0]
  Output range: [0.000, 1.440]

Part 2: Parallel Strategies¤

Apply ALL operators to the SAME input, then merge outputs.

Merge Mode	Description	Output Shape
`"concat"`	Concatenate along axis	`(N, H, W, C×num_ops)`
`"stack"`	Stack into new dimension	`(num_ops, N, H, W, C)`
`"sum"`	Element-wise sum	Same as input
`"mean"`	Element-wise mean	Same as input
`"dict"`	Keep separate in dict	`{op_0: ..., op_1: ...}`

# PARALLEL with mean merge
op_bright = make_brightness_op(0.15, seed=10)
op_contrast = make_contrast_op(1.3, seed=11)
op_noise = make_noise_op(0.05, seed=12)

parallel_mean = CompositeOperatorModule(
    CompositeOperatorConfig(
        strategy=CompositionStrategy.PARALLEL,
        operators=[op_bright, op_contrast, op_noise],
        merge_strategy="mean",
    ),
    rngs=nnx.Rngs(0),
)

Terminal Output:

PARALLEL Strategy (merge='mean'):
  Operators: [Brightness, Contrast, Noise]
  Output shape: (16, 32, 32, 3) (same as input)
  Output is the mean of all three augmented versions

Part 3: Weighted Parallel¤

Apply operators in parallel with learnable or fixed weights.

op1 = make_brightness_op(0.2, seed=40)
op2 = make_contrast_op(1.4, seed=41)
op3 = make_noise_op(0.03, seed=42)

weighted_parallel = CompositeOperatorModule(
    CompositeOperatorConfig(
        strategy=CompositionStrategy.WEIGHTED_PARALLEL,
        operators=[op1, op2, op3],
        weights=[0.5, 0.3, 0.2],  # 50% brightness, 30% contrast, 20% noise
        learnable_weights=False,  # Set True for gradient-based learning
    ),
    rngs=nnx.Rngs(0),
)

Part 4: Ensemble Strategies¤

Parallel application with mathematical reduction.

Strategy	Reduction	Formula
`ENSEMBLE_MEAN`	Average	`(op₁ + op₂ + ... + opₙ) / n`
`ENSEMBLE_SUM`	Sum	`op₁ + op₂ + ... + opₙ`
`ENSEMBLE_MAX`	Maximum	`max(op₁, op₂, ..., opₙ)`
`ENSEMBLE_MIN`	Minimum	`min(op₁, op₂, ..., opₙ)`

ensemble_ops = [
    make_brightness_op(0.1, seed=50),
    make_brightness_op(-0.1, seed=51),
    make_contrast_op(1.2, seed=52),
]

ensemble_mean = CompositeOperatorModule(
    CompositeOperatorConfig(
        strategy=CompositionStrategy.ENSEMBLE_MEAN,
        operators=ensemble_ops,
    ),
    rngs=nnx.Rngs(0),
)

Part 5: Branching Strategy¤

Route data through different operator branches based on conditions.

def label_router(data):
    """Route based on label: 0-5 → branch 0, 6-9 → branch 1."""
    label = data["label"]
    return jax.lax.cond(label > 5, lambda: 1, lambda: 0)

branch_ops = [
    make_brightness_op(0.2, seed=70),  # Branch 0: for labels 0-5
    make_contrast_op(1.4, seed=71),  # Branch 1: for labels 6-9
]

branching = CompositeOperatorModule(
    CompositeOperatorConfig(
        strategy=CompositionStrategy.BRANCHING,
        operators=branch_ops,
        router=label_router,
        default_branch=0,
    ),
    rngs=nnx.Rngs(0),
)

Terminal Output:

BRANCHING Strategy:
  Router: label <= 5 → Brightness, label > 5 → Contrast
  Batch labels: [3 7 2 8 4 6 1 9]...
  Each sample routed to appropriate augmentation branch

JAX Compatibility Notes¤

The composition strategies are designed for jax.vmap and jax.jit compatibility:

Pattern	Why Needed
Integer routing	`jax.lax.switch` requires int index
`jax.lax.cond` for conditions	Python `if` breaks tracing
Fixed output shapes	vmap requires consistent shapes
No dict key from traced values	Dict keys must be static

Strategy Selection Guide¤

Use Case	Recommended Strategy
Standard augmentation chain	`SEQUENTIAL`
Skip augmentation conditionally	`CONDITIONAL_SEQUENTIAL`
Multi-view generation	`PARALLEL` (merge='dict')
Averaged augmentation	`PARALLEL` (merge='mean') or `ENSEMBLE_MEAN`
Learnable augmentation policy	`WEIGHTED_PARALLEL`
Class-specific augmentation	`BRANCHING`
Test-time augmentation	`ENSEMBLE_MEAN`

Results¤

Running the tutorial produces:

============================================================
Composition Strategies Tutorial
============================================================

1. SEQUENTIAL: Chain operators
   Output shape: (16, 32, 32, 3)

2. ENSEMBLE_MEAN: Average augmentations
   Output range: [0.000, 1.100]

3. BRANCHING: Route by label
   Labels: [3 7 2 8 4]... → routed to different branches

============================================================
Tutorial completed successfully!
============================================================

Next Steps¤

DAG Fundamentals - Pipeline architecture
Sharding Guide - Distributed pipelines
Performance Guide - Optimization tips

API Reference¤

CompositeOperatorModule - Full API documentation
CompositionStrategy - Strategy enum