training_executor

orchard.optimization.objective.training_executor

Training execution utilities for Optuna trials.

Provides TrialTrainingExecutor, which orchestrates the training and validation loop for a single Optuna trial with built-in pruning, metric tracking, and scheduler management. Per-epoch training is delegated to _loop.TrainingLoop (shared with ModelTrainer), while validation remains local with error-resilient fallback metrics.

Key responsibilities:

  • Execute epoch-level training/validation cycles
  • Apply Optuna pruning logic with warmup period
  • Track and report metrics to Optuna
  • Handle scheduler stepping (plateau-aware)
  • Provide error-resilient validation with fallback metrics

.. todo:: Unify TrialTrainingExecutor and ModelTrainer into a single engine with pluggable epoch-end callbacks (early stopping, checkpointing, Optuna pruning). Both already share the full training kernel (TrainingLoop, validate_epoch, step_scheduler, AMP scaler, Mixup); the only divergence is the epoch-level loop and post-validation actions.
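The callback-based engine proposed in this todo could be sketched as follows. This is an illustrative design sketch only, not existing orchard code; all names (`run_epochs`, `make_early_stopping`, `EpochEndCallback`) are hypothetical:

```python
from typing import Callable

# A callback receives (epoch, metric) and returns True to request a stop.
# Early stopping, checkpointing, and Optuna pruning would each be one callback.
EpochEndCallback = Callable[[int, float], bool]


def run_epochs(
    epochs: int,
    train_one_epoch: Callable[[int], float],
    validate: Callable[[], float],
    callbacks: list[EpochEndCallback],
) -> float:
    """Shared epoch loop; post-validation behaviour is fully pluggable."""
    best = float("-inf")
    for epoch in range(1, epochs + 1):
        train_one_epoch(epoch)
        metric = validate()
        best = max(best, metric)
        if any(cb(epoch, metric) for cb in callbacks):
            break
    return best


def make_early_stopping(patience: int) -> EpochEndCallback:
    """Stop after `patience` consecutive epochs without improvement."""
    state = {"best": float("-inf"), "stale": 0}

    def cb(epoch: int, metric: float) -> bool:
        if metric > state["best"]:
            state["best"], state["stale"] = metric, 0
        else:
            state["stale"] += 1
        return state["stale"] >= patience

    return cb
```

Under this design, ModelTrainer and TrialTrainingExecutor would differ only in the callback list they pass to the shared loop.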

TrialTrainingExecutor(model, train_loader, val_loader, optimizer, scheduler, criterion, training, optuna, log_interval, device, metric_extractor)

Executes training loop with Optuna pruning integration.

Orchestrates a complete training cycle for a single Optuna trial, including:

  • Training and validation epochs
  • Metric extraction and tracking
  • Pruning decisions with warmup period
  • Learning rate scheduling
  • Progress logging

Pruning and warmup parameters are read from the optuna sub-config; training hyperparameters from training.

Attributes:

| Name | Type | Description |
| --- | --- | --- |
| `model` | `Module` | PyTorch model to train. |
| `train_loader` | `DataLoader[Any]` | Training data loader. |
| `val_loader` | `DataLoader[Any]` | Validation data loader. |
| `optimizer` | `Optimizer` | Optimizer instance. |
| `scheduler` | `LRScheduler` | Learning rate scheduler. |
| `criterion` | `Module` | Loss function. |
| `device` | `device` | Training device (CPU/CUDA/MPS). |
| `metric_extractor` | `MetricExtractor` | Handles metric extraction and best-value tracking. |
| `enable_pruning` | `bool` | Whether to enable trial pruning. |
| `warmup_epochs` | `int` | Epochs before pruning activates. |
| `monitor_metric` | `str` | Name of the metric driving scheduling. |
| `scaler` | `GradScaler \| None` | AMP gradient scaler (`None` when `use_amp` is False). |
| `mixup_fn` | `callable \| None` | Mixup augmentation function (`None` when alpha is 0). |
| `epochs` | `int` | Total training epochs. |
| `log_interval` | `int` | Epoch interval for progress logging. |
| `_loop` | `TrainingLoop` | Shared epoch kernel for training steps (train only, no validation). |

Example:

```python
executor = TrialTrainingExecutor(
    model=model,
    train_loader=train_loader,
    val_loader=val_loader,
    optimizer=optimizer,
    scheduler=scheduler,
    criterion=criterion,
    training=trial_cfg.training,
    optuna=trial_cfg.optuna,
    log_interval=trial_cfg.telemetry.log_interval,
    device=device,
    metric_extractor=MetricExtractor("auc"),
)
best_metric = executor.execute(trial)
```

Initialize training executor.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `model` | `Module` | PyTorch model to train. | required |
| `train_loader` | `DataLoader[Any]` | Training data loader. | required |
| `val_loader` | `DataLoader[Any]` | Validation data loader. | required |
| `optimizer` | `Optimizer` | Optimizer instance. | required |
| `scheduler` | `LRScheduler` | Learning rate scheduler. | required |
| `criterion` | `Module` | Loss function. | required |
| `training` | `TrainingConfig` | Training hyperparameters sub-config. | required |
| `optuna` | `OptunaConfig` | Optuna pruning/warmup sub-config. | required |
| `log_interval` | `int` | Epoch interval for progress logging. | required |
| `device` | `device` | Training device. | required |
| `metric_extractor` | `MetricExtractor` | Metric extraction and tracking handler. | required |
Source code in orchard/optimization/objective/training_executor.py
def __init__(
    self,
    model: torch.nn.Module,
    train_loader: torch.utils.data.DataLoader[Any],
    val_loader: torch.utils.data.DataLoader[Any],
    optimizer: torch.optim.Optimizer,
    scheduler: torch.optim.lr_scheduler.LRScheduler,
    criterion: torch.nn.Module,
    training: TrainingConfig,
    optuna: OptunaConfig,
    log_interval: int,
    device: torch.device,
    metric_extractor: MetricExtractor,
) -> None:
    """
    Initialize training executor.

    Args:
        model: PyTorch model to train.
        train_loader: Training data loader.
        val_loader: Validation data loader.
        optimizer: Optimizer instance.
        scheduler: Learning rate scheduler.
        criterion: Loss function.
        training: Training hyperparameters sub-config.
        optuna: Optuna pruning/warmup sub-config.
        log_interval: Epoch interval for progress logging.
        device: Training device.
        metric_extractor: Metric extraction and tracking handler.
    """
    self.model = model
    self.train_loader = train_loader
    self.val_loader = val_loader
    self.optimizer = optimizer
    self.scheduler = scheduler
    self.criterion = criterion
    self.device = device
    self.metric_extractor = metric_extractor

    # Pruning config
    self.enable_pruning = optuna.enable_pruning
    self.warmup_epochs = optuna.pruning_warmup_epochs

    # Training state
    self.scaler = create_amp_scaler(training, device=str(device))
    self.mixup_fn = create_mixup_fn(training)
    self.epochs = training.epochs
    self.monitor_metric = training.monitor_metric
    self.log_interval = log_interval
    self._consecutive_val_failures: int = 0

    # Shared epoch kernel (train step only — validation is error-resilient here)
    self._loop = TrainingLoop(
        model=model,
        train_loader=train_loader,
        val_loader=val_loader,
        optimizer=optimizer,
        scheduler=scheduler,
        criterion=criterion,
        device=device,
        scaler=self.scaler,
        mixup_fn=self.mixup_fn,
        options=LoopOptions(
            grad_clip=training.grad_clip,
            total_epochs=self.epochs,
            mixup_epochs=training.mixup_epochs,
            use_tqdm=False,
            monitor_metric=self.monitor_metric,
        ),
    )

execute(trial)

Execute full training loop with pruning.

Runs training for `cfg.training.epochs` epochs, reporting the monitored metric to Optuna after each epoch and applying pruning logic once the warmup period has elapsed.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `trial` | `Trial` | Optuna trial for reporting and pruning. | required |

Returns:

| Type | Description |
| --- | --- |
| `float` | Best validation metric achieved during training. |

Raises:

| Type | Description |
| --- | --- |
| `TrialPruned` | If the trial should terminate early. |
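The `_should_prune` helper called in the source below is not shown on this page. A minimal standalone sketch of warmup-gated pruning, assuming the trial exposes Optuna's `should_prune()` API (the exact warmup boundary, `<=` here, is an assumption):

```python
def should_prune(trial, epoch: int, enable_pruning: bool, warmup_epochs: int) -> bool:
    """Ask the pruner whether to stop, but never during the warmup period.

    Returns False outright when pruning is disabled or the epoch is still
    within the warmup window; otherwise defers to the trial's pruner.
    """
    if not enable_pruning or epoch <= warmup_epochs:
        return False
    return trial.should_prune()
```

The warmup gate prevents the pruner from killing trials based on the noisy metrics of the first few epochs, before learning-rate warmup and augmentation schedules have settled.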

Source code in orchard/optimization/objective/training_executor.py
def execute(self, trial: optuna.Trial) -> float:
    """
    Execute full training loop with pruning.

    Runs training for cfg.training.epochs, reporting metrics to Optuna
    after each epoch. Applies pruning logic after warmup period.

    Args:
        trial: Optuna trial for reporting and pruning

    Returns:
        Best validation metric achieved during training

    Raises:
        optuna.TrialPruned: If trial should terminate early
    """
    for epoch in range(1, self.epochs + 1):
        # Train (delegated to shared loop)
        epoch_loss = self._loop.run_train_step(epoch)

        # Validate
        val_metrics = self._validate_epoch()

        # Extract and track metric
        current_metric = self.metric_extractor.extract(val_metrics)
        best_metric = self.metric_extractor.update_best(current_metric)

        # Report to Optuna (skip NaN to avoid poisoning the pruner)
        if not math.isnan(current_metric):
            trial.report(current_metric, epoch)

        # Check pruning
        if self._should_prune(trial, epoch):
            logger.info(
                "%s%s Trial %d pruned at epoch %d (%s=%.4f)",
                LogStyle.INDENT,
                LogStyle.ARROW,
                trial.number,
                epoch,
                self.metric_extractor.metric_name,
                current_metric,
            )
            raise optuna.TrialPruned()

        # Scheduler step (uses monitor_metric, consistent with ModelTrainer)
        step_scheduler(self.scheduler, val_metrics[self.monitor_metric])

        # Logging
        if epoch % self.log_interval == 0 or epoch == self.epochs:
            logger.info(
                "%sT%d E%d/%d | Loss:%.4f | %s:%.4f (Best:%.4f)",
                LogStyle.DOUBLE_INDENT,
                trial.number,
                epoch,
                self.epochs,
                epoch_loss,
                self.metric_extractor.metric_name,
                current_metric,
                best_metric,
            )

    self._log_trial_complete(trial, best_metric, epoch_loss)
    return best_metric