evaluation_adapter
orchard.tasks.classification.evaluation_adapter
¶
Classification Evaluation Pipeline Adapter.
Wraps :func:orchard.evaluation.evaluation_pipeline.run_final_evaluation
to satisfy :class:~orchard.core.task_protocols.TaskEvalPipeline.
ClassificationEvalPipelineAdapter
¶
Orchestrates classification inference, visualization, and reporting.
run_evaluation(model, test_loader, train_losses, val_metrics_history, class_names, paths, training, dataset, augmentation, evaluation, arch_name, aug_info='N/A', tracker=None)
¶
Delegate to the existing final evaluation pipeline.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
model
|
Module
|
Trained model (already on target device). |
required |
test_loader
|
DataLoader[Any]
|
DataLoader for test set. |
required |
train_losses
|
list[float]
|
Training loss history per epoch. |
required |
val_metrics_history
|
list[Mapping[str, float]]
|
Validation metrics history per epoch. |
required |
class_names
|
list[str]
|
List of class label strings. |
required |
paths
|
RunPaths
|
RunPaths for artifact output. |
required |
training
|
TrainingConfig
|
Training sub-config. |
required |
dataset
|
DatasetConfig
|
Dataset sub-config. |
required |
augmentation
|
AugmentationConfig
|
Augmentation sub-config. |
required |
evaluation
|
EvaluationConfig
|
Evaluation sub-config. |
required |
arch_name
|
str
|
Architecture identifier. |
required |
aug_info
|
str
|
Augmentation description string. |
'N/A'
|
tracker
|
TrackerProtocol | None
|
Optional experiment tracker for final metrics. |
None
|
Returns:
| Type | Description |
|---|---|
tuple[float, float, float]
|
3-tuple of (macro_f1, test_acc, test_auc). |