Training and Evaluation pipeline by Sharkyii · Pull Request #105 · openclimatefix/open-data-pvnet

Sharkyii · 2025-12-18T13:45:39Z

Description

This PR introduces a stable, end-to-end training and evaluation pipeline for PVNet, currently validated on GSP-only data.

To simplify debugging and keep the pipeline stable during initial integration, the optimizer was intentionally switched from EmbAdamWReduceLROnPlateau to plain AdamW. This removes learning-rate scheduling from the training loop while data flow, model wiring, and configuration handling are being validated.
The original optimizer and learning-rate scheduling will be reintroduced in a follow-up PR once the full multi-encoder setup is finalized.
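
For readers unfamiliar with the trade-off, the sketch below illustrates the general idea of a reduce-on-plateau rule next to a constant learning rate. It is a toy, pure-Python illustration and not the PVNet or PyTorch implementation; the class name `PlateauSchedule`, the helper `run_schedule`, and the `factor` and `patience` defaults are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class PlateauSchedule:
    """Toy reduce-on-plateau rule (hypothetical, for illustration only)."""

    lr: float
    factor: float = 0.5    # assumed multiplier applied when a plateau is detected
    patience: int = 2      # assumed number of non-improving epochs tolerated
    best: float = float("inf")
    bad_epochs: int = 0

    def step(self, val_loss: float) -> float:
        """Update and return the learning rate given one epoch's validation loss."""
        if val_loss < self.best:
            # New best loss: reset the plateau counter.
            self.best = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        if self.bad_epochs > self.patience:
            # Plateau detected: shrink the learning rate and start counting again.
            self.lr *= self.factor
            self.bad_epochs = 0
        return self.lr


def run_schedule(losses, lr=1e-3, use_plateau=True):
    """Return the learning rate in effect after each epoch."""
    schedule = PlateauSchedule(lr=lr) if use_plateau else None
    return [schedule.step(loss) if schedule else lr for loss in losses]
```

With `use_plateau=False` the learning rate never changes, which is the behaviour a plain AdamW setup gives and leaves one fewer moving part to rule out when a training run misbehaves.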

At this stage:

- The pipeline is fully functional for GSP data
- Training and evaluation pipelines run end-to-end without errors
- Configuration and Hydra overrides are verified and stable

Planned follow-ups:

- Extend support to NWP and satellite data
- Properly integrate and validate GSP + NWP multi-encoder setup
- Restore EmbAdamWReduceLROnPlateau once architecture and inputs are finalized

Two reference documents have been added to explain the pipeline design and usage for future contributors.

Fixes #7

How Has This Been Tested?

- End-to-end training and evaluation pipelines executed successfully
- Verified data loading, batching, model forward pass, logging, and checkpointing
- Tested using GSP-only configuration
- Metrics logged correctly via W&B (offline mode)
- Sanity checks performed on loss and MAE trends
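
The PR does not show how the loss and MAE sanity checks were done. A minimal sketch of one possible check is given below, assuming per-epoch metric values are available as a plain list; the function names `mean_absolute_error` and `is_improving` and the `window` parameter are hypothetical and not taken from the PR.

```python
from statistics import mean


def mean_absolute_error(predictions, targets):
    """Return the mean absolute error between two equal-length sequences."""
    if len(predictions) != len(targets) or len(predictions) == 0:
        raise ValueError("predictions and targets must be non-empty and equal in length")
    return mean(abs(p - t) for p, t in zip(predictions, targets))


def is_improving(values, window=3):
    """Return True if the mean of the last `window` values is below the mean of the first `window`."""
    if len(values) < 2 * window:
        raise ValueError("need at least 2 * window values to compare start and end")
    return mean(values[-window:]) < mean(values[:window])
```

A check like this only confirms that a metric moved in the expected direction over a run; it says nothing about whether the final value is good.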

If your changes affect data processing, have you plotted any changes?

Yes (basic sanity checks on metrics and loss behaviour)

Checklist

- My code follows [OCF's coding style guidelines](https://github.com/openclimatefix/.github/blob/main/coding_style.md)
- I have performed a self-review of my own code
- I have made corresponding changes to the documentation
- I have added tests
- I have checked my code and corrected any misspellings

peterdudfield · 2025-12-19T07:54:22Z

    "zarr==2.18.3",
    "pvnet==4.1.19",
-    "ocf-data-sampler==0.2.32",
+    "ocf-data-sampler==0.2.10",


How come you had to move this down?

peterdudfield · 2025-12-19T07:55:04Z

-  #   number_of_conv3d_layers: 6
-  #   conv3d_channels: 32
-  #   image_size_pixels: 24
+  ukv:


How did you use UKV? I thought only GFS was accessible.

peterdudfield · 2025-12-19T07:56:08Z

+        "cpu", "--device", help="Device to run evaluation on ('cpu' or 'cuda')"
+    ),
+    quantiles: str = typer.Option(
+        "0.02,0.1,0.25,0.5,0.75,0.9,0.98",


Could you get this from the config?
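
One way to act on this suggestion is sketched below: read the quantiles from the loaded model config and fall back to a CLI string only when one is explicitly passed. This is a hedged illustration, not code from the PR. The config layout (`config["model"]["output_quantiles"]`) and the function names `validate_quantiles` and `resolve_quantiles` are assumptions and should be adapted to the actual config schema.

```python
def validate_quantiles(values):
    """Check that quantiles lie strictly between 0 and 1 and are strictly increasing."""
    quantiles = [float(v) for v in values]
    if not quantiles:
        raise ValueError("at least one quantile is required")
    if any(q <= 0.0 or q >= 1.0 for q in quantiles):
        raise ValueError("quantiles must lie strictly between 0 and 1")
    if any(a >= b for a, b in zip(quantiles, quantiles[1:])):
        raise ValueError("quantiles must be strictly increasing")
    return quantiles


def resolve_quantiles(config, cli_value=None, key="output_quantiles"):
    """Prefer an explicit CLI override; otherwise read quantiles from the config.

    `config` is assumed to be a nested mapping with a "model" section
    (hypothetical layout, for illustration only).
    """
    if cli_value is not None:
        # CLI form is a comma-separated string such as "0.1,0.5,0.9".
        parts = [item for item in cli_value.split(",") if item.strip()]
        return validate_quantiles(parts)
    from_config = config.get("model", {}).get(key)
    if from_config is None:
        raise KeyError(f"'{key}' not found in the model config and no CLI value was given")
    return validate_quantiles(from_config)
```

Keeping the config as the single source of truth avoids evaluating with quantiles that differ from the ones the model was trained to predict.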

peterdudfield · 2025-12-19T07:58:04Z

@@ -0,0 +1,183 @@
+import logging


Does https://github.com/openclimatefix/open-data-pvnet/blob/main/run.py not already do this?

Sharkyii added 4 commits December 18, 2025 18:28

- 5af2a41 Added training and evaluation pipeline
- 9aac6f8 Added training and evaluation pipeline
- d62f4b6 Added training and evaluation pipeline
- 33f2909 restored notebooks

Sharkyii mentioned this pull request Dec 18, 2025

train model + evaluate model #7

Open

peterdudfield reviewed Dec 19, 2025

Sharkyii closed this Dec 24, 2025

Sharkyii wants to merge 4 commits into openclimatefix:main from Sharkyii:feature/training-eval-pipeline
