Improve unit conversion error reporting by PhilMiller · Pull Request #959 · NOAA-OWP/ngen

PhilMiller · 2026-04-28T00:01:59Z

Improve how errors in unit conversion are tracked and reported. Each particular mis-match between distinct models and their variables will be logged exactly once. The reports have comprehensive detail allowing for straightforward correction of the underlying modules or configuration.

Additions

Define and use a unit_conversion_exception type ("UCE") when UDUNITS fails to apply a conversion, storing the provider and variable details
Catch UCEs at the requester call site, and log the corresponding requester details along with those of the provider
Keep a set of the already-reported errors so that they don't get repeated

Changes

Report UCEs from BMI modules, CSV forcings, and NetCDF forcings
Tests and test models have units corrected to match up
Separate model update() from the existing get_response(), which is now specifically used as getting the effective depth of surface runoff
Normalization of how units of dimensionless quantities are spelled
Explicitly test CSV forcing reader's handling of when it can't parse units from the field headers
Lots of preprocessor include cleanups resulting from refactoring

Testing

All integrated tests pass, include some that have been revised and expanded

Notes

This has been cherry-picked from the NGWPC codebase with subsequent conflict fixups, and then substantial revisions to address reviewer comments

Checklist

…ce data_access to support CSV and NetCDF reporting

…than printing locally

…used in Bmi_Multi_Formulation testing

…rather than printing locally

… test

…ic units in tests, and disable now-throwing non-conversion cases

…eporting

hellkite500

While I support the general idea behind this, I have some design and implementation concerns that may require a bit of dialogue and/or refactoring to get through.

PhilMiller · 2026-04-29T17:02:10Z

All useful feedback. I'll try to address it shortly.

robertbartel

FWIW, I'm not fully done with review - still trying to make sure I have my head wrapped around how this executes, especially any differences with scalar versus vector values. But I did want to go ahead and make a few initial comments.

The only clear thing at this point that I can say needs changing is that the BMIconventions.md document needs to be updated to reflect these behavior changes, especially related to unitless and none-ish conventions.

A couple other early impressions that aren’t fully formed yet, but might stimulate some useful discussion …

This feels a little brittle. In fairness, that’s probably got a lot to due with the larger design of things around it, although this doesn’t necessarily disqualify the observation from relevance.
Throwing an(other) exception on every variable read with units issues seems like it could get a little computationally expensive. Cheaper than the IO overhead of cerr-ing every time a warning condition is encountered, but that could be turned off. It doesn’t look like this exception-based approach can be.

PhilMiller · 2026-04-29T20:35:50Z

My experience with these changes on the NGWPC side is that they ultimately pushed the overall system toward an overall more robust situation, but there was a lot of reconciliation of these warning messages along the way. There's a follow-on PR coming that also optionally specifies the desired units of output variables named in the realization config, which would avoid modeling misinterpretations and a current inconsistency between mosaiced formulations for different catchments that output the same variables but in different units. Maybe I'll put that up as a PR against this branch, so we can see and evaluate it in context.

PhilMiller · 2026-04-29T20:36:59Z

Regarding the exceptions concern, the actual exception throw is generally not expected to be much more expensive than a return. We are generating a bunch of strings in that path, though, which I can see being more of a concern. I think that can mostly be squeezed out, which I'll try to do.

Hydrologic simulation in NGen is driven by Layer calling catchment_formulation->get_response() on each catchment's formulation instance in turn. The get_response() method served two related but distinct roles: 1. Advancing the simulation by one step within a formulation or one element thereof 2. Calculating and returning the hydrologic response from that step Within a Bmi_Multi_Formulation, the latter responsibility was only applicable to modules actually providing the runoff variable, and not others that generated inputs to that module or ancillary output variables. However, get_response() was the means to advance them in time, and their primary output variable had to be queried accordingly. Advancing in time is now split out to an update() member function(). By making that split, get_response() can be updated to consistently return results in the units of "m" (depth) expected by the caller in Layer::update_models(). Without the associated changes, every other module would incur spurious unit conversion errors at every time step, when asked for some arbitrary 'main output variable' via get_response() that may be in other units.

PhilMiller · 2026-04-30T01:11:00Z

I haven't directly addressed the review comments yet, but I did push an additional commit that better applies and motivates these changes. That change could potentially be made on its own, but it kinda comes with the rest of what's here, so that's how I'm presenting it at the moment. If asked, I could propose that in more isolation instead or in advance.

PhilMiller · 2026-04-30T18:51:14Z

On the string passing overheads in the error path, I think I'm going to push back a little bit on trying to change that right away. I agree it could become a burden. However, a 'healthy' simulation run that's not incurring unit mismatches will never incur that burden. For pre-production (benchmarking, calibration, regionalization, retrospective, etc) and operational use cases, where we're most concerned about performance, we should have hammered out those issues before running. For a scientifically meaningful run, I really don't think there's an excuse for accepting unit mismatches. If we need some targeted adapters for (e.g.) precip_rate mixing up mass vs volume, we should build those and push forward.

More broadly, we need to profile and see if/where passing around so many strings actually does cost ngen performance, and start grinding those away.

…nitsHelper

…olidated into UnitsHelper

PhilMiller and others added 25 commits April 27, 2026 23:32

Improve unit conversion error logging

f203ed5

Print model name in unit conversion error message

1872114

Move unit conversion error instrumentation up to Bmi_Formulation

78ef24e

Don't error out every Bmi_Multi_Formulation

123a94c

Move unit conversion error instrumentation up to DataProvider/namespa…

36c80a8

…ce data_access to support CSV and NetCDF reporting

fixup - wrong name in Bmi_Module_Formulation thrown UCE

f13b365

CsvPerFeatureForcingProvider: Throw on unit conversion errors rather …

48b4d6f

…than printing locally

Give more and better structured information on unit conversion errors

160cc2e

Match up units of input and output variables for test_bmi_foo models …

39662bd

…used in Bmi_Multi_Formulation testing

Update Bmi_Cpp_Adapter_Test in correspondence to earlier changes

1b93ea8

NetCDFPerFeatureForcingDataProvider: Throw on unit conversion errors …

dbf5dbd

…rather than printing locally

CsvPerFeatureForcingProvider: add missing space in message

4f763f3

Fixup diff

3e9b2b7

Log about output variables not having any unit conversion applied

5681168

CSV Provider: Correct construction of model name string

62a73a1

Match BMI Fortran Adapter test to changed units for Multi_Formulation…

1ecfdfc

… test

CsvPerFeatureForcingProvider: Log variables and units, request specif…

aa505da

…ic units in tests, and disable now-throwing non-conversion cases

Remove catchment_id from string construction. Not available in method.

d8465d0

NetCDF Provider: Reformat constructor initialization

490476b

NetCDF Provider: Store file path for later use in logging and error r…

a10847e

…eporting

NetCDF Provider: Report file path in unit conversion exception

f9b8e78

Reword output var unit conversion warning

13a7cb4

Reword output var unit conversion warning in Bmi_Multi_Formulation.hpp

6c8ecf7

unit conversion error changes

fe7c170

fixup CSV logging

8fd532f

PhilMiller requested review from aaraney, hellkite500 and robertbartel April 28, 2026 00:01

fixup formulation outputs message - grab Carolyn's too

1ffa31f

PhilMiller force-pushed the PhilMiller/unit-conversion-error-reporting branch from 2bd6aee to 1ffa31f Compare April 28, 2026 00:07

PhilMiller marked this pull request as ready for review April 28, 2026 00:07

Add newline to logged unit conversion failure message

634324c

hellkite500 reviewed Apr 29, 2026

View reviewed changes

robertbartel requested changes Apr 29, 2026

View reviewed changes

Rename provider_bmi_var_name to provider_var_name per review comments

eaef52b

PhilMiller added 5 commits May 4, 2026 07:30

Move unit_conversion_exception down to UnitsHelper

e1d69fa

Partially consolidate unit-conversion error handling

c7d0c5b

Refactor unit normalization and short-circuiting

dc87c7b

Move unit conversion code out of header, and reduce header dependencies

54439fe

Add missing include that was uncovered by earlier changes

48d5ce0

PhilMiller force-pushed the PhilMiller/unit-conversion-error-reporting branch from 687c078 to a97c44a Compare May 4, 2026 17:04

Add another missing include that was uncovered by earlier changes

c55c433

PhilMiller force-pushed the PhilMiller/unit-conversion-error-reporting branch from a97c44a to c55c433 Compare May 4, 2026 17:08

PhilMiller added 9 commits May 4, 2026 10:26

Delete duplicative unit normalization that's been consolidated into U…

b2d86b6

…nitsHelper

Clean back out added blank lines

175ac33

Delete another bit of duplicative unit normalization that's been cons…

10043bb

…olidated into UnitsHelper

A couple more header cleanups

147fff6

Delete stray added blank lines

3208f7d

Break up CSV unit header parsing test

414f473

Improve some const safety

ca4883b

Shift output line generation from get_var_value_as_double to get_value

467ccb3

Stop exposing Bmi_Formulation::get_var_value_as_double publicly

6463138

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve unit conversion error reporting#959

Improve unit conversion error reporting#959
PhilMiller wants to merge 44 commits into
NOAA-OWP:masterfrom
PhilMiller:PhilMiller/unit-conversion-error-reporting

PhilMiller commented Apr 28, 2026 •

edited

Loading

Uh oh!

hellkite500 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

PhilMiller commented Apr 29, 2026

Uh oh!

robertbartel left a comment

Uh oh!

PhilMiller commented Apr 29, 2026

Uh oh!

PhilMiller commented Apr 29, 2026

Uh oh!

PhilMiller commented Apr 30, 2026

Uh oh!

PhilMiller commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

PhilMiller commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Additions

Changes

Testing

Notes

Checklist

Uh oh!

hellkite500 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

PhilMiller commented Apr 29, 2026

Uh oh!

robertbartel left a comment

Choose a reason for hiding this comment

Uh oh!

PhilMiller commented Apr 29, 2026

Uh oh!

PhilMiller commented Apr 29, 2026

Uh oh!

PhilMiller commented Apr 30, 2026

Uh oh!

PhilMiller commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

PhilMiller commented Apr 28, 2026 •

edited

Loading