Fuzzer #1389: ARG_XXX Decimal Casts #10742

hawkfish · 2024-02-19T07:02:53Z

Instead of trying to track the combinatorial explosion of ordering types for DECIMAL arguments, just return a function that will force the binder to inject casts as if there were no custom bind function.

carlopi · 2024-02-19T09:06:10Z

I was trying this out, bumped against this problem (that is independent, so can be addressed separately):

CREATE TABLE T (z HUGEINT);
insert into t values (-168123123123200005565479978461862821890);
insert into t values (-168123123123200005565479978461862821889);
insert into t values (-168123123123200005565479978461862821888);
insert into t values (-168123123123200005565479978461862821893);
SELECT min(z) - arg_min(z,z) FROM t;
-3

Connected to this, are casts performed by this PR always information-preserving?

Mytherin · 2024-02-19T11:50:56Z

Yes casting to double is lossy for numerics - it would be better to add the hugeint option to the switch. Good catch @carlopi

The cast from HUGEINT to DOUBLE is lossy and can generate incorrect results. fixes: duckdblabs-duckdb-internal#1294

hawkfish · 2024-02-19T17:43:47Z

Combined both into this PR since it was small and @carlopi had such a nice test!

carlopi

I think in a nicer world GetArgMinMaxFunctionBy should never throw and have some other way to communicate it can't handle a type (given previously we were building functions that would only end up throwing), and after this PR also potentially (say with some more changes in).

Also, I am curious how we can guarantee that a Cast preserves information, I am not sure if we have that information (or maybe it's implicitly guaranteed?).

BUT, both problems were there before, and this PR is a clear improvement, so I think it might be just fine to merge this (after adding back at least INT16).

carlopi · 2024-02-23T08:20:28Z

src/core_functions/aggregate/distributive/arg_min_max.cpp

-	case PhysicalType::INT8:
-		return GetArgMinMaxFunctionInternal<OP, ARG_TYPE, int8_t>(by_type, type);
-	case PhysicalType::INT16:
-		return GetArgMinMaxFunctionInternal<OP, ARG_TYPE, int16_t>(by_type, type);
 	case PhysicalType::INT32:
 		return GetArgMinMaxFunctionInternal<OP, ARG_TYPE, int32_t>(by_type, type);
 	case PhysicalType::INT64:
 		return GetArgMinMaxFunctionInternal<OP, ARG_TYPE, int64_t>(by_type, type);
-	case PhysicalType::FLOAT:
-		return GetArgMinMaxFunctionInternal<OP, ARG_TYPE, float>(by_type, type);


I would avoid removing INT16 (in particular) since it's used by GetDecimalArgMinMaxFunction. Also INT8 and FLOAT probably are fine staying.

I removed them to reduce the code footprint back to what it was before I made the fix.

The impact on runtime memory is minimal because we only store one value. There is some casting overhead but that is the same as for non-DECIMAL types.

Right, forgot it was in the connected PR. All cool then.

Mytherin · 2024-02-26T12:35:50Z

Thanks! LGTM

Merge pull request duckdb/duckdb#10742 from hawkfish/fuzzer-decimal-order Merge pull request duckdb/duckdb#10832 from krlmlr/f-setup-python-v5

Fuzzer duckdb#1389: ARG_XXX Decimal Casts

f0e134f

Instead of trying to track the combinatorial explosion of ordering types for DECIMAL arguments, just return a function that will force the binder to inject casts as if there were no custom bind function.

hawkfish requested review from Mytherin and carlopi February 19, 2024 07:03

hawkfish added the Ready For Review label Feb 19, 2024

Mytherin added Changes Requested and removed Ready For Review labels Feb 19, 2024

hawkfish added 2 commits February 20, 2024 06:15

Merge branch 'main' into fuzzer-decimal-order

0197224

Internal duckdb#1294: ARG_XXX By HUGEINT

67a3f1b

The cast from HUGEINT to DOUBLE is lossy and can generate incorrect results. fixes: duckdblabs-duckdb-internal#1294

hawkfish added Ready For Review and removed Changes Requested labels Feb 19, 2024

github-actions bot marked this pull request as draft February 19, 2024 17:43

hawkfish marked this pull request as ready for review February 21, 2024 19:17

carlopi suggested changes Feb 23, 2024

View reviewed changes

carlopi added Changes Requested Ready To Merge and removed Ready For Review Changes Requested labels Feb 23, 2024

Mytherin merged commit 26696a1 into duckdb:main Feb 26, 2024

hawkfish deleted the fuzzer-decimal-order branch February 27, 2024 02:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fuzzer #1389: ARG_XXX Decimal Casts #10742

Fuzzer #1389: ARG_XXX Decimal Casts #10742

Uh oh!

hawkfish commented Feb 19, 2024

Uh oh!

carlopi commented Feb 19, 2024

Uh oh!

Mytherin commented Feb 19, 2024

Uh oh!

hawkfish commented Feb 19, 2024

Uh oh!

carlopi left a comment

Uh oh!

carlopi Feb 23, 2024

Uh oh!

hawkfish Feb 23, 2024

Uh oh!

hawkfish Feb 23, 2024

Uh oh!

carlopi Feb 24, 2024

Uh oh!

Mytherin commented Feb 26, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fuzzer #1389: ARG_XXX Decimal Casts #10742

Fuzzer #1389: ARG_XXX Decimal Casts #10742

Uh oh!

Conversation

hawkfish commented Feb 19, 2024

Uh oh!

carlopi commented Feb 19, 2024

Uh oh!

Mytherin commented Feb 19, 2024

Uh oh!

hawkfish commented Feb 19, 2024

Uh oh!

carlopi left a comment

Choose a reason for hiding this comment

Uh oh!

carlopi Feb 23, 2024

Choose a reason for hiding this comment

Uh oh!

hawkfish Feb 23, 2024

Choose a reason for hiding this comment

Uh oh!

hawkfish Feb 23, 2024

Choose a reason for hiding this comment

Uh oh!

carlopi Feb 24, 2024

Choose a reason for hiding this comment

Uh oh!

Mytherin commented Feb 26, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants