Skip to content

Upgrade datafusion 39 #728

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Merged
merged 23 commits into from
Jun 14, 2024
Merged

Conversation

Michael-J-Ward
Copy link
Contributor

@Michael-J-Ward Michael-J-Ward commented Jun 13, 2024

Which issue does this PR close?

Part of #727
Closes #670
Closes #677
Closes #665

Note: this performs the bulk of the upgrade, but see #727 for things to clean up before releasing.

What changes are included in this PR?

Datafusion, pyo3, and object_store deps are upgraded.

Are there any user-facing changes?

Upstream datafusion continues migrating from built-in functions to UDFs, with the focus now moving to aggregate functions.

pyarrow had a bug in its IntervalDayTime and IntervalMonthDayNano definitions, so if users had hacked around that like we did in our tpch examples, things will change unexpectedly. Refer to #665 for details.

…0.10.1

`datafusion-common` also depends on `pyo3`, so they need to be upgraded together.
datafusion replaced Expr::GetIndexField with a FieldAccessor trait.

Ref apache/datafusion#10568
Ref apache/datafusion#10769
The field `func_name` was changed to `func` as part of removing `ScalarFunctionDefinition` upstream.

Ref apache/datafusion#10325
This silences pyo3's deprecation warnings for its new Bounds api.

It's the 1st step of the migration, and should be removed before merge.

Ref https://pyo3.rs/v0.21.0/migration#from-020-to-021
AggregateUDF expressions now have a builder API design, which removes arguments like filter and order_by

Ref apache/datafusion#10545
Ref apache/datafusion#10492
@Michael-J-Ward Michael-J-Ward marked this pull request as ready for review June 14, 2024 16:59
Copy link
Member

@andygrove andygrove left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks @Michael-J-Ward!

@andygrove andygrove merged commit b5446ef into apache:main Jun 14, 2024
22 checks passed
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
2 participants