Skip to content

Tracking Upgrade to Datafusion 38 #690

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Closed
3 tasks done
Michael-J-Ward opened this issue May 13, 2024 · 1 comment · Fixed by #691
Closed
3 tasks done

Tracking Upgrade to Datafusion 38 #690

Michael-J-Ward opened this issue May 13, 2024 · 1 comment · Fixed by #691

Comments

@Michael-J-Ward
Copy link
Contributor

Michael-J-Ward commented May 13, 2024

Upstream Changelog

Changes

Incomplete list.

Michael-J-Ward added a commit to Michael-J-Ward/datafusion-python that referenced this issue May 13, 2024
@Michael-J-Ward
Copy link
Contributor Author

I have a functioning upgrade PR open. Once again, the guiding "design" decision was "get to compiling and passing tests" as quickly as possible.

If I removed something that you'd prefer to see implemented as a workaround, please let me know.

#691

andygrove pushed a commit that referenced this issue May 14, 2024
* chore: upgrade datafusion Deps

Ref #690

* update concat and concat_ws to use datafusion_functions

Moved in apache/datafusion#10089

* feat: upgrade functions.rs

Upstream is continuing it's migration to UDFs.

Ref apache/datafusion#10098
Ref apache/datafusion#10372

* fix ScalarUDF import

* feat: remove deprecated suppors_filter_pushdown and impl supports_filters_pushdown

Deprecated function removed in apache/datafusion#9923

* use `unnest_columns_with_options` instead of deprecated `unnest_column_with_option`

* remove ScalarFunction wrappers

These relied on upstream BuiltinScalarFunction, which are now removed.

Ref apache/datafusion#10098

* update dataframe `test_describe`

`null_count` was fixed upstream.

Ref apache/datafusion#10260

* remove PyDFField and related methods

DFField was removed upstream.

Ref: apache/datafusion#9595

* bump `datafusion-python` package version to 38.0.0

* re-implement `PyExpr::column_name`

The previous implementation relied on `DFField` which was removed upstream.

Ref: apache/datafusion#9595
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant