bug: HDFS Storage flush error #9147

Closed
yufansong opened this issue Apr 12, 2023 · 7 comments

@yufansong
Contributor

Describe the bug

When HDFS is used as the object storage backend, Hummock fails while flushing data. Some e2e tests also fail when HDFS is selected as the storage backend.
image

In my environment, flush and the e2e tests both pass when the cluster is launched with ./risedev d. After enabling the HDFS feature, rebuilding, and launching with ./risedev d hdfs, flush and the e2e tests hit the problem above.

When I check HDFS, some data has already been stored successfully.
image

To Reproduce

  1. Use the branch wcy/push_for_yufan which already enables the hdfs feature.
  2. Launch an EC2 machine, build the branch, and then run ./risedev d hdfs.
  3. Connect to the database, write data, and flush it.

Expected behavior

No response

Additional context

No response

@yufansong yufansong added the type/bug Something isn't working label Apr 12, 2023
@github-actions github-actions bot added this to the release-0.19 milestone Apr 12, 2023
@wcy-fdu
Contributor

wcy-fdu commented Apr 13, 2023

I believe this is an HDFS configuration issue; I will check it later.

@Xuanwo
Contributor

Xuanwo commented Apr 13, 2023

Hi, can this issue be reproduced consistently or is it random?


Just a quick note, OpenDAL has built-in support for retrying operations in case of temporary errors. You can find more information about it at this link: https://docs.rs/opendal/latest/opendal/layers/struct.RetryLayer.html.
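
To illustrate the general idea of retrying temporary errors, here is a minimal standard-library sketch. It is not OpenDAL's RetryLayer implementation; `StorageError`, its `temporary` flag, and `retry_with_backoff` are hypothetical names chosen for this example, and the doubling delay is an assumed policy.

```rust
use std::thread::sleep;
use std::time::Duration;

/// Hypothetical error type: `temporary` marks failures worth retrying.
#[derive(Debug)]
struct StorageError {
    temporary: bool,
    message: String,
}

/// Runs `op` at least once and at most `max_attempts` times, doubling the
/// delay after each temporary failure. Permanent errors return immediately.
fn retry_with_backoff<T>(
    max_attempts: u32,
    initial_delay: Duration,
    mut op: impl FnMut() -> Result<T, StorageError>,
) -> Result<T, StorageError> {
    let mut delay = initial_delay;
    let mut attempt = 1;
    loop {
        match op() {
            Ok(value) => return Ok(value),
            Err(err) if err.temporary && attempt < max_attempts => {
                eprintln!("attempt {attempt} failed: {}; retrying", err.message);
                sleep(delay);
                delay *= 2;
                attempt += 1;
            }
            // Permanent error, or the attempt budget is exhausted.
            Err(err) => return Err(err),
        }
    }
}

fn main() {
    // Simulated flush that fails twice with a temporary error, then succeeds.
    let mut calls = 0;
    let result = retry_with_backoff(5, Duration::from_millis(10), || {
        calls += 1;
        if calls < 3 {
            Err(StorageError {
                temporary: true,
                message: "flush timed out".into(),
            })
        } else {
            Ok(calls)
        }
    });
    println!("{result:?}");
}
```

A retry wrapper like this only helps when the failure really is transient; a consistently reproducible error would simply exhaust the attempts and surface the last error.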

@wcy-fdu
Contributor

wcy-fdu commented Apr 24, 2023

This may be because the HDFS classpath is incorrect.

loadFileSystems error:
(unable to get stack trace for java.lang.NoClassDefFoundError exception: ExceptionUtils::getStackTrace error.)
hdfsBuilderConnect(forceNewInstance=0, nn=127.0.0.1:9000, port=0, kerbTicketCachePath=(NULL), userName=(NULL)) error:
(unable to get stack trace for java.lang.NoClassDefFoundError exception: ExceptionUtils::getStackTrace error.)
thread 'risingwave-main' panicked at 'called `Result::unwrap()` on an `Err` value: Unexpected (temporary) at  => uncategorized error, source: Unknown error 255 (os error 255)
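
A NoClassDefFoundError raised through libhdfs commonly means the JVM could not find the Hadoop classes on CLASSPATH. As a rough preflight sketch, the check below inspects the CLASSPATH environment variable before any HDFS connection is attempted. The helper name `classpath_mentions_hadoop` is hypothetical, the colon separator assumes a Unix-like host, and matching on the substring "hadoop" is only a heuristic, not a guarantee that the right jars are present.

```rust
use std::env;

/// Heuristic check (hypothetical helper): does a colon-separated classpath
/// contain at least one non-empty entry whose path mentions "hadoop"?
fn classpath_mentions_hadoop(classpath: &str) -> bool {
    classpath
        .split(':')
        .filter(|entry| !entry.is_empty())
        .any(|entry| entry.to_ascii_lowercase().contains("hadoop"))
}

fn main() {
    // `env::var` fails if the variable is missing or not valid Unicode.
    match env::var("CLASSPATH") {
        Ok(cp) if classpath_mentions_hadoop(&cp) => {
            println!("CLASSPATH contains Hadoop-looking entries")
        }
        Ok(_) => eprintln!("CLASSPATH is set but has no Hadoop-looking entries"),
        Err(_) => eprintln!("CLASSPATH is not set or is not valid Unicode"),
    }
}
```

Running a check like this at startup would let the process report a clear configuration message instead of panicking on `Result::unwrap()` deep inside the storage layer.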

@yufansong
Contributor Author

I think in the test case I mentioned, the database has already flushed part of the data into the system, which means the program should have already found the correct HDFS classpath.

@Xuanwo
Contributor

Xuanwo commented Apr 25, 2023

I think the problem has been resolved with apache/opendal#2114. Kindly upgrade to opendal v0.33.1 and try again.

@wcy-fdu
Contributor

wcy-fdu commented Apr 25, 2023

> I think the problem has been resolved with apache/incubator-opendal#2114. Kindly upgrade to opendal v0.33.1 and try again.

Thanks, let me try it later.

@wcy-fdu
Contributor

wcy-fdu commented Apr 25, 2023

After a quick verification, I think HDFS works now. I will recheck this later. Thanks for your work~ @Xuanwo
image

@wcy-fdu wcy-fdu closed this as completed Apr 26, 2023