ClassNotFoundException:HoodieInputFormat #581

Closed
daikon12 opened this issue Feb 21, 2019 · 16 comments

@daikon12

Hi, when I followed the steps in the quickstart (https://uber.github.io/hudi/quickstart.html), I ran into a problem:
[image]
I found that HoodieInputFormat.class belongs to this jar:
[image]
So I added this dependency in run_sync_tool.sh, but it did not work.

Is there a good solution to this problem? Thank you very much.

@vinothchandar
Member

Hi, is this from running the sync tool? Can you shed some light on your setup?

@vinothchandar
Member

@daikon12 do you still have this issue?

@daikon12
Author

> @daikon12 do you still have this issue?

Sorry for the late reply. Yes, it still does not work. Could you offer some ideas?

@vinothchandar
Member

It cannot find HoodieInputFormat, which is contained in the hoodie-hadoop-mr-bundle. Did you add the hoodie-hadoop-mr-bundle jar to your Hive server?
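
One way to check which jars in a directory actually contain the class is sketched below. This is a minimal sketch, not part of the Hudi tooling: the function name `find_class_jars` is hypothetical, and it assumes the `unzip` utility is installed (a jar is a zip archive).

```shell
#!/bin/sh
# Print every jar in a directory whose listing mentions a given class file.
# Usage: find_class_jars <lib-dir> <class-file-name>
# Returns 0 if at least one jar matched, 1 otherwise.
# NOTE: hypothetical helper for illustration; assumes `unzip` is available.
find_class_jars() {
  fcj_dir="$1"
  fcj_class="$2"
  fcj_status=1
  for fcj_jar in "$fcj_dir"/*.jar; do
    # Skip the unexpanded glob when the directory holds no jars.
    [ -e "$fcj_jar" ] || continue
    # `unzip -l` prints the archive listing; grep -F matches the name literally.
    if unzip -l "$fcj_jar" 2>/dev/null | grep -qF -- "$fcj_class"; then
      echo "$fcj_jar"
      fcj_status=0
    fi
  done
  return "$fcj_status"
}
```

For example, `find_class_jars "$HIVE_HOME/lib" HoodieInputFormat.class` would list the matching jars, if any; which directory the Hive server actually loads jars from depends on the installation and is an assumption here.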

We can also move this discussion to the dev mailing list, so we can respond more quickly:
http://hudi.incubator.apache.org/community.html

@daikon12
Author

@vinothchandar thanks for the idea.
But I imported the package manually. The detailed commands are shown below:
[image]
But it doesn't seem to work.

@vinothchandar
Member

I think we removed the suffix in 0.4.5.

All you need is the hive bundle jar. It has the mr stuff internally.

$ ls -l packaging/hoodie-hive-bundle/target/
total 101144
drwxr-xr-x  3 vinoth  xxxx        96 Mar 15 11:18 classes
drwxr-xr-x  3 vinoth  xxxx        96 Mar 15 11:18 generated-sources
-rw-r--r--  1 vinoth  xxxx  50725597 Mar 15 11:18 hoodie-hive-bundle-0.4.6-SNAPSHOT.jar
drwxr-xr-x  3 vinoth  xxxx        96 Mar 15 11:18 maven-archiver
drwxr-xr-x  3 vinoth  xxxx        96 Mar 15 11:18 maven-status
-rw-r--r--  1 vinoth  xxxx      4220 Mar 15 11:18 original-hoodie-hive-bundle-0.4.6-SNAPSHOT.jar
-rw-r--r--  1 vinoth  xxxx       971 Mar 15 11:18 rat.txt

I will double-check and update the docs.

Can you try running with just the packaging/hoodie-hive-bundle/target/hoodie-hive-bundle-0.4.6-SNAPSHOT.jar off master?

@vinothchandar
Member

Btw, you can use the https://github.com/apache/incubator-hudi/blob/master/hoodie-hive/run_sync_tool.sh tool to run it instead of doing it by hand?

@daikon12
Author

> Btw, you can use the https://github.com/apache/incubator-hudi/blob/master/hoodie-hive/run_sync_tool.sh tool to run it instead of doing it by hand?

When I run this script, the program output looks like this:
[image]

Maybe there is a colon missing.

@daikon12
Author

> Btw, you can use the https://github.com/apache/incubator-hudi/blob/master/hoodie-hive/run_sync_tool.sh tool to run it instead of doing it by hand?

So I ran it manually again, and it worked fine until I got the following error:
[image]
I need to import many such jars manually.

@vinothchandar
Member

You mean run_sync_tool.sh gives you this error? That's odd. Are the Hive jars themselves there? antlr should be provided by Hive, I think.
@bvaradar, could you take a look as well?

@bvaradar
Contributor

@daikon12:

The run_hive_sync tool failed because multiple standalone JDBC jars were found in your Hive installation. We have not seen this case in our installations. I have created a PR (#609) to handle this case. Can you please apply this patch, try it out, and let us know if it solves the issue?

Thanks,
Balaji.V
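
One plausible reading of the failure discussed above (a pattern that matches more than one jar, leaving classpath entries without a `:` between them) can be sketched as follows. This is not the patch from #609, whose contents are not shown in this thread; the function name `join_jars` and the jar file names in the usage line are hypothetical.

```shell
#!/bin/sh
# Join every file matching a glob pattern into one ':'-separated string,
# so that several matches still form a valid classpath fragment.
# Usage: join_jars '<glob-pattern>'   (quote the pattern; it is expanded here)
# NOTE: hypothetical helper; paths containing whitespace are not handled.
join_jars() {
  jj_out=""
  # $1 is deliberately unquoted so the shell expands the glob.
  for jj_f in $1; do
    # Skip the literal pattern when nothing matched.
    [ -e "$jj_f" ] || continue
    if [ -z "$jj_out" ]; then
      jj_out="$jj_f"
    else
      jj_out="$jj_out:$jj_f"
    fi
  done
  printf '%s\n' "$jj_out"
}
```

Called as `join_jars "$HIVE_HOME/lib/hive-jdbc-*-standalone.jar"`, this prints all matches joined by colons, or an empty line when there are none.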

@Achuth17

I faced the antlr recognition exception as well and fixed it by adding a few jars (like the antlr jar) manually. Just thinking out loud: why can't we pick all the jars from "{HIVE_HOME}/lib/" instead of selectively picking?

@bvaradar
Contributor

@Achuth17: This was done on purpose. There are a lot of different Hadoop/Hive environments out there. With Hudi having to interoperate across different query engines, it was practical to choose a set of Hadoop/Hive releases for which we can certify the different hoodie tools/jars. As part of this certification, we found that blindly picking all jars results in package conflicts with the ones present in the installations.
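
The selective approach described here can be illustrated with a minimal sketch that builds a classpath from an explicit allow-list of jar name patterns instead of taking every jar in the directory. The function name `select_jars` and the patterns in the usage line are hypothetical and are not the list the actual script uses.

```shell
#!/bin/sh
# Build a ':'-separated classpath from an allow-list of jar name patterns.
# Usage: select_jars <lib-dir> '<pattern>'...
# NOTE: hypothetical helper; jars not matching any pattern are left out.
select_jars() {
  sj_dir="$1"
  shift
  sj_cp=""
  for sj_pat in "$@"; do
    # The directory is quoted; the pattern is left unquoted so it globs.
    for sj_jar in "$sj_dir"/$sj_pat; do
      [ -e "$sj_jar" ] || continue
      # Prepend a ':' separator only when the classpath is non-empty.
      sj_cp="${sj_cp:+$sj_cp:}$sj_jar"
    done
  done
  printf '%s\n' "$sj_cp"
}
```

For instance, `select_jars "$HIVE_HOME/lib" 'hive-exec-*.jar' 'antlr-runtime-*.jar'` would include only those two families of jars, which keeps unrelated libraries in the same directory off the classpath.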

@bvaradar
Contributor

@daikon12: Is the run_hive_sync tool working fine now?

@daikon12
Author

> @daikon12 : Is run_hive_sync tool working fine now ?

Thank you @vinothchandar @bvaradar, you inspired me. This problem is probably caused by my versions of Hadoop and Hive. Right now I only have one client, installed via parcels. I will try another version later to see if it works.

bvaradar self-assigned this Apr 9, 2019
@bvaradar
Contributor

Closing due to inactivity.
