Native HDFS Python bindings based on hdfs-native Rust package.
pip install hdfs-native
from hdfs_native import Client
client = Client("hdfs://localhost:9000")
status = client.get_file_info("/file.txt")
Kerberos (SASL GSSAPI) is supported through a runtime dynamic link to libgssapi_krb5
. This must be installed separately, but is likely already installed on your system. If not you can install it by:
apt-get install libgssapi-krb5-2
yum install krb5-libs
brew install krb5
The same requirements apply as the Rust tests, requiring Java, Maven, Hadoop, and Kerberos tools to be on your path. Then you can:
python3 -m venv .venv
source .venv/bin/activate
pip3 install maturin
maturin develop -E devel
pytest