Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Update GCP doc and scripts [skip ci] #5326

Merged
merged 5 commits into from
Apr 28, 2022

Conversation

viadea
Copy link
Collaborator

@viadea viadea commented Apr 27, 2022

Updated the GCP doc and scripts to reflect the most recent GCP init scripts to address #5282 .

  1. custom image script was out of date. Updated it based on latest image version and latest GCP init script. [I have tested the new custom image script and it worked fine.]
  2. changed "key value" to "key=value" for Dataproc creation scripts for every place to match GCP doc.
  3. Added some notes in notebook to mention how to download the input file.
  4. Some rewording based on @tgravescs 's drafted PR

Signed-off-by: Hao Zhu <hazhu@nvidia.com>
@viadea viadea added the documentation Improvements or additions to documentation label Apr 27, 2022
Signed-off-by: Hao Zhu <hazhu@nvidia.com>
@nvliyuan
Copy link
Collaborator

verified “## Submit Spark jobs to a Dataproc Cluster Accelerated by GPUs” section, works fine

@@ -89,8 +93,11 @@ If you'd like to further accelerate init time to 4-5 minutes, create a custom Da
To use notebooks with a Dataproc cluster, click on the cluster name under the Dataproc cluster tab
and navigate to the "Web Interfaces" tab. Under "Web Interfaces", click on the JupyterLab or
Jupyter link to start to use sample
[Mortgage ETL on GPU Jupyter Notebook](../demo/GCP/Mortgage-ETL-GPU.ipynb) to process full 17 years
[Mortgage data](https://docs.rapids.ai/datasets/mortgage-data).
[Mortgage ETL on GPU Jupyter Notebook](../demo/GCP/Mortgage-ETL-GPU.ipynb).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

how does hte ipynb file get to the notebook, is there an easy way to open it from the notebook directly or do you have to download and then reupload?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As per my test, only upload/download option. So I just added some words to let user firstly download and upload it in jupyter.

Signed-off-by: Hao Zhu <hazhu@nvidia.com>
viadea added 2 commits April 27, 2022 20:09
Signed-off-by: Hao Zhu <hazhu@nvidia.com>
Signed-off-by: Hao Zhu <hazhu@nvidia.com>
@nvliyuan nvliyuan self-requested a review April 28, 2022 05:47
@tgravescs tgravescs changed the title Update GCP doc and scripts Update GCP doc and scripts [skip ci] Apr 28, 2022
@tgravescs
Copy link
Collaborator

build

@viadea viadea merged commit c347b80 into NVIDIA:branch-22.06 Apr 28, 2022
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants