Skip to content

v1.8

Latest
Compare
Choose a tag to compare
@AlexBulankou AlexBulankou released this 06 Dec 15:26
· 12 commits to main since this release
412621f

This release includes a number of new features, improvements and bug fixes.

New Features

  • Add HTTP streaming support for JetStream: Added functionality for streaming responses via HTTP, enhancing real-time data processing. (#877)
  • SkyPilot tutorial: Created a tutorial to demonstrate using SkyPilot to launch batch workloads across regions. (#887)
  • Define ephemeral-storage in ClusterQueue: Introduced support for workloads requesting ephemeral storage. (#872)
  • Slurm on GKE Guide: Published a guide for deploying Slurm clusters on GKE for AI/ML workloads. (#864)

Improvements

  • Benchmarking improvements:
    • Avoid repeating identical prompts by making prompts reusable for large datasets. (#880)
    • Added --output-bucket flag for uploading benchmark results to Google Cloud Storage. (#871)
  • Ray job optimization: Explicitly shut down Ray jobs after completing vector embedding tasks to avoid unnecessary runtime. (#735)
  • Update permissions for guides:
    • Added storage.objectViewer permission to boot disk guide to resolve access issues. (#893)
  • Update Jupyter Notebook image: Introduced a new tag prefix to mitigate internal vulnerability checks. (#886)
  • Use TPU network optimizer image: Shifted to an image-based approach for network optimization, improving maintainability. (#870)

Bug Fixes

  • Fix vLLM PodMonitoring: Addressed issues related to vLLM monitoring configurations. (#889)
  • Fix jupyter hub helm chart version: Pinned the JupyterHub helm chart version to mitigate server spawning errors. (#879)
  • TF version dependency mismatch: Resolved TensorFlow version mismatch issues. (#885)
  • Ray TPU webhook image update: Bumped image version to fix inconsistent PodInformer updates for large TPU slices. (#891)

New Contributors

Full Changelog: v1.7...v1.8