This release includes a number of new features, improvements and bug fixes.
New Features
- Add HTTP streaming support for JetStream: Added functionality for streaming responses via HTTP, enhancing real-time data processing. (#877)
- SkyPilot tutorial: Created a tutorial to demonstrate using SkyPilot to launch batch workloads across regions. (#887)
- Define ephemeral-storage in ClusterQueue: Introduced support for workloads requesting ephemeral storage. (#872)
- Slurm on GKE Guide: Published a guide for deploying Slurm clusters on GKE for AI/ML workloads. (#864)
Improvements
- Benchmarking improvements:
- Ray job optimization: Explicitly shut down Ray jobs after completing vector embedding tasks to avoid unnecessary runtime. (#735)
- Update permissions for guides:
- Added storage.objectViewer permission to boot disk guide to resolve access issues. (#893)
- Update Jupyter Notebook image: Introduced a new tag prefix to mitigate internal vulnerability checks. (#886)
- Use TPU network optimizer image: Shifted to an image-based approach for network optimization, improving maintainability. (#870)
Bug Fixes
- Fix vLLM PodMonitoring: Addressed issues related to vLLM monitoring configurations. (#889)
- Fix jupyter hub helm chart version: Pinned the JupyterHub helm chart version to mitigate server spawning errors. (#879)
- TF version dependency mismatch: Resolved TensorFlow version mismatch issues. (#885)
- Ray TPU webhook image update: Bumped image version to fix inconsistent PodInformer updates for large TPU slices. (#891)
New Contributors
- @dsafdsa1 made their first contribution in #872
- @danielmarzini made their first contribution in #864
- @darinpeetz made their first contribution in #870
Full Changelog: v1.7...v1.8