Update Release Notes for v1.0.8a–v1.0.8e #636
Open
Included updates:
v1.0.8e (Aug 20, 2025): Added cached support for OpenAI open-weight models (gpt-oss-120b, gpt-oss-20b).
v1.0.8d (Aug 4, 2025): Streaming endpoint integration, Granite Speech 3.3 (speech-to-text), Granite Timeseries TTM R1, policy verification CLI tools, and Granite 4.0 preview.
v1.0.8c (Jul 24, 2025): Expanded to OCI Gov and ONSR regions; added cached model support for Llama 3.2 and Llama 4.
v1.0.8b (Jul 21, 2025): Bring Your Own Reservation (BYOR) UI enhancement for GPU reservations.
v1.0.8a (Jun 26, 2025): Added cached support for IBM Granite 3 models (Instruct, Vision, and Embedding variants).