Supercharge your LLM performance with Amazon SageMaker Large Model Inference container v15
Today, we’re excited to announce the launch of Amazon SageMaker Large Model Inference (LMI) container v15, powered by vLLM 0.8.4 with support for the vLLM V1 engine. This release introduces…