Amazon SageMaker Archives - Page 12 of 21

Reduce model deployment costs by 50% on average using the latest features of Amazon SageMaker | Amazon Web Services

As organizations deploy models to production, they are constantly looking for ways to optimize the performance of their foundation models (FMs) running on the latest accelerators, such as AWS Inferentia…

Minimize real-time inference latency by using Amazon SageMaker routing strategies | Amazon Web Services

Amazon SageMaker makes it straightforward to deploy machine learning (ML) models for real-time inference and offers a broad selection of ML instances spanning CPUs and accelerators such as AWS Inferentia.…

Build and evaluate machine learning models with advanced configurations using the SageMaker Canvas model leaderboard | Amazon Web Services

Amazon SageMaker Canvas is a no-code workspace that enables analysts and citizen data scientists to generate accurate machine learning (ML) predictions for their business needs. Starting today, SageMaker Canvas supports…

Accelerate data preparation for ML in Amazon SageMaker Canvas | Amazon Web Services

Data preparation is a crucial step in any machine learning (ML) workflow, yet it often involves tedious and time-consuming tasks. Amazon SageMaker Canvas now supports comprehensive data preparation capabilities powered…

Boost inference performance for LLMs with new Amazon SageMaker containers | Amazon Web Services

Today, Amazon SageMaker launches a new version (0.25.0) of Large Model Inference (LMI) Deep Learning Containers (DLCs) and adds support for NVIDIA’s TensorRT-LLM Library. With these upgrades, you can effortlessly…

Simplify data prep for generative AI with Amazon SageMaker Data Wrangler | Amazon Web Services

Generative artificial intelligence (generative AI) models have demonstrated impressive capabilities in generating high-quality text, images, and other content. However, these models require massive amounts of clean, structured training data to…

Optimizing costs for Amazon SageMaker Canvas with automatic shutdown of idle apps | Amazon Web Services

Amazon SageMaker Canvas is a rich, no-code Machine Learning (ML) and Generative AI workspace that has allowed customers all over the world to more easily adopt ML technologies to solve…

Build well-architected IDP solutions with a custom lens – Part 1: Operational excellence | Amazon Web Services

The IDP Well-Architected Lens is intended for all AWS customers who use AWS to run intelligent document processing (IDP) solutions and are searching for guidance on how to build secure,…

How Amazon Music uses SageMaker with NVIDIA to optimize ML training and inference performance and cost | Amazon Web Services

In the dynamic world of streaming on Amazon Music, every search for a song, podcast, or playlist holds a story, a mood, or a flood of emotions waiting to be…

Retrieval-Augmented Generation with LangChain, Amazon SageMaker JumpStart, and MongoDB Atlas semantic search | Amazon Web Services

Generative AI models have the potential to revolutionize enterprise operations, but businesses must carefully consider how to harness their power while overcoming challenges such as safeguarding data and ensuring the…