AWS Inferentia and AWS Trainium deliver lowest cost to deploy Llama 3 models in Amazon SageMaker JumpStart | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2566191 Time Stamp: May 2, 2024
Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2564067 Time Stamp: May 1, 2024
Develop and train large models cost-efficiently with Metaflow and AWS Trainium | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2561992 Time Stamp: Apr 29, 2024
Open source observability for AWS Inferentia nodes within Amazon EKS clusters | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2549583 Time Stamp: Apr 17, 2024
Gradient makes LLM benchmarking cost-effective and effortless with AWS Inferentia | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2537220 Time Stamp: Apr 2, 2024
Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2446410 Time Stamp: Jan 17, 2024
Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2 | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2419443 Time Stamp: Dec 13, 2023
Welcome to a New Era of Building in the Cloud with Generative AI on AWS | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2405905 Time Stamp: Nov 30, 2023
Intuitivo achieves higher throughput while saving on AI/ML costs using AWS Inferentia and PyTorch | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2350881 Time Stamp: Oct 26, 2023
Retrieval-Augmented Generation & RAG Workflows Source Cluster: AI & Machine Learning Source Node: 2347553 Time Stamp: Oct 24, 2023
Optimize generative AI workloads for environmental sustainability | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2286678 Time Stamp: Sep 21, 2023
Machine learning with decentralized training data using federated learning on Amazon SageMaker | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2228239 Time Stamp: Aug 22, 2023
Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2184201 Time Stamp: Jul 24, 2023
Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2141205 Time Stamp: Jun 20, 2023
AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2134532 Time Stamp: Jun 13, 2023
Scale your machine learning workloads on Amazon ECS powered by AWS Trainium instances | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2115409 Time Stamp: May 31, 2023
Achieve high performance with lowest cost for generative AI inference using AWS Inferentia2 and AWS Trainium on Amazon SageMaker Source Cluster: AWS Machine Learning Source Node: 2082738 Time Stamp: May 4, 2023
How to extend the functionality of AWS Trainium with custom operators Source Cluster: AWS Machine Learning Source Node: 2074548 Time Stamp: Apr 27, 2023
Deploy large models at high performance using FasterTransformer on Amazon SageMaker Source Cluster: AWS Machine Learning Source Node: 2062551 Time Stamp: Apr 17, 2023
Announcing New Tools for Building with Generative AI on AWS Source Cluster: AWS Machine Learning Source Node: 2056844 Time Stamp: Apr 13, 2023