AWS AI Managed Services

In this section, we're going to see a lot more AWS AI managed services.
So why do we want them?
These services are pre-trained machine learning services that are geared towards very specific use cases. For example, we've seen that we have Amazon Bedrock to do GenAI, and we have even seen higher level GenAI services, such as Amazon Q Business and Amazon Q Developer. We'll have a look soon at SageMaker, but you may want to do other things than GenAI, and so there are lots of services that we'll learn about in this section.

alt text

AWS AI Service Categories

Text and Document Processing

Amazon Comprehend - Process text
Amazon Translate - Language translation
Amazon Textract - Document processing

Vision Services

Amazon Rekognition - Image and video analysis

Search and Communication

Amazon Kendra - Intelligent search
Amazon Lex - Chatbot creation

Speech Services

Amazon Polly - Text-to-speech
Amazon Transcribe - Speech-to-text

Personalization

Amazon Personalize - Recommendation engine

Complete Machine Learning Platform

Amazon SageMaker - Comprehensive ML service (a huge service in AWS)

alt text

Why Use AWS AI Managed Services?

You can do everything on your own computer or on your own server in the cloud, but you may want to use these services for several key reasons:

Responsiveness and Availability

Available in many different regions

Redundancy and Regional Coverage

Always available with built-in redundancy
Deployed across multiple Availability Zones
- Meaning that if there is a failure in the cloud, then these services may still work

Performance Optimization

Specialized CPUs and GPUs embedded in these services
Optimized for best cost savings for your use case

Cost-Effective Pricing

Most services use token-based pricing
- Meaning that you Pay only for what you use
- Because you No need to over-provision servers for your use case

Provisioned throughput

Option for provisioned throughput on some services
These are for predictable workloads that provides more cost savings
And Delivers more predictable performance

What are Predictable Workloads?
Predictable workloads are applications with consistent, large-scale usage patterns that need guaranteed throughput and performance

Exam Perspective

AWS will want you to know about these services from an exam perspective, and this is what we're going to explore in this section.

Keyboard shortcuts

AWS AI Practitioner Study Guide