awsAWS Bedrock

Serverless AI on AWS with Claude, Llama, Mistral and 13 foundation models

Enterprise AI with Claude, Llama, Mistral, and more on AWS infrastructure


Overview

Amazon Bedrock provides serverless access to foundation models from leading AI companies including Anthropic, Meta, Mistral, Cohere, and Amazon. Perfect for enterprise deployments requiring AWS integration, scalability, and compliance.

!!! danger "Inference Profile ARN Required" For Anthropic Claude models, you MUST use the full inference profile ARN, not simple model names. See configuration examples below for the correct format.

Key Benefits

  • 🤖 Multiple Models: Claude, Llama 3, Mistral, Titan, Command

  • 🏢 AWS Integration: IAM, VPC, CloudWatch, S3

  • 🌍 Global Regions: 10+ AWS regions

  • 🔒 Enterprise Security: PrivateLink, KMS encryption

  • 💰 Pay-per-use: No infrastructure costs

  • 📊 Serverless: Automatic scaling

  • 🛡️ Compliance: SOC 2, HIPAA, ISO 27001

Available Model Providers

Provider
Models
Best For

Anthropic

Claude 3.5 Sonnet, Claude 3 Opus/Haiku

Complex reasoning, coding

Meta

Llama 3.1 (8B, 70B, 405B)

Open source, cost-effective

Mistral AI

Mistral Large, Mixtral 8x7B

European compliance, coding

Cohere

Command R+, Embed

Enterprise search, RAG

Amazon

Titan Text, Titan Embeddings

AWS-native, affordable

AI21 Labs

Jamba-Instruct

Long context

Stability AI

Stable Diffusion XL

Image generation


Quick Start

1. Enable Model Access

Or via AWS Console:

  1. Select region (us-east-1 recommended)

  2. Click "Model access"

  3. Enable desired models (instant for most, approval needed for some)

2. Setup IAM Permissions

3. Configure AWS Credentials


Regional Deployment

Available Regions

Region
Location
Models Available
Data Residency

us-east-1

N. Virginia

All models

USA

us-west-2

Oregon

All models

USA

us-gov-west-1

GovCloud West

Select models

USA Gov

ca-central-1

Canada

Most models

Canada

eu-west-1

Ireland

All models

EU

eu-west-2

London

Most models

UK

eu-west-3

Paris

Most models

EU

eu-central-1

Frankfurt

All models

EU

ap-southeast-1

Singapore

Most models

Asia

ap-northeast-1

Tokyo

Most models

Asia

ap-south-1

Mumbai

Select models

India

Multi-Region Setup


Model Selection Guide

Anthropic Claude Models

Claude Model IDs:

  • anthropic.claude-3-5-sonnet-20241022-v2:0 - Latest Sonnet

  • anthropic.claude-3-opus-20240229-v1:0 - Opus

  • anthropic.claude-3-haiku-20240307-v1:0 - Haiku

Meta Llama Models

Llama Model IDs:

  • meta.llama3-1-405b-instruct-v1:0 - 405B (most capable)

  • meta.llama3-1-70b-instruct-v1:0 - 70B (balanced)

  • meta.llama3-1-8b-instruct-v1:0 - 8B (fast)

Mistral AI Models

Mistral Model IDs:

  • mistral.mistral-large-2402-v1:0 - Mistral Large

  • mistral.mixtral-8x7b-instruct-v0:1 - Mixtral 8x7B

Amazon Titan Models

Titan Model IDs:

  • amazon.titan-text-premier-v1:0 - Text generation

  • amazon.titan-text-express-v1 - Fast text

  • amazon.titan-embed-text-v2:0 - Embeddings (1024 dim)

  • amazon.titan-embed-text-v1 - Embeddings (1536 dim)

Cohere Models

Cohere Model IDs:

  • cohere.command-r-plus-v1:0 - Command R+

  • cohere.command-r-v1:0 - Command R

  • cohere.embed-english-v3 - Embeddings


IAM Roles & Permissions

EC2 Instance Role

Lambda Execution Role

EKS Service Account


VPC & Private Connectivity

Security Group Configuration

Private Endpoint Usage


Monitoring & Logging

CloudWatch Metrics

CloudWatch Logs


Cost Management

Pricing Overview

Cost Budgets

Cost Tracking


Production Patterns

Pattern 1: Multi-Model Strategy

Pattern 2: Guardrails

Pattern 3: Knowledge Base Integration


Best Practices

1. ✅ Use IAM Roles Instead of Keys

2. ✅ Enable VPC Endpoints

3. ✅ Monitor Costs

4. ✅ Use Appropriate Model for Task

5. ✅ Enable CloudWatch Logging


Troubleshooting

Common Issues

1. "Model Access Denied"

Problem: Model not enabled in your account.

Solution:

2. "Throttling Exception"

Problem: Exceeded rate limits.

Solution:

3. "Invalid Model ID"

Problem: Wrong model identifier.

Solution:



Additional Resources


Need Help? Join our GitHub Discussionsarrow-up-right or open an issuearrow-up-right.

Last updated

Was this helpful?