AWS Bedrock
Serverless AI on AWS with Claude, Llama, Mistral and 13 foundation models
Enterprise AI with Claude, Llama, Mistral, and more on AWS infrastructure
Overview
Amazon Bedrock provides serverless access to foundation models from leading AI companies including Anthropic, Meta, Mistral, Cohere, and Amazon. Perfect for enterprise deployments requiring AWS integration, scalability, and compliance.
!!! danger "Inference Profile ARN Required" For Anthropic Claude models, you MUST use the full inference profile ARN, not simple model names. See configuration examples below for the correct format.
Key Benefits
🤖 Multiple Models: Claude, Llama 3, Mistral, Titan, Command
🏢 AWS Integration: IAM, VPC, CloudWatch, S3
🌍 Global Regions: 10+ AWS regions
🔒 Enterprise Security: PrivateLink, KMS encryption
💰 Pay-per-use: No infrastructure costs
📊 Serverless: Automatic scaling
🛡️ Compliance: SOC 2, HIPAA, ISO 27001
Available Model Providers
Anthropic
Claude 3.5 Sonnet, Claude 3 Opus/Haiku
Complex reasoning, coding
Meta
Llama 3.1 (8B, 70B, 405B)
Open source, cost-effective
Mistral AI
Mistral Large, Mixtral 8x7B
European compliance, coding
Cohere
Command R+, Embed
Enterprise search, RAG
Amazon
Titan Text, Titan Embeddings
AWS-native, affordable
AI21 Labs
Jamba-Instruct
Long context
Stability AI
Stable Diffusion XL
Image generation
Quick Start
1. Enable Model Access
Or via AWS Console:
Open Bedrock Console
Select region (us-east-1 recommended)
Click "Model access"
Enable desired models (instant for most, approval needed for some)
2. Setup IAM Permissions
3. Configure AWS Credentials
4. Configure NeurosLink AI
Regional Deployment
Available Regions
us-east-1
N. Virginia
All models
USA
us-west-2
Oregon
All models
USA
us-gov-west-1
GovCloud West
Select models
USA Gov
ca-central-1
Canada
Most models
Canada
eu-west-1
Ireland
All models
EU
eu-west-2
London
Most models
UK
eu-west-3
Paris
Most models
EU
eu-central-1
Frankfurt
All models
EU
ap-southeast-1
Singapore
Most models
Asia
ap-northeast-1
Tokyo
Most models
Asia
ap-south-1
Mumbai
Select models
India
Multi-Region Setup
Model Selection Guide
Anthropic Claude Models
Claude Model IDs:
anthropic.claude-3-5-sonnet-20241022-v2:0- Latest Sonnetanthropic.claude-3-opus-20240229-v1:0- Opusanthropic.claude-3-haiku-20240307-v1:0- Haiku
Meta Llama Models
Llama Model IDs:
meta.llama3-1-405b-instruct-v1:0- 405B (most capable)meta.llama3-1-70b-instruct-v1:0- 70B (balanced)meta.llama3-1-8b-instruct-v1:0- 8B (fast)
Mistral AI Models
Mistral Model IDs:
mistral.mistral-large-2402-v1:0- Mistral Largemistral.mixtral-8x7b-instruct-v0:1- Mixtral 8x7B
Amazon Titan Models
Titan Model IDs:
amazon.titan-text-premier-v1:0- Text generationamazon.titan-text-express-v1- Fast textamazon.titan-embed-text-v2:0- Embeddings (1024 dim)amazon.titan-embed-text-v1- Embeddings (1536 dim)
Cohere Models
Cohere Model IDs:
cohere.command-r-plus-v1:0- Command R+cohere.command-r-v1:0- Command Rcohere.embed-english-v3- Embeddings
IAM Roles & Permissions
EC2 Instance Role
Lambda Execution Role
EKS Service Account
VPC & Private Connectivity
VPC Endpoint (PrivateLink)
Security Group Configuration
Private Endpoint Usage
Monitoring & Logging
CloudWatch Metrics
CloudWatch Logs
Cost Management
Pricing Overview
Cost Budgets
Cost Tracking
Production Patterns
Pattern 1: Multi-Model Strategy
Pattern 2: Guardrails
Pattern 3: Knowledge Base Integration
Best Practices
1. ✅ Use IAM Roles Instead of Keys
2. ✅ Enable VPC Endpoints
3. ✅ Monitor Costs
4. ✅ Use Appropriate Model for Task
5. ✅ Enable CloudWatch Logging
Troubleshooting
Common Issues
1. "Model Access Denied"
Problem: Model not enabled in your account.
Solution:
2. "Throttling Exception"
Problem: Exceeded rate limits.
Solution:
3. "Invalid Model ID"
Problem: Wrong model identifier.
Solution:
Related Documentation
Provider Setup - General configuration
Multi-Region - Geographic distribution
Cost Optimization - Reduce costs
Compliance - Security
Additional Resources
AWS Bedrock Docs - Official documentation
Bedrock Pricing - Pricing details
Bedrock Console - Manage models
AWS CLI Reference - CLI commands
Need Help? Join our GitHub Discussions or open an issue.
Last updated
Was this helpful?

