🏗️ Real-World Scenarios#
This chapter contains real-world architecture scenarios that you’ll encounter as an AWS Solutions Architect. Each scenario includes the problem, solution, and key takeaways.
Scenario 1: E-Commerce Platform Migration#
Problem: A retail company runs their e-commerce platform on-premises. During Black Friday, traffic spikes cause outages. They want to migrate to AWS with high availability and auto-scaling.
Architecture:
CloudFront → WAF → ALB → Auto Scaling EC2 → ElastiCache → RDS Multi-AZ
↓
SQS → Lambda (order processing) → DynamoDB
↓
SNS → Email/SMS notificationsSolution:
- Route53 with latency-based routing to ALB
- CloudFront + WAF for DDoS protection
- ALB with cross-zone LB across 3 AZs
- Auto Scaling with target tracking (CPU at 60%)
- ElastiCache for session state (stateless app)
- RDS Multi-AZ for database HA
- SQS to decouple order processing
- SNS for order notifications
Takeaways: Decouple with SQS → handle traffic spikes. Stateless app → scales horizontally. Multi-AZ → survives AZ failure.
Scenario 2: Data Lake for Analytics#
Problem: A media company generates 5 TB of logs daily. They need to analyze this data for trends and store it cost-effectively.
Architecture:
On-prem logs → Kinesis Data Firehose → S3 (raw) → Glue ETL → S3 (curated)
↓
Athena / Redshift SpectrumSolution:
- Kinesis Data Firehose for streaming ingestion
- S3 as the data lake (raw → cleaned → curated zones)
- AWS Glue for ETL and catalog
- Athena for serverless SQL queries
- Lifecycle policies: Standard → IA → Glacier → Deep Archive
Takeaways: S3 is the most cost-effective data lake. Lifecycle policies reduce storage costs by 80%. Athena is serverless — no clusters to manage.
Scenario 3: Hybrid Cloud Backup#
Problem: A healthcare company needs to backup 50 TB of on-premises data to AWS for disaster recovery. They need low latency access to recent backups and long-term archival for compliance.
Solution:
On-prem servers → Storage Gateway (File) → S3 (active)
↓
Lifecycle → Glacier (compliance, 7yr)- Storage Gateway — File Gateway for cached access to S3
- S3 for primary backup storage
- Lifecycle policy → Glacier after 90 days, Deep Archive after 1 year
- S3 Object Lock for WORM compliance
- Cross-region replication for DR
Takeaways: File Gateway gives on-prem low-latency access to cloud storage. Object Lock satisfies compliance requirements.
Scenario 4: Serverless Microservices#
Problem: A startup wants to build a REST API for their mobile app with minimal operational overhead. Traffic is unpredictable.
Architecture:
Mobile App → API Gateway → Lambda → DynamoDB
↓
Cognito (Auth)- API Gateway (HTTP API) for REST endpoints
- Lambda for business logic (auto-scales)
- DynamoDB (On-Demand) for NoSQL storage
- Cognito for user authentication
Takeaways: Fully serverless. Zero servers to manage. Pay per request. Scales from 0 to millions automatically.
Scenario 5: Global Content Delivery#
Problem: A SaaS company serves customers worldwide. Their US-based origin server causes high latency for European and Asian users.
Solution:
User → Route53 (Latency) → CloudFront → ALB (us-east-1)
↓ (cache hit)
Edge Location- CloudFront with Edge Locations globally
- Route53 latency-based routing to closest region
- Origin Shield to reduce origin load
- Lambda@Edge for URL rewrites and A/B testing
Takeaways: CloudFront reduces latency dramatically (Edge > 400 locations). Lambda@Edge runs code at Edge with sub-ms startup.
Scenario 6: Database Migration with Zero Downtime#
Problem: A financial services company needs to migrate from Oracle to Aurora PostgreSQL with minimal downtime (< 5 minutes).
Solution:
Oracle (On-prem) → DMS (CDC) → Aurora PostgreSQL
↓
SCT (Schema Conversion)- AWS SCT to convert Oracle schema to PostgreSQL
- AWS DMS with full load + CDC replication
- Aurora PostgreSQL (Multi-AZ) as target
- Cutover: Stop app → apply final CDC → switch DNS
Takeaways: DMS supports ongoing replication so cutover is minutes. SCT automates schema conversion but may need manual review.
Scenario 7: Cost Optimization Audit#
Problem: A company’s AWS bill is $50K/month. The CFO wants to reduce it by 30% without affecting performance.
Solution:
- Compute Optimizer → Right-size 40% over-provisioned EC2 instances
- Reserved Instances / Compute SP → 30% savings on steady-state
- Spot Instances → Replace 50% of non-prod with Spot
- S3 Lifecycle → Move old data to Glacier (60% cheaper)
- Delete stale resources → 15 unattached EBS volumes, 3 idle ALBs
- NAT Gateway → VPC Endpoints → Save $200/month on S3 data transfer
Result: Monthly bill reduced from $50K to $33K (34% savings).
Scenario 8: High Availability Web App#
Problem: A travel booking site must maintain 99.99% availability. They currently run in a single AZ.
Architecture:
Route53 (Failover)
├── Primary: us-east-1 (Multi-AZ)
│ ├── ALB → ASG (3 AZs) → RDS Multi-AZ
│ └── ElastiCache Redis (Cluster mode)
└── DR: us-west-2 (Warm Standby)
└── RDS (cross-region replica)
└── ASG (min 2 instances)- Multi-AZ for all services
- RDS Multi-AZ with auto-failover
- Cross-region read replica for DR
- Route53 failover routing with health checks
- Warm standby in us-west-2 (scaled-down)
Takeaways: Multi-AZ handles AZ failure. Cross-region replica handles region failure. Route53 health checks automate DNS failover.
Scenario 9: Secure Cross-Account Access#
Problem: A central security team needs read-only access to S3 buckets across 5 AWS accounts.
Solution:
Management Account
└── IAM User (security-auditor)
└── Assume Role → Account A: S3-ReadOnly-Role
→ Account B: S3-ReadOnly-Role
→ Account C: S3-ReadOnly-Role- Create IAM role in each account with
AmazonS3ReadOnlyAccess - Set trust policy allowing management account to assume role
- Use
aws sts assume-roleto switch accounts
Takeaways: Cross-account roles are the secure way to grant access. No IAM users needed in each account.
Scenario 10: Event-Driven Image Processing#
Problem: A social media app needs to process user-uploaded images (create thumbnails, detect objects, moderate content).
Architecture:
User → API Gateway → Lambda (upload URL) → S3 (raw)
↓
S3 Event → Lambda (thumbnail)
→ Lambda (Rekognition)
→ Lambda (moderation)
↓
S3 (processed)- S3 triggers Lambda on object creation
- Lambda generates thumbnails using Sharp/Pillow
- Rekognition for object detection
- Lambda for content moderation
- SNS to notify user when processing is complete
Takeaways: Event-driven architecture is decoupled and scalable. S3 events + Lambda = powerful serverless processing pipeline.
✅ Chapter Quiz#
-
An e-commerce platform experiences traffic spikes during Black Friday. Which architecture decouples order processing from the web tier?
- A) Direct database writes from the web tier
- B) SQS queue between web tier and order processing
- C) Same EC2 instance for web and processing
- D) Synchronous API calls
-
A media company ingests 5 TB of daily logs. Which service is best for streaming data ingestion?
- A) S3
- B) Kinesis Data Firehose
- C) SQS
- D) DynamoDB
-
A healthcare company needs low-latency access to recent backups stored in S3. Which hybrid storage solution should they use?
- A) Direct Connect
- B) Storage Gateway File Gateway
- C) Snowball
- D) DataSync
-
A startup wants to build a REST API with zero server management. Which services form a fully serverless architecture?
- A) EC2, ALB, RDS
- B) API Gateway, Lambda, DynamoDB
- C) ECS, ALB, RDS
- D) Elastic Beanstalk, RDS
-
A SaaS company serves global users. Which service reduces latency by caching content at edge locations?
- A) Route53
- B) CloudFront
- C) Global Accelerator
- D) Direct Connect
-
A financial company needs to migrate from Oracle to Aurora PostgreSQL with minimal downtime. Which services are required?
- A) SCT and DMS with CDC
- B) DataSync and Storage Gateway
- C) Snowball and SCT
- D) DMS and Snowball
-
A company wants to reduce a $50K/month AWS bill by 30% without affecting performance. Which tool identifies over-provisioned EC2 instances?
- A) Cost Explorer
- B) Compute Optimizer
- C) Trusted Advisor
- D) Budgets
-
A travel booking site needs 99.99% availability. In addition to Multi-AZ, what should they implement?
- A) Single-region deployment
- B) Multi-region warm standby with Route53 failover
- C) Spot Instances
- D) T2 burstable instances
-
A security team needs read-only access to S3 buckets across 5 AWS accounts. Which approach is most secure?
- A) Create IAM users in each account
- B) Use cross-account IAM roles with trust policies
- C) Share root credentials
- D) Use S3 bucket policies only
-
A social media app needs to process user uploaded images. Which service triggers the processing workflow?
- A) API Gateway
- B) S3 event notifications
- C) CloudWatch alarm
- D) SQS
-
A company needs to analyze data in S3 using SQL without provisioning a data warehouse. Which service should they use?
- A) Redshift
- B) Athena
- C) EMR
- D) OpenSearch
-
Which service should an IoT company use to collect and process millions of device events per second in real time?
- A) Kinesis Data Streams
- B) SQS
- C) SNS
- D) EventBridge
-
A company needs to share 10 GB files with external partners securely. Which solution provides time-limited access?
- A) S3 pre-signed URLs
- B) S3 public bucket
- C) EFS with NFS export
- D) FTP server on EC2
-
A company wants to decouple a monolithic application into microservices. Which service enables asynchronous communication between new microservices?
- A) API Gateway
- B) SQS
- C) ELB
- D) EC2
-
Which AWS service provides serverless GPU access for machine learning inference?
- A) SageMaker
- B) EC2 G5 instances
- C) Lambda with GPU
- D) EKS
-
A company runs a stateful web application on EC2. They need high availability but the app stores session data locally. What should they do?
- A) Enable sticky sessions on the ALB
- B) Move session data to ElastiCache for a stateless architecture
- C) Increase instance size
- D) Add more instances without changing the architecture
-
Which AWS service provides a fully managed CI/CD service for building, testing, and deploying code?
- A) CodeBuild
- B) CodePipeline
- C) CodeDeploy
- D) All of the above
-
A company needs to create a private network connection between two VPCs in different AWS regions. Which service should they use?
- A) VPC Peering
- B) Direct Connect
- C) VPN
- D) Transit Gateway
-
A company wants to automate the creation of a multi-account AWS environment with governance controls. Which service should they use?
- A) AWS Organizations
- B) AWS Control Tower
- C) CloudFormation
- D) Service Catalog
-
Which AWS service should be used to send push notifications to mobile devices at scale?
- A) SNS
- B) SQS
- C) EventBridge
- D) Pinpoint
-
A company needs to enforce that all resources in an AWS account are tagged with a CostCenter tag. Which service can enforce this?
- A) AWS Config
- B) CloudTrail
- C) Trusted Advisor
- D) Cost Explorer
-
A company is deploying a containerized application and needs a fully managed container registry. Which AWS service should they use?
- A) Docker Hub
- B) ECR (Elastic Container Registry)
- C) S3
- D) EFS
-
A company needs to run a SQL query across multiple S3 buckets containing CSV data. Which service is MOST cost-effective for this use case?
- A) Redshift
- B) Athena
- C) RDS
- D) EMR
-
Which AWS service provides a managed API for blockchain networks?
- A) Managed Blockchain
- B) Quantum Ledger Database (QLDB)
- C) DynamoDB
- D) RDS
-
A company needs to store audit logs for 7 years to meet regulatory compliance. Which S3 storage class is MOST cost-effective?
- A) S3 Standard
- B) S3 Standard-IA
- C) S3 Glacier
- D) S3 Deep Archive
📝 Answer Key
- B — SQS decouples the web tier from order processing, handling traffic spikes via message queuing.
- B — Kinesis Data Firehose is the best managed service for streaming data ingestion.
- B — Storage Gateway File Gateway provides cached, low-latency access to S3 from on-premises.
- B — API Gateway + Lambda + DynamoDB is the classic fully serverless architecture.
- B — CloudFront caches content at 400+ edge locations for low-latency global delivery.
- A — SCT converts the schema; DMS with CDC enables migration with minimal downtime.
- B — Compute Optimizer uses ML to analyze utilization and recommend right-sizing.
- B — Multi-region warm standby with Route53 failover routing provides DR beyond AZ failures.
- B — Cross-account IAM roles with trust policies are the secure way to grant cross-account access.
- B — S3 event notifications trigger Lambda functions automatically on object creation.
- B — Athena enables serverless SQL queries directly on data in S3.
- A — Kinesis Data Streams handles real-time ingestion of large data streams from IoT devices.
- A — Pre-signed URLs provide time-limited, secure access to private S3 objects.
- B — SQS provides asynchronous message passing to decouple microservices.
- A — SageMaker provides managed ML training and inference with GPU support.
- B — Moving session data to ElastiCache makes the app stateless and enables horizontal scaling.
- D — CodeCommit, CodeBuild, CodeDeploy, and CodePipeline together form a complete CI/CD pipeline.
- A — VPC Peering connects VPCs across AWS regions using private IP addresses.
- B — AWS Control Tower automates landing zone setup with built-in governance guardrails.
- A — SNS sends push notifications to mobile devices via push, SMS, and email.
- A — AWS Config rules can enforce tagging compliance across resources.
- B — ECR is a fully managed Docker container registry integrated with ECS and EKS.
- B — Athena is serverless and charges per query — most cost-effective for ad-hoc S3 data queries.
- A — Managed Blockchain provides a managed service for creating blockchain networks.
- D — S3 Deep Archive is the lowest-cost storage class for long-term retention of 7+ years.
📚 Additional Resources#
Next → Practice Test 1