SkarpSkarp

Chapter 10 of 20

AWS Global Infrastructure: Regions, Availability Zones, and Edge Locations

Unpack how AWS builds its global footprint so you can reason about latency, resilience, and regulatory boundaries when choosing where to run workloads.

27 min readen

Big Picture: Why AWS Global Infrastructure Matters

Why This Topic Matters

In this module, you connect AWS global infrastructure to three things: latency, resilience, and regulatory boundaries. These drive where you place workloads on AWS.

What Exams Test Here

Cloud Practitioner questions focus on knowing what a Region is vs an Availability Zone, predicting availability if an AZ fails, and understanding what edge locations are used for.

Link to Security & Compliance

You learned that Security and compliance are shared responsibilities between AWS and the customer. Where you place resources affects your compliance and security posture.

Regulation, Latency, Resilience

  • Compliance: Laws like GDPR can require data to stay in certain geographies.
  • Latency: Placing compute near users speeds responses.
  • Resilience: Spreading across isolated locations helps survive failures.

Global Scale, Local Choices

As of May 2026, AWS has many Regions, each with multiple AZs, plus thousands of edge locations. You do not need counts; you must understand how the pieces fit together.

What You Will Do

Next, you will learn precise definitions, see visual mental models, walk through deployment patterns, and practice with quizzes and flashcards linked to exam-style thinking.

AWS Regions: The Global Building Blocks

Canonical Definition

AWS Region: "An AWS Region is a physical location in the world where we cluster data centers." This is the exact wording to remember.

Regions as Big Geographic Buckets

A Region maps to a broad area, like `us-east-1` (Northern Virginia) or `eu-central-1` (Frankfurt). Each Region is separate and contains multiple data centers.

Why Regions Matter: Latency

Choosing a Region close to users usually reduces network round-trip time. Users in Europe will often get better latency from an EU Region than from Asia-Pacific.

Why Regions Matter: Regulation

Regions often align with legal boundaries. For data residency rules (for example, EU personal data), you choose an appropriate Region in that jurisdiction.

Why Regions Matter: Services & Isolation

New services roll out Region by Region, and Regions are isolated from each other. Issues in one Region should not impact another Region.

Region Naming Pattern

Region names look like `xx-yyy-n` (for example `us-east-1`). For the exam, recognize they represent geographic areas, not individual data centers or AZs.

Availability Zones: Inside a Region

Canonical AZ Definition

Availability Zone: "An Availability Zone is one or more discrete data centers with redundant power, networking, and connectivity in an AWS Region."

AZs as Building Blocks

An AZ can be one or more data centers. Each AZ is treated as one logical zone for deploying resources like EC2 instances and RDS databases.

Physical Separation, Logical Proximity

AZs are physically separated to avoid shared failures, but connected by high-bandwidth, low-latency links so you can replicate data and fail over quickly.

AZ Naming Quirk

AZ names like `us-east-1a` are account-specific. Your `1a` might map to a different physical AZ than another account’s `1a`. The exam may hint at this.

How You Use AZs

Spread EC2 instances across at least two AZs, enable multi-AZ for RDS, and place a load balancer in front to achieve high availability within a Region.

Exam Clues for AZs

Phrases like "isolated locations within a Region" or "fault tolerance inside a Region" are strong hints the answer involves Availability Zones.

Mental Model: Region vs AZ vs Data Center

Three-Level Hierarchy

Think: data center < Availability Zone < Region. Data centers make up AZs, and AZs are grouped into Regions, which are the main geographic units.

Visualizing a Region

Picture a big box labeled `Region`. Inside it, three boxes labeled `AZ A`, `AZ B`, and `AZ C`. Inside each AZ, several small boxes representing data centers.

Region vs AZ Roles

Region defines geography and legal boundary. AZ defines fault isolation and high availability within that Region. They solve different problems.

AZ vs Data Center

You choose AZs in the console, not individual data centers. AWS handles which physical building your resources run in.

Failure Scenarios

A data center can fail without killing an AZ; an AZ can fail without killing the Region. Regions failing is rare and usually a disaster recovery concern.

Exam Trap Alert

If a question says "across multiple data centers in the same Region", the concept is Availability Zones. Edge locations and Regions are wrong here.

Edge Locations and Global Content Delivery

What Are Edge Locations?

Edge locations are smaller AWS sites used by services like CloudFront, Route 53, and Global Accelerator. They are not full Regions or AZs.

Caching at the Edge

CloudFront caches content at edge locations near users. This reduces latency because users fetch from a nearby edge instead of a distant Region.

Global Routing Over AWS Backbone

Global Accelerator and CloudFront route traffic into the nearest edge, then across AWS’s private backbone network to your application’s Region.

DNS at the Edge

Route 53 uses a global network of DNS servers, often co-located with edges, to answer DNS queries quickly anywhere in the world.

How Edge Differs from Regions

You do not deploy EC2 into edge locations. Edges handle caching and network entry, while Regions and AZs run the full application stack.

Exam Clue for Edge

If you see "deliver content globally with low latency" or "cache content near users", think CloudFront and edge locations, not new Regions.

Putting It Together: Three Real-World Scenarios

Scenario 1: Global News Site

App stack runs in `eu-west-1`. CloudFront in front caches assets at edge locations worldwide, so readers hit nearby edges instead of Ireland directly.

What This Achieves

You keep a single Region deployment but use edge locations to lower latency globally and reduce load on your origin servers.

Scenario 2: EU Banking App

Regulation requires EU data residency. You choose an EU Region and deploy across at least two AZs, with multi-AZ RDS for database resilience.

Edge + Residency

Static content may be cached globally, but sensitive data at rest remains in the EU Region, supporting regulatory requirements and resilience.

Scenario 3: Regional Gaming Backend

Players in South America connect to a South America Region. EC2 game servers are spread across AZs, with CloudFront serving static game assets.

Key Pattern Across Scenarios

Region handles geography and law, AZs handle high availability, and edge locations fine-tune latency and performance for users.

Thought Exercise: Design a Simple Deployment

Work through this design mentally (or jot notes) to strengthen your reasoning.

Situation:

You are building a learning platform for a university. Most students are in the United States, but some are on exchange in Europe and Asia. The university has a policy that student records for US students must remain in the US. You expect moderate traffic but require high availability during exam periods.

Your task: Decide how to use Regions, Availability Zones, and edge locations.

Questions to answer (pause and think before checking the hints):

  1. Which Region would you choose for the main application and database? Why?
  2. How many Availability Zones will you use, and for which components?
  3. Where do edge locations fit into this design?
  4. How does this design respect the data residency policy?

Hints (unfold mentally only after you decide):

  • 1: You likely choose a US Region (for example, an east or west coast Region) to keep data in the US and minimize latency for most students.
  • 2: Use at least two AZs for critical components (web/app tier, database with multi-AZ) to survive an AZ failure.
  • 3: Use CloudFront with edge locations so exchange students can fetch static assets from nearby edges, even though the origin is in a US Region.
  • 4: Store student records only in the US Region. Edge caching is typically used for static content, not sensitive records.

After this, ask yourself: Which part of the design is about resilience? Which is about latency? Which is about compliance? Being able to separate these concerns is a common exam skill.

Quiz 1: Regions vs AZs vs Edge Locations

Test your understanding of the core concepts before we add more nuance.

Which statement best describes an Availability Zone in AWS?

  1. A geographic area that contains multiple clusters of data centers and defines a regulatory boundary.
  2. A network of edge locations used to cache content closer to users globally.
  3. One or more discrete data centers with redundant power, networking, and connectivity in an AWS Region.
  4. A single data center where you deploy EC2 instances directly.
Show Answer

Answer: C) One or more discrete data centers with redundant power, networking, and connectivity in an AWS Region.

The canonical definition is: "An Availability Zone is one or more discrete data centers with redundant power, networking, and connectivity in an AWS Region." Regions are geographic areas (option 1), edge locations form the global caching network (option 2), and you do not deploy to a specific data center (option 4).

Quiz 2: Choosing the Right Concept

Apply what you know to a realistic decision-style question.

Your company wants to improve performance for users worldwide when they download large video files from an S3 bucket in one Region, without deploying the application to more Regions. Which AWS feature should you use first?

  1. Deploy EC2 instances in every AWS Region.
  2. Use Amazon CloudFront with edge locations in front of the S3 bucket.
  3. Enable multi-AZ on the S3 bucket to replicate it across Availability Zones.
  4. Move the S3 bucket to a Region closer to your headquarters.
Show Answer

Answer: B) Use Amazon CloudFront with edge locations in front of the S3 bucket.

To improve global download performance without multi-Region deployment, use **Amazon CloudFront**. It caches content at **edge locations** near users. Deploying EC2 in every Region is costly and unnecessary, S3 does not use "multi-AZ" in that sense, and moving the bucket closer to HQ does not help users in other continents.

High Availability and Fault Tolerance Using Regions and AZs

HA Within One Region

Use multiple AZs in a single Region for high availability. Load balancers send traffic to instances in at least two AZs, so one AZ can fail without downtime.

Multi-AZ Patterns

Stateless web tiers use Auto Scaling across AZs. Databases often use multi-AZ RDS, keeping a standby in another AZ for automatic failover.

Multi-Region = Disaster Recovery

Using more than one Region is typically for disaster recovery or very high resilience, not basic high availability, at Cloud Practitioner level.

Trade-offs: Multi-AZ vs Multi-Region

Multi-AZ is simpler and keeps data in one Region. Multi-Region adds complexity and data residency questions but protects against full Region failures.

Exam Traps on HA

For AZ failure, think multi-AZ. For Region failure, think multi-Region. Edge locations do not replace AZ or Region level resilience for core apps.

Key Term Flashcards: Regions, AZs, and Edge

Use these flashcards to lock in core definitions and exam distinctions.

AWS Region
An AWS Region is a physical location in the world where we cluster data centers.
Availability Zone (AZ)
An Availability Zone is one or more discrete data centers with redundant power, networking, and connectivity in an AWS Region.
Edge location
A smaller AWS site used by services like Amazon CloudFront, Route 53, and AWS Global Accelerator to cache content and route traffic closer to users.
Main purpose of Regions
Define geographic and regulatory boundaries, and provide isolated groups of Availability Zones where you run workloads.
Main purpose of AZs
Provide isolated locations within a Region so you can design high availability and fault-tolerant architectures.
Main purpose of edge locations
Improve latency and performance for global users by caching and routing traffic at locations close to them.
High availability within a Region
Achieved by deploying resources such as EC2 and RDS across multiple Availability Zones in the same Region.
Disaster recovery across Regions
Using multiple AWS Regions so your application can recover from a full Region outage or large-scale disaster.
Exam clue: "cache content closer to users"
This usually refers to Amazon CloudFront and edge locations, not Regions or Availability Zones.
Exam clue: "isolated locations within a Region"
This describes Availability Zones, not edge locations or separate Regions.

Key Terms

AWS Region
An AWS Region is a physical location in the world where we cluster data centers.
Edge location
A smaller AWS site used by services like Amazon CloudFront, Route 53, and AWS Global Accelerator to cache content and route traffic closer to users.
Data residency
Requirements or policies that specify where data must be stored geographically, such as keeping personal data within a particular country or region.
Amazon Route 53
AWS’s highly available and scalable Domain Name System (DNS) web service, using a global network of DNS servers, often co-located with edge locations.
Fault tolerance
The ability of a system to continue operating correctly even when some components fail, for example by using redundant resources across AZs or Regions.
Amazon CloudFront
AWS’s content delivery network (CDN) service that uses edge locations to cache and deliver content to users with low latency.
Availability Zone
An Availability Zone is one or more discrete data centers with redundant power, networking, and connectivity in an AWS Region.
Multi-AZ deployment
An architecture where resources such as databases or EC2 instances are deployed across multiple Availability Zones in the same Region to improve availability.
AWS Global Accelerator
A networking service that uses the AWS global network and edge locations to improve availability and performance of your applications.
High availability (HA)
The ability of a system to remain accessible and operational for a high percentage of time, often achieved on AWS by deploying across multiple Availability Zones in a Region.

Finished reading?

Test your understanding with a custom practice exam on this chapter.

Test yourself