datadocs

Networking & Content Delivery

CloudFront

A Content Delivery Network (CDN) is a system of distributed servers (network) that deliver webpages and other web content to a user based on the geographic locations of the user, the origin of the webpage, and a content delivery server.
- Origin is the origin of all the files that the CDN will distribute.
- Distribution is the name of the CDN that consists of a collection of edge locations.
- Web distributions are typically used for websites.
- RTMP is used for media streaming.
Amazon CloudFront is a fast content delivery network (CDN) service.
- Can deliver an entire website, including dynamic, static, streaming, and interactive content using a global network of edge locations.
- Requests for the content are automatically routed to the nearest edge location, so the content is delivered with the best possible performance.
Edge locations:
- They are not just read-only, one can write to them too.
- Objects are cached based on the TTL (time to live)
- You can clear cached objects (i.e. invalidate them), but you will be charged.
You cannot combine blacklisting and whitelisting in CloudFront.
There are multiple options to securely access files stored in S3:
- CloudFront Signed URLs: Generates URLs for access. Primarily used for providing access to individual files and when cookies cannot be used.
- CloudFront Signed Cookies: Generates special cookies for access. Primarily used when you don't want to change your current URL.
- CloudFront Origin Access Identity: A virtual user identity that is used to give the CloudFront distribution permission to fetch a private object from S3.
- Choosing Between Signed URLs and Signed Cookies
Lambda@Edge:
- You can use Lambda@Edge to allow your Lambda functions to customize the content that CloudFront delivers and to execute the authentication process in AWS locations closer to the users.
- Using AWS Lambda with CloudFront Lambda@Edge
Origin Failover:
- You can set up an origin failover by creating an origin group with two origins with one as the primary origin and the other as the second origin which CloudFront automatically switches to when the primary origin fails.
- Optimizing High Availability with CloudFront Origin Failover
SNI Custom SSL:
- SNI Custom SSL relies on the SNI extension of the Transport Layer Security protocol, which allows multiple domains to serve SSL traffic over the same IP address by including the hostname which the viewers are trying to connect to.
- Use the Dedicated IP Custom SSL feature to support non-SNI compliant browsers for HTTPS content.
- Amazon CloudFront Adds SNI Custom SSL
Amazon CloudFront Cheat Sheet

Request routing

DNS routes the request to the nearest CloudFront POP in terms of latency.
In the POP, CloudFront checks its cache for the requested files.
If the files are not in the cache, CloudFront will query the origin server and then cache the resource on the edge location.

Routing

DNS

The Domain Name Systems (DNS) is the phonebook of the Internet.
- Converts a hostname (www.example.com) into a computer-friendly IP address (192.168.1.1)
The DNS recursor is a server designed to receive queries from client machines.
- Makes a series of requests until it reaches the authoritative DNS nameserver.
- Cached either locally inside the querying computer or remotely in the DNS infrastructure.
- Examples include Google DNS, OpenDNS, and providers like Comcast.
Root nameserver responds to the resolver with the address of a TLD server (such as .com or .net)
Top Level Domain (TLD) nameserver stores the information for its domains.
- The TLD server responds with the IP address of the domain’s nameserver (example.com)
- The top-level domain (TLD) names (e.g. .com) are controlled by the Internet Assigned Numbers Authority (IANA) in a root zone database, which is essentially a database of all available TLD names.
- The database can be viewed under https://www.iana.org/domains/root/db
Authoritative nameserver is a server that actually holds, and is responsible for, DNS resource records.
- This is the server at the bottom of the DNS lookup chain.
- Can satisfy queries from its own data without needing to query another source.
- But requires an additional nameserver for serving queries for a subdomain (blog.cloudflare.com)
A registrar is an authority that can assign domain names under one or multiple TLDs.
- This domains are registered with InterNIC, a service of ICANN, which enforces uniqueness of domain names across the Internet.
- Each domain name becomes registered in a central database known as the WhoIS database.
- For example, Amazon, GoDaddy.com

Records

The State of Authority Record (SOA) stores information about:
- The name of the server that supplied the data for the zone.
- The administrator of the zone.
- The current version of the data file.
- The default number of seconds for the TTL file on resource records.
- When any record in the zone is updated, the SOA record serial number is incremented.
The Name Server (NS) record is used for recursively resolving the required domain name.
- Used to redirect the resolver to the DNS server hosting the next level domain.
The record A specifies IP address (IPv4) for given host.
The record AAAA (also quad-A record) specifies IPv6 address for given host.
The Canonical Name (CNAME) record maps a name to another name.
- Use a CNAME record if you want to alias one name to another name, and you don’t need other records (such as MX records for emails) for the same name.
- Can be used to map two domain names to the same IP address.
- Should only be used when there are no other records on that name.
The ALIAS record maps a name to another name, but can coexist with other records on that name.
- Use an ALIAS record if you’re trying to alias the root domain (apex zone), or if you need other records for the same name.
- Given the choice, always choose an ALIAS record over a CNAME.
- ALIAS records have special functions that are not present in other DNS servers. Their main function is to provide special functionality and integration into AWS services.
- ALIAS records can also point to AWS resources that are hosted in other accounts.
- Choosing Between Alias and Non-Alias Records
- How do I create alias records in Route 53?
The MX record is used for emails.
The PTR record is used to lookup the domain name of an IP address (also called reversed A record)
The duration that the DNS record is cached (on Resolving Server or user PC) is TTL in seconds.
- The default is 48 hours, that is, a DNS change may need 48 hours to fully propagate.

Route 53

Route 53 is Amazon's DNS Service.
Route 53 is named so because the DNS port is 53.
You can buy domain names directly with AWS.
- Can take up to 3 days to register.
- There is a default limit of 50 domain names. However, this limit can be increased by contacting the AWS support.
Health checks:
- You can set health checks on individual record sets.
- If a record set fails a health check, it will be removed until it passes the check.
- You can also set up SNS notifications to be notified about any failed health checks.
Failover configurations:
- Use active-active failover configuration when you want all of your resources to be available the majority of the time. You configure active-active failover using any routing policy.
- Use active-passive failover configuration when you want a secondary resource or group of resources to be on standby in case all the primary resources become unavailable. You configure active-passive failover using the failover routing policy.
- Active-Active and Active-Passive Failover
Route 53 has a security feature that prevents internal DNS from being read by external sources.
- The work around is to create a EC2 hosted DNS instance that does zone transfers from the internal DNS, and allows itself to be queried by external servers.
Prerequisites for routing traffic to a website that is hosted in an Amazon S3 Bucket:
- An S3 bucket that is configured to host a static website.
- The bucket must have the same name as your domain or subdomain.
- A registered domain name.
- Route 53 as the DNS service for the domain.

Amazon Route 53 Cheat Sheet

Routing policies

Simple routing:
- One record with multiple IP addresses.
- If multiple values are specified, Route 53 returns them in random order.
- For example, the user can be redirected to Ohio and the next time to Sydney (after TTL expires)
- Cannot be associated with health checks.
Weighted routing:
- Allows you to split your traffic based on different weights assigned.
- For example, you can set 10% of traffic to go to US-EAST-1 and 90% to EU-WEST-1.
Latency-based routing:
- Routes traffic based on the lowest network latency for the end user.
- To use it, create a latency RRset for the AWS resource in each region it is hosted.
- Use VPN clients (such as NordVPN) to test the service.
- Latency based routing will approximate geographic balance only when all routes and traffic evenly supported which is rarely the case due to infrastructure and day night variations.
Failover routing:
- Used when you want to create an active/passive setup.
- For example, when your primary server is in US-EAST-1 and secondary in EU-WEST-1.
- Route 53 will monitor the health of the primary server using a health check.
- Configuring DNS Failover
Geolocation routing:
- Routes traffic based on the geolocation of the end user (national boundaries)
- For example, route all queries from Europe to a fleet of EC2 instances in EU-WEST-1. These servers may have the local language of European customers and all price are displayed in Euros.
- Not to be confused with latency-based routing.
Geoproximity routing (traffic flow only):
- Routes traffic based on the geolocation of the end user and resource (latitude & longitude)
- You can also specify to route more traffic or less to a given resource by specifying a value (bias)
- A bias expands or shrinks the size of the geographic region from which traffic is routed.
- To use it, you must use Route 53 traffic flow.
Multivalue Answer routing:
- Lets configure Route 53 to return multiple IP addresses in response to DNS queries.
- Compared to simple routing, allows health checks and returns only values for healthy resources.
- The choice of which to use is left to the requesting service effectively creating a form or randomization.
Choosing a Routing Policy

VPC

Amazon Virtual Private Cloud (Amazon VPC) lets you provision a logically isolated section of the AWS where you can launch AWS resources in a virtual network that you define.
- Think of VPC as a logical datacenter on AWS.
You have complete control over your network environment, including selection of your IP address range, creation of subnets, and configuration of route tables and network gateways.
- For example, create a public-facing subnet (10.0.1.0/24) for web servers and private-facing subnet (10.0.2.0/24) for databases.
Allows creation of a hardware VPN connection between private datacenters and Amazon VPC.
Consists of gateways, route tables, ACLs, subnets, and security groups.
Default VPC:
- User friendly, allowing to immediately deploy instances.
- Your default VPC includes a subnet per AZ, an internet gateway (each default subnet is a public subnet), a route table with the route to the internet gateway, a default NACL that allows everything, and a default security group that also allows everything. Each EC2 instance created under a subnet of this VPC has both a private and public IP address.
- If you do delete the default VPC, it will remove any network objects associated with it. But you can recover it now.
Dedicated hosting tenancy:
- Once a VPC is set to Dedicated, it can be changed back to Default via the CLI, SDK or API.
- Note that this will not change hosting settings for existing instances, only future ones.
- Existing instances can be changed via CLI, SDK or API but need to be in a stopped state to do so.
Vulnerability scans:
- Until recently customers were not permitted to conduct penetration testing without AWS engagement. However that has changed. There are still conditions though.
How to communicate to the back-end services hosted on-premise:
- You can do this by either using a site to site VPN or Direct Connect.
- You must also make sure that your internal IP addresses do not overlap.
You can create up to 5 VPCs in a single region.
In Amazon VPC, an instance always retains its private IP address.
VPCs and Subnets
Amazon VPC Cheat Sheet

Components

Inside a region, we have our VPC. On the outside of the VPC, we have two ways to connecting to it: Internet Gateway and Virtual Private Gateway. The traffic from an Internet gateway is routed to the appropriate subnet using the routes in the routing table. The rules of the network ACL associated with the subnet control which traffic is allowed to the subnet. The rules of the security group associated with an instance control which traffic is allowed to the instance.
AWS has removed the Firewall appliance from the hub of the network and implemented the firewall functionality as stateful Security Groups, and stateless subnet NACLs.

Subnets

A subnet is a range of IP addresses in your VPC.
To protect the AWS resources in each subnet, you can use multiple layers of security, including security groups and ACL.
Subnets cannot span multiple AZs.
By default, instances in new subnets in a custom VPC can communicate with each other across AZs. In a custom VPC with new subnets in each AZ, there is a route that supports communication across all subnets/AZs. Plus a default SG with an allow rule 'All traffic, all protocols, all ports, from anything using this default SG'.

Classless Inter-Domain Routing (CIDR)

Some private IP address ranges include 10.0.0.0/8, 172.16.0.0/12, and 192.168.0.0/16
The largest prefix Amazon allows is 16 (65,536 addresses), the smallest is 28 (16 addresses)
The CIDR block must not overlap with any existing CIDR block that's associated with the VPC.
You cannot increase or decrease the size of an existing CIDR block.
You have a limit on the number of CIDR blocks you can associate with a VPC.
The CIDR block must not be the same or larger than the CIDR range of a route.
The first four IP addresses and the last IP address in each subnet CIDR block are not available.
To translate an IP address with prefix (192.64.0.0/14):
- Convert the prefix into the bitmask (14 bits from left is 11111111.11111100.00000000.00000000)
- Then into the decimal subnet mask (255.252.0.0)
- Then into the wildcard (remaining 0.3.255.255)
- Finally, add the prefix to the IP address (the first available host is 192.64.0.1, the last 192.67.255.254)
- AN INTERACTIVE IP ADDRESS AND CIDR RANGE VISUALIZER

Route tables

A route table contains a set of rules (routes) that are used to determine where network traffic is directed.
The route table entries enable instances in the subnet to communicate with other instances in the VPC (also in other subnets), and to communicate directly over the Internet.
Every time we create a subnet, it will be automatically associated with the main route table.
- Thus, create a separate, custom route table for any public subnets.
Every route table contains a local route for communication within the VPC.
The default route table also contains route to the IGW.

IGW

An internet gateway (IGW) is a horizontally scaled, redundant, and highly available VPC component that allows communication between instances in your VPC and the internet.
Imposes no availability risks or bandwidth constraints on your network traffic.
A subnet that's associated with a route table that has a route to an Internet gateway (IGW) is known as a public subnet.
You can only attach one internet gateway per VPC.

NACL

A Network Access Control List (Network ACL) is an optional layer of security for your VPC that acts as a firewall for controlling traffic in and out of one or more subnets.
- Think of Homeland Security.
Operate at the subnet level.
NACLs have separate inbound and outbound rules, and each rule can either allow or deny traffic.
- For example, you can block a specific IP address.
Stateless:
- Return traffic must be explicitly allowed by rules.
Automatically apply to all instances in the subnets they're associated with.
- Therefore, an additional layer of defense if the security group rules are too permissive.
The rules are evaluated in numerical order, from the lowest to the highest.
- Amazon recommends that you use increments of 100 for each rule (and of 1 for IPv4 and IPv6)
- Rules are executed immediately when a matching allow/deny rule is found.
- For example, if you allow port 80 in the rule "100" and disable it in the rule "200", the port will still be open. That's also why everything under the rule "*" is denied at the end.
For a new NACL:
- All traffic is blocked by default.
For the default NACL:
- All traffic is allowed by default.
Your VPC automatically comes with a default network ACL that allows everything.
When creating a new NACL, all inbound and outbound rules are denied by default.
Each subnet in the VPC must be associated with a network ACL.
- If you don't explicitly associate a subnet with a NACL, the subnet is automatically associated with the default NACL.
- You can associate a NACL with multiple subnets.
- However, a subnet can only be associated with a single NACL at a time.
- When you associate a subnet with a new NACL, starting with the lowest numbered rule.
Network ACLs

Security groups

Security groups act as a firewall for associated Amazon EC2 instances:
- Think of local law enforcement.
Operate at the instance level.
Support "allow" rules only.
Evaluate all rules before deciding whether to allow traffic.
Stateful:
- Return traffic is automatically allowed, regardless of any rules.
Changes to security groups take effect immediately.
For a new security group:
- All inbound traffic is blocked by default.
- All outbound traffic is allowed by default.
For the default security group:
- All inbound traffic is allowed only for instances sharing this SG.
- All outbound traffic is allowed by default.
By default, security groups do not have access to each other.
EC2 instances:
- Any number of EC2 instances can be within a security group.
- You can assign up to five security groups to an instance.
Security Groups for Your VPC

Custom VPC

Create a new VPC (10.0.0.0/16). When you create a custom VPC, a default route table, a default ACL, and a default VPC security group are created. It won't create any subnets, nor it will create any gateways.
Create two subnets under this VPC. Assign custom IP address ranges in each subnet: 10.0.1.0/24 for public and 10.0.2.0/24 for private. Select the AZ for each one (Keep in mind: An AZ in one AWS account can be different from the same AZ in a different AWS account, since they are randomized). By default, any user-created VPC subnet WILL NOT automatically assign public IPv4 addresses to instances, thus for the public one, turn on "Auto-assign IPv4".
Create an internet gateway and attach it to the VPC.
Configure route tables between subnets. Since the created subnets are associated with the main route table by default, create a new route table and add the 0.0.0.0/0 route to the internet gateway created earlier. Associate then the public subnet 10.0.1.0/24 with this table.
Go over to the AWS resources, and create them under the corresponding subnets.
Specify the appropriate security groups for your resources for them to communicate. For example, for SG 'A' to talk to a private SG 'B', create a new rule for SG 'B' and add SG 'A' as Source.
VPC with Public and Private Subnets (NAT)

VPC Peering

Connect one VPC with another via a direct network route using private IP addresses.
Instances behave as they are on the same private network.
You can peer VPCs with VPCs in another regions or even AWS accounts.
You may peer a VPC to another VPC that's in your same account, or to any VPC in any other account.
The following VPC peering connection configurations are not supported.
- Overlapping CIDR Blocks
- Transitive Peering: If both VPC A and VPC B are peered with VPC C, VPC A and VPC B cannot communicate with each other directly.
- Edge to Edge Routing Through a Gateway or Private Connection

NAT

You can use a network address translation (NAT) gateway to enable instances in a private subnet to connect to the internet or other AWS services, but prevent the internet from initiating a connection with those instances.
- For example, this can be used to do software updates in private subnets.

NAT instances (outdated)

A NAT instance is acting as a bridge between subnets and the internet gateway.
To set it up, create a NAT instance, and in the route table of the subnet where your EC2 instance is located, enter a route to this NAT instance to direct the internet traffic through this instance.
A NAT instance is always behind the security group.
Requirements:
- Must be in a public subnet.
- The source/destination checks must be disabled.
Limitations:
- The amount of traffic that this instance can support depends on its size.
  - A massive bottleneck: A single VM, which can be easily overwhelmed. You can create HA using Autoscaling Groups, multiple subnets in different AZs, and a script to automate failover.
- When the instance is removed, the corresponding route is marked as "blackhole"

NAT Gateway

Redundant in the AZ and thus highly available.
Starts at 5Gbps and scales currently to 45Gbps.
There is no need to patch operation system.
Automatically assigned a public IP address.
Remember to update your route tables.
Uses ephemeral ports 1024-65535.
Not behind any security group.
If you have resources in multiple AZs and they share one NAT gateway, in the event of that the NAT gateway's AZ is down, resources in the other AZs lose internet access.
- To create an AZ-independent architecture, create a NAT gateway in each AZ.

Egress-Only Internet Gateway

The purpose of an "Egress-Only Internet Gateway" is to allow IPv6 based traffic within a VPC to access the Internet, whilst denying any Internet based resources the possibility of initiating a connection back into the VPC.
An egress-only Internet gateway is for use with IPv6 traffic only.
To enable outbound-only Internet communication over IPv4, use a NAT gateway instead.

Bastion host

A NAT gateway is used to provide internet traffic to EC2 instances in private subnets.
A bastion is used to securely administer EC2 instances (SSH/RDP)
Don't confuse Bastions and NATs, which allow outside traffic to reach an instance in a private subnet.
A bastion host is a special computer on a network designed and configured to withstand attacks.
- Generally hosts a single application (e.g. proxy server)
- All other services are removed or limited to reduce the threat to the computer.
- Located either on the outside of the firewall or in a demilitarized zone (DMZ)
- Usually involves access from untrusted networks or computers.
The best way to implement a bastion host is to create a small EC2 instance which should only have a security group from a particular IP address for maximum security.
- Small because this host will only act as a jump server to connect to other instances.
Bastion Architecture
How to Record SSH Sessions Established Through a Bastion Host

VPC Flow Logs

VPC Flow Logs enables you to capture information about the IP traffic.
Logs are stored either using S3 or Amazon CloudWatch Logs.
- For this, create a log group in Amazon CloudWatch and an IAM role first.
Can be created at three levels:
- VPC
- Subnet
- Network Interface Level
Limitations:
- You cannot enable flow logs for VPCs that are peered with your VPC unless the peer VPCs are under your account.
- After you've created a flow log, you cannot change its configuration.
- You cannot tag a flow log.
Not all IP traffic is monitored:
- Traffic generated by EC2 instances when they contact the Amazon DNS server (unless its your own DNS server)
- Traffic generated by a Windows instance for Amazon Windows license activation.
- Traffic to and from 169.254.169.254 for instance metadata.
- DHCP traffic as well as the traffic to the reserved IP address for the default VPC router.

VPC Endpoints

A VPC endpoint enables you to privately connect from your VPC to other AWS services without requiring an internet gateway, NAT device, VPN connection, or AWS Direct Connect connection.
- In contrast to a NAT gateway, traffic does not leave the Amazon network.
- Instances in your VPC do not require public IP addresses.
Endpoints are virtual devices that are horizontally scaled, redundant, and highly available.
There are two types of VPC endpoints:
- Interface endpoints: A whole bunch of services are supported.
- Gateway endpoints: S3 and DynamoDB are supported.

Site-to-site connection

VPN

Step 1: Create a Customer Gateway:
- A Customer Gateway is the anchor on on-premise side of that connection.
- Contains information about the third party site e.g. the external IP address and type of routing.
- It can be a physical or software appliance.
Step 2: Create a Virtual Private Gateway (VGW):
- A Virtual Private Gateway (VPG) is the anchor on the AWS side of the VPN connection.
- Connects a specified VPC to the VPN.
Step 3: Enable route propagation in your route table:
- To enable instances in your VPC to reach your customer gateway.
Step 4: Update your security group:
- To allow access to instances in your VPC from your network.
- Update your security group rules to enable inbound SSH, RDP, and ICMP access.
Step 5: Get the VPN connection configuration and configure your customer gateway.
What is AWS Site-to-Site VPN?

Direct Connect

Direct Connect makes it easy to establish a dedicated network connection from your premises to AWS.
AWS Direct Connect does not involve the Internet; instead, it uses dedicated, private network connections.
Benefits:
- Useful for high throughput workloads (i.e. lots of network traffic)
- Or if you want to establish a stable and reliable network connection.
- Provides a more consistent network experience than Internet-based applications.
To connect to your on-premise servers via Direct Connect:
- By default, there is no route connecting your VPC back to the on premise data center.
- Edit the VPC subnet route table, adding a route back to the on-premise datacenter.
- Enable route propagation on your Virtual Private Gateway (VPG)

High availability

Availability is the % of a time period when the service will be able to respond to request in some fashion.
Durability refers to the on-going existence of the object or resource.
Resiliency can be described as the ability to a system to self heal after damage or an event.
Reliability is the probability that a system will work as designed.
Everything fails. Everything. You should always plan for failure.
- Use multiple AZs and multiple regions wherever you can.
- Know the difference between Multi-AZ (for disaster recovery) and read replicas (for performance) for RDS.
- Know the difference between scaling out (auto-scaling) and up (increase resources)
- Know the different S3 storage classes.
- Always consider the cost element.
- Perform manual robustness testing: The Netflix Simian Army

How to achieve HA when ELB is impossible in the current setting?
- Write a custom script that checks the health of the EC2 instance and if the instance stops responding, the script will switch the Elastic IP address to a standby EC2 instance.
- Leveraging Multiple IP Addresses for Virtual IP Address Fail-over

ELB

Elastic Load Balancing (ELB) automatically distributes incoming application traffic across multiple targets, such as Amazon EC2 instances, containers, IP addresses, and Lambda functions.
X-Forwarded:
- When a request is going through an ELB, the EC2 instance receives the IP address of the ELB instead of the client. To forward the client's IP address, use X-Forwarded-For field.
Monitoring:
- Instances monitored by ELB are either InService or OutService.
- The ELB periodically sends pings, attempts connections, or sends requests to test the EC2 instances.
- Configure Health Checks for Your Classic Load Balancer
- If you configure the Auto Scaling group to use Elastic Load Balancing health checks, it considers the instance unhealthy if it fails either the EC2 status checks or the load balancer health checks.
- The load balancer will route the incoming requests only to the healthy instances.
- Adding Elastic Load Balancing Health Checks to an Auto Scaling Group
Sticky sessions:
- Sticky session allow you to bind a user's session to a specific EC2 instance.
- Can be useful if you're storing information locally on that instance.
- If you're confronted with an EC2 instance that receives no traffic, disable sticky sessions.
Cross-Zone Load Balancing:
- Direct traffic across multiple AZs.
- Configure Cross-Zone Load Balancing
Path patterns:
- Direct traffic based on the URL contained in the request.
- For example, for microservices, you can route traffic to multiple backend services (www.myurl.com to US-EAST-1A and www.myurl.com/images/ to US-EAST-1B)
Scaling:
- Proactive Cyclic Scaling allows you to scale during the desired time window.
Restrictions:
- They do not have IPv4 addresses; you can resolve to them using a DNS name.
- You need at least two public subnets when creating a load balancer.
Perfect Forward Secrecy:
- Perfect Forward Secrecy is a feature that provides additional safeguards against the eavesdropping of encrypted data, through the use of a unique random session key.
- This prevents the decoding of captured data, even if the secret long-term key is compromised.
- Perfect Forward Secrecy and more new security features
Connection draining:
- This enables the load balancer to complete requests to instances that are de-registering or unhealthy.
Elastic Load Balancing FAQs
AWS ELB Cheat Sheet

Types

Application ELBs:
- Best suited for load balancing of HTTP and HTTPs traffic.
- Operate at LAYER 7 and are application aware.
- Capable of intelligent request routing: based on rules for target group, condition, and priority.
- Support path-based routing, host-based routing and containerized applications.
- For example, ELB can see changing currency to $ on a website and redirect traffic to US.
- Benefits of Migrating from a Classic Load Balancer
Network ELBs:
- Best suited for load balancing of TCP traffic where extreme performance is required.
- Specifically designed for high performance traffic that is not conventional web traffic.
- Operate at LAYER 4 and are connection aware.
- Capable of handling millions of requests per second while maintaining low latencies.
Classic ELBs:
- Classic ELBs are the previous generation ELBs.
- Can operate at LAYER 7 (e.g. X-Forwarded and sticky sessions)
- Can also use strict LAYER 4 load balancing for applications that rely on TCP.
- Cheaper than others and is usually used for round-robin DNS.
- If your application stops responding, the ELB responds with "504 Gateway Time-out" error.

Auto Scaling

Auto Scaling allows you to set up dynamic and predictive scaling for scalable AWS resources.
A launch configuration is a template that an Auto Scaling group uses to launch EC2 instances.
- You specify information for the instances such as the ID of the Amazon Machine Image (AMI), the instance type, a key pair, one or more security groups, and a block device mapping.
- Always check that the launch configuration is being triggered properly.
- You can only specify one launch configuration for an Auto Scaling group at a time.
- Launch Configurations
Auto Scaling scales-in according to a hierarchy of decisions
- Choose the Availability Zone with the most instances.
- Choose the Availability Zone with the instances that use the oldest launch configuration.
- Terminate the instances that use the oldest launch configuration.
- Terminate unprotected instances that are closest to the next billing hour.
- Choose one of these instances at random.
Scheduled scaling:
- Allows you to set your own scaling schedule for predictable load changes.
Dynamic scaling:
- You must define how to scale in response to changing demand.
- Target tracking scaling: Change the current capacity based on a target value for a metric.
- Step scaling: Change the current capacity based on a set of scaling adjustments.
- Simple scaling: Change the current capacity based on a single scaling adjustment.
Cooldown timers:
- Useful to keep Auto Scaling from scaling up and down so rapidly.
- Ensures that the Auto Scaling group does not launch or terminate additional EC2 instances before the previous scaling activity takes effect.
- Its default value is 300 seconds.
- Modify the Amazon CloudWatch alarm period that triggers your Auto Scaling scale down policy.
Scaling based on a schedule allows you to scale your application in response to predictable load changes.
- By the time either the CPU or Memory hits a peak, the application already has performance issues.
- You can plan your scaling activities beforehand instead.
AWS Auto Scaling Cheat Sheet

Last updated on 2019-11-6

← Databases Security, Identity, & Compliance →