2. AWS Object Storage
2
▪ Cloud object storage makes it possible to store practically limitless amounts of
data in its native format
▪ As businesses grow, you're managing rapidly expanding but isolated pools of
data from many sources that are used by any number of applications and
business processes.
▪ Today, many companies struggle with a fragmented storage portfolio that
adds complexity and slows down innovation for business applications.
▪ Object storage helps you break down these silos by providing massively
scalable, cost-effective storage to store any type of data in its native format.
▪ With AWS object storage solutions like Amazon Simple Storage Service
(Amazon S3) and Amazon Glacier, you manage your storage in one place with
an easy-to-use application interface.
▪ You can use policies to optimize storage costs, tiering between different
storage classes automatically. AWS makes storage easier to use to perform
analysis, gain insights, and make better decisions faster.
3. AWS S3
3
▪ Amazon Simple Storage Service (Amazon S3) is storage for the
Internet.You can use Amazon S3 to store and retrieve any amount
of data at any time, from anywhere on the web.
▪ S3 = Simple Storage Service
▪ A Service Oriented Architecture which provides online storage
using web services.
▪ Allows read, write, and delete permissions on objects . Uses REST
and SOAP protocols for messaging, so you can use various
development toolkits with S3.
▪ This guide introduces you to Amazon S3 and how to use the AWS
Management Console to complete the tasks shown in the following
figure.
4. Create an S3 bucket
4
To create an S3 bucket
1. Sign in to the AWS Management Console and open the
Amazon S3 console athttps://console.aws.amazon.com/s3/.
2. Choose Create bucket.
5. Create an S3 bucket
5
3. In the Bucket name field, type a unique DNS-compliant
name for your new bucket. (The example screen shot uses
the bucket name admin-created.You cannot use this name
because S3 bucket names must be unique.) Create your own
bucket name using the follow naming guidelines:
▪ The name must be unique across all existing bucket names in
Amazon S3.
▪ After you create the bucket you cannot change the name, so
choose wisely.
▪ Choose a bucket name that reflects the objects in the bucket
because the bucket name is visible in the URL that points to
the objects that you're going to put in your bucket.
6. Create an S3 bucket
6
4. For Region, choose USWest (Oregon) as the region where
you want the bucket to reside.
5. Choose Create.
6. Else you could
configure for
Versioning , access Log,
Encryption, permissions
etc.
7. Version Control
7
▪ Versioning is a means of keeping multiple variants of an object in
the same bucket.
▪ You can use versioning to preserve, retrieve, and restore every
version of every object stored in your Amazon S3 bucket.
▪ With versioning, you can easily recover from both unintended user
actions and application failures.
▪ Versioning-enabled buckets enable you to recover objects from
accidental deletion or overwrite. For example:
▪ If you delete an object, instead of removing it permanently,
Amazon S3 inserts a delete marker, which becomes the current
object version.You can always restore the previous version.
▪ If you overwrite an object, it results in a new object version in the
bucket.You can always restore the previous version.
8. Version Control
8
Key = a.txt
ID: 1
Key = a.txt
ID: 2
Key = a.txt
ID: 3
• In one bucket, for example, you can have three
objects with the same key, but different version
IDs, such as a.txt (version 1) , a.txt (version 2)
and a.txt (version 3) .
• Buckets can be in one of three states:
unversioned (the default), versioning-enabled,
or versioning-suspended.
• The versioning state applies to all (never some) of the objects in that
bucket.The first time you enable a bucket for versioning, objects in it
are thereafter always versioned and given a unique version ID.
Important
Once you version-enable a bucket, it can never return to an unversioned state.You
can, however, suspend versioning on that bucket.
9. Cross Region Replication(CRR)
9
▪ Cross-region replication is a bucket-level configuration that
enables automatic, asynchronous copying of objects across
buckets in different AWS Regions. We refer to these buckets
as source bucket and destination bucket.These buckets can be
owned by different AWS accounts.
▪ To activate this feature, you add a replication configuration to
your source bucket to direct Amazon S3 to replicate objects
according to the configuration. In the replication
configuration, you provide information such as the following:
– destination bucket
– objects you want to replicate
– storage class to use for object replicas in the destination bucket
(storage class of source bucket)
10. Cross Region Replication(CRR)
Use-Case Scenarios
▪ Compliance requirements – Although, by default, Amazon S3 stores
your data across multiple geographically distant Availability Zones,
compliance requirements might dictate that you store data at even
further distances. Cross-region replication allows you to replicate data
between distant AWS Regions to satisfy these compliance requirements.
▪ Minimize latency –Your customers are in two geographic locations.To
minimize latency in accessing objects, you can maintain object copies in
AWS Regions that are geographically closer to your users.
▪ Operational reasons –You have compute clusters in two different AWS
Regions that analyze the same set of objects.You might choose to
maintain object copies in those Regions.
10
11. Life Cycle Management
11
▪ To manage your objects so that they are stored cost effectively
throughout their lifecycle, configure their lifecycle. A lifecycle
configuration is a set of rules that define actions that Amazon S3
applies to a group of objects.There are two types of actions:
▪ Transition actions - Define when objects transition to
another storage class. For example, you might choose to transition
objects to the STANDARD_IA storage class 30 days after you
created them, or archive objects to the GLACIER storage class one
year after creating them.
▪ There are costs associated with the lifecycle transition requests. For
pricing information, see Amazon S3 Pricing.
▪ Expiration actions - Define when objects expire. Amazon S3
deletes expired objects on your behalf.
▪ The lifecycle expiration costs depend on when you choose to expire
objects.
13. Amazon Glacier
13
What Is Amazon Glacier?
• Amazon Glacier is a storage service optimized for infrequently used data, or "cold
data."
• Amazon Glacier is an extremely low-cost storage service that provides durable
storage with security features for data archiving and backup.
• With Amazon Glacier, customers can store their data cost effectively for months,
years, or even decades.
• Amazon Glacier enables customers to offload the administrative burdens of operating
and scaling storage to AWS, so they don't have to worry about capacity planning,
hardware provisioning, data replication, hardware failure detection and recovery, or
time-consuming hardware migrations.
14. Amazon Glacier
14
What Is Amazon Glacier?
• Amazon Simple Storage Service (Amazon S3) supports lifecycle configuration on an
S3 bucket, which enables you to transition objects to the Amazon S3 GLACIER storage
class for archival.
• When you transition Amazon S3 objects to the GLACIER storage class,Amazon S3
internally usesAmazon Glacier for durable storage at lower cost.
• Although the objects are stored in Amazon Glacier, they remain Amazon S3 objects
that you manage in Amazon S3, and you cannot access them directly through
Amazon Glacier.