From the course: AWS Certified Data Engineer Associate (DEA-C01) Cert Prep

Unlock this course with a free trial

Join today to access over 25,500 courses taught by industry experts.

Amazon Redshift

Amazon Redshift

- [Instructor] Acquiring large datasets of up to petabytes presents a challenge and requires a data storage solution that is optimized for analytical processing. In this lesson, we'll introduce Amazon's managed data warehouse solution Redshift. Amazon Redshift is a managed data warehouse that uses a massively parallel processing architecture to query large datasets up to more than a petabyte in size. You have your choice of creating a provision cluster or using a serverless option. Since most data warehouses run for a long time, you want to reserve the node capacity to get the benefit of discounts, which are up to 60% off. Redshift deploys a cluster of nodes in a single AZ. If you need instant failover in the case of an AZ outage, you could choose the multi-AZ option, which is like the multi-AZ option for RDS. Basically, there's a second cluster in a different AZ and data is replicated to it, so it could take over in the case of a failure. Redshift continually backs up your data using…

Contents