Category: Big Data

AWS Case Studies / AWS Services / Big Data / Database

September 9, 2018

Why Use AWS Redshift Spectrum with Data Lake

A data lake is a centralized repository that allows you to store all your structured and unstructured data at any scale. AWS uses S3 to store data in any format, securely, and at a...

AWS Case Studies / AWS Services / Big Data / Database

September 5, 2018

Data Storage for Big Data: Aurora, Redshift or Hadoop?

I mentioned in my post “Which is Right Hadoop Solution for You?” that if your Big Data is below 1.6PB, you may want to take a look at the Redshift data warehouse option. When we...

Big Data

August 19, 2018

Which is Right Hadoop Solution for You?

As I mentioned in the Hadoop ecosystem cheat sheet, the Hadoop ecosystem is open-source with plenty of add-on packages; additionally, you can build your own Hadoop system with these free resources. However, it will be...

Big Data

August 13, 2018

Apache Hadoop Ecosystem Cheat Sheet

Apache Hadoop 3.1.1 was released on the eighth of August with major changes to YARN such as GPU and FPGA scheduling/isolation on YARN, docker container on YARN, and more expressive placement constraints in YARN. Apache...

Category: Big Data

Why Use AWS Redshift Spectrum with Data Lake

Data Storage for Big Data: Aurora, Redshift or Hadoop?

Which is Right Hadoop Solution for You?

Apache Hadoop Ecosystem Cheat Sheet

Recent Posts

About Our Blog

Archives

Sign Up

CLOUD COMPUTING CERTS

Training Courses

Category: Big Data

Why Use AWS Redshift Spectrum with Data Lake

Data Storage for Big Data: Aurora, Redshift or Hadoop?

Which is Right Hadoop Solution for You?

Apache Hadoop Ecosystem Cheat Sheet

Recent Posts

About Our Blog

Archives

Sign Up

CLOUD COMPUTING CERTS

Training Courses

Popular Posts