AWS Guest Blog: Accelerating Apache and Hadoop Migrations with Cazena’s Data Lake as a Service on AWS


March 23, 2020

By: Jacob Cokely (AWS), Dan Taoka (AWS), Lovan Chetty (Cazena)

Cazena is honored to be featured on the AWS APN blog with our recent post: Accelerating Apache and Hadoop Migrations with Cazena’s Data Lake as a Service on AWS.

Are you thinking of migrating to an Apache Hadoop and Spark cluster or data lake for analytics on the Amazon Web Services (AWS) Cloud? You’re not alone. Companies migrate data and analytics workloads to AWS for a variety of reasons, including significant skills shortages for managing complex stacks in on-premises data centers.

Running Hadoop, Spark, and related technologies in the cloud provides the flexibility required by these distributed systems. Augmenting the Hadoop Distributed File System (HDFS) with an object store like Amazon Simple Storage Service (Amazon S3) also has a significant positive impact on cost and resiliency. Not having to manage the physical infrastructure is a big plus.

However, on-premises Hadoop and Spark instances typically contain a large amount of sensitive data. To protect it, you need to move your on-premises security and compliance controls to the cloud before you move the data itself. You also need to determine the cloud resources each workload needs to meet its performance service level agreements (SLAs) on the cloud system. Finally, you need to ensure the tools and data flows you have invested in over the years still work with the new cloud deployment.

A few cloud vendors and systems integrators have programs to handle these types of issues. The Amazon EMR migration program, for instance, offers a written guide and free local workshop to help you migrate on-premises workloads to the Amazon EMR big data platform.

Cazena provides a production-ready, continuously optimized and secured Data Lake as a Service. Available on AWS Marketplace, it has multiple features that let you migrate your Hadoop and Spark analytics workloads to AWS without specialized skills. In this post, we walk through those features and explain how they simplify migration while keeping your data as secure in the cloud as it is on-premises.

Read full blog on AWS Partner Network >>

Full article written by:
Lovan Chetty, VP Product at Cazena
Jacob Cokely, ISV Workload Migration Program PDM at AWS
Dan Taoka, ISV Workload Migration Program PSA at AWS

 
