June 24, 2014

Getting the most out of Impala - presentation & whitepaper

By Ioana Hreninciuc in Big Data Benchmarks

In May, Alex, our techie Product Manager gave a presentation at the London Enterprise Technology Meetup on infrastructure best practices for getting the most out of Cloudera Impala.We apologize for being a bit slow to sharing our findings with the internet at large, here we are trying to make up for it now.We started testing Impala in an effort to understand what hardware setup would provide the best performance/price for it. We didn’t want to see it perform in extreme cases, but in regular situations that most users would encounter. We aimed to provide a quick practical guide for choosing the infrastructure to run Impala on.

In May, Alex, our techie Product Manager gave a presentation at the London Enterprise Technology Meetup on infrastructure best practices for getting the most out of Cloudera Impala.

We apologize for being a bit slow to sharing our findings with the internet at large, here we are trying to make up for it now.

We started testing Impala in an effort to understand what hardware setup would provide the best performance/price for it. We didn’t want to see it perform in extreme cases, but in regular situations that most users would encounter. We aimed to provide a quick practical guide for choosing the infrastructure to run Impala on.

With this in mind, we looked at a medium sized deployment of 4 Full Metal Compute Instances, and we scaled the hardware from single CPU, low RAM capacity to dual CPU, high RAM capacity.

We didn’t start out aiming to prove any particular assumption. The purpose of the project was to explore and understand how Impala works with hardware and our findings were quite surprising.

We also put together a whitepaper, that you can download or print by clicking here.

If you have any questions or would like us to do a benchmark on a certain technology, just drop a comment and we’ll get right back to you.

Got a question? Need advice? We're just one click away.

Sharing is caring:

Back to articles

Readers also enjoyed:

February 27, 2015

Choosing the Best Method for Outside Network Access

By Daniela Mustatea in Performance for Big Data Apps

Some lucky businesses are housed in a single building, making networking a cinch. Others, however, are sprawled across large campuses, a metro area, or…

July 17, 2014

What makes a good bare metal cloud?

By Daniela Mustatea in Performance for Big Data Apps

Bare metal cloud is a relatively new term to add to the technology lexicon. Emerging as a concept a few years ago, it has started to gain awareness and…

September 16, 2013

Why bare metal cloud gives a winning performance

By Daniela Mustatea in Big Data Use Cases

There is an elephant in the room when it comes to virtualisation. Of course there are many benefits, but ultimately virtualisation eats performance and…

Your email address will not be published.

Getting the most out of Impala - presentation & whitepaper

Readers also enjoyed:

Choosing the Best Method for Outside Network Access

What makes a good bare metal cloud?

Why bare metal cloud gives a winning performance

Leave a Reply