Bigstep Resources

Getting The Most Out Of Impala - Best Practices For Infrastructure Optimization

Getting The MostOut Of Impala Best Practices ForInfrastructure Optimization

Paper Abstract

We tested Cloudera Impala in an effort to understand what hardware setup would provide the best performance/price for it. Our aim is to provide a quick practical guide for choosing the infrastructure to run Impala on.

What's Inside

1
Query Execution Times
Results for a set of 20 queries run on the same data set, ten times on each hardware configuration.
2
Single vs. Dual-CPU Instances
We weren't expecting dramatic score changes between single and dual-CPU instances, but what we found was surprising.
3
Performance/Price Scores
Using our standard cost structure, we added the price per hour for each instance and paired it with the Impala performance score.

Free Download

For your convenience, you will also receive this whitepaper by email.

Discover the Bigstep Metal Cloud

The world's highest performance cloud, purpose-built for big data

Learn more

You Might Also Be Interested In...

Cloud vs. On-PremiseTCO

Cloud vs. On-Premise TCO

See what people often forget when comparing on-premise vs. the cloud.
Read more
NoSQL PerformanceBenchmarks Series Couchbase

NoSQL Performance Benchmarks Series: Couchbase

Learn about the scaling profiles of distributed database technologies & identify optimum performance/price environments.
Read more
NoSQL Performance Measuring CouchbasePerformance with Couchdoop

NoSQL Performance: Measuring Couchbase Performance with Couchdoop

Besides evaluating Couchdoop itself, we also pushed Couchbase to its limits by leveraging Hadoop’s parallelism.
Read more