Getting The Most Out Of Impala - Best Practices For Infrastructure Optimization
We tested Cloudera Impala to understand which hardware setup delivers the best performance for the price. Our aim is to provide a quick, practical guide to choosing the infrastructure to run Impala on.
Query Execution Times
Results for a set of 20 queries run on the same data set, ten times on each hardware configuration.
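One way to summarize repeated runs like these is to average each query's execution time across its repetitions and then sum the averages for each hardware configuration. The sketch below assumes timings are recorded in seconds and keyed by a query identifier. The function names and the data layout are illustrative and are not taken from the whitepaper.

```python
from statistics import mean


def mean_time_per_query(runs):
    """Average the execution time of each query over its repetitions.

    `runs` maps a query id to a list of execution times in seconds,
    one entry per repetition (hypothetical layout, for illustration).
    """
    return {query: mean(times) for query, times in runs.items()}


def total_mean_time(runs):
    """Sum the per-query averages to get one figure per configuration."""
    return sum(mean_time_per_query(runs).values())
```

Calling `total_mean_time` once per hardware configuration yields a single number per configuration that can be compared directly; lower is better.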
Single vs. Dual-CPU Instances
We weren't expecting dramatic score changes between single and dual-CPU instances, but what we found was surprising.
Using our standard cost structure, we took the price per hour of each instance and paired it with its Impala performance score.
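As a rough illustration of pairing price with performance, the sketch below divides a performance score by the hourly price and ranks instances by the result. The ratio itself is an assumption about how the two figures might be combined. The `Instance` fields are hypothetical, and no real prices or scores are implied.

```python
from dataclasses import dataclass


@dataclass
class Instance:
    name: str
    price_per_hour: float  # hourly price in any single currency (hypothetical field)
    score: float           # performance score, higher is better (hypothetical field)


def score_per_price(instance):
    """Return performance score per unit of hourly price."""
    if instance.price_per_hour <= 0:
        raise ValueError("price_per_hour must be positive")
    return instance.score / instance.price_per_hour


def rank_by_value(instances):
    """Sort instances from best to worst score-per-price ratio."""
    return sorted(instances, key=score_per_price, reverse=True)
```

A ratio is only one way to combine the two numbers. If absolute query latency matters more than cost efficiency, the raw scores should be read alongside it.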