Dell EMC is focused on providing information that helps customers make the most of their big data technology investment. The failure rate for Hadoop big data projects is still too high given the maturity of the technology.  Customers can’t afford to guess when designing and sizing a solution; they need to deliver optimal performance for their business use cases and to scale as needed. Dell EMC recently completed and published a new TPCx-BigBench (TPCx-BB) result that will help customers make the right choices for Hadoop performance and scalability. Today we are happy to announce that

Dell EMC is the industry-leading supplier of hyper-converged, converged and “Ready” Solutions by many standards. Dell EMC’s tested and validated Ready Bundle for Cloudera Hadoop, together with the right performance benchmark results, takes the guesswork out of Hadoop implementations.

The Transaction Processing Performance Council (TPC) is a non-profit corporation founded to define transaction processing and database benchmarks and to disseminate objective, verifiable TPC performance data to the industry. Benchmarking has long been a standard practice of the computer industry and is used to discover, measure and assess the relative performance of alternative systems and configurations. This information can then be customized or extrapolated as an input to the design of systems that will provide similar services for real-world applications.

Much as Relational Database Management Systems (RDBMSs) went through years of development and maturation, there is now a rapidly expanding ecosystem of both complementary and competing Big Data Analytics Systems (BDAS). As the big data analytics ecosystem matures, the need to evaluate and compare the performance and price/performance of these systems becomes more pressing. To address this need, the TPC has developed a big data benchmarking specification called TPCx-BB. The TPCx-BB benchmark was developed to cover essential functional and business aspects of big data use cases. It allows for an objective measurement of a BDAS and provides verifiable performance, price/performance, and availability metrics for those considering new investments.

Dell is the first to cross the significant milestone of publishing a result using SF10000, which is the largest data set executed thus far for TPCx-BB. SF10000 maps to a roughly 10TB data set, which typically takes longer to execute than the smaller Scale Factors (1000 and 3000).

How realistic is the benchmark?

The TPCx-BB benchmark is designed to stress the CPU and IO systems of a BDAS using one or more concurrent streams. The test includes 30 unique queries in a simulated workload that is typical of real-world analytic applications. For a test to run and successfully pass an audit, two sequential performance runs must be executed. Each run consists of three phases: Load, Power and Throughput.

Results

The chart below shows the breadth of the query types and the average elapsed time in seconds to process the Power test.

The overall TPCx-BB performance data for the Dell R730/R730xd configuration is summarized in the table below:

 

Metric                Result
Load Test             3,955.78 s
Power Test            53,087.39 s
Throughput Test       88,714.53 s
Performance Metric    495.283 BBQpm@SF10000
Total System Cost     $439,187
Price/Performance     886.75 $/BBQpm@SF10000
Availability Date     May 12, 2017
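
As a quick sanity check on how these figures relate, the short Python sketch below recomputes price/performance as total system cost divided by the performance metric, and converts the three phase times into hours. It is an illustration only: the function names are hypothetical, the inputs are copied from the table, and treating the three phase times as simply additive for a single run is an assumption rather than something stated in the published result.

```python
# Figures copied from the published result table (SF10000).
TOTAL_SYSTEM_COST_USD = 439_187
PERFORMANCE_BBQPM = 495.283
PUBLISHED_PRICE_PERFORMANCE = 886.75  # $/BBQpm@SF10000

# Phase times in seconds, as listed in the table.
PHASE_SECONDS = {
    "Load": 3955.78,
    "Power": 53087.39,
    "Throughput": 88714.53,
}


def price_performance(total_cost_usd: float, bbqpm: float) -> float:
    """Dollars per unit of benchmark throughput (lower is better)."""
    return total_cost_usd / bbqpm


def total_hours(phase_seconds: dict) -> float:
    """Sum of the phase times in hours (assumes the phases are additive)."""
    return sum(phase_seconds.values()) / 3600


if __name__ == "__main__":
    ratio = price_performance(TOTAL_SYSTEM_COST_USD, PERFORMANCE_BBQPM)
    print(f"Recomputed price/performance: {ratio:.2f} $/BBQpm@SF10000")
    print(f"Published price/performance:  {PUBLISHED_PRICE_PERFORMANCE:.2f} $/BBQpm@SF10000")
    print(f"Load + Power + Throughput:    {total_hours(PHASE_SECONDS):.2f} hours")
```

The recomputed ratio should land within a cent or two of the published figure. Any small gap is most likely down to rounding in the published inputs, so the published value remains the authoritative one.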

 

The high failure rate of big data projects has left many organizations wary of adopting Hadoop despite the overwhelming evidence of the business benefits of big data technologies.  Dell EMC helps customers through the Data Analytics journey by providing a robust portfolio of solutions that can match their needs from early sandbox development through support of large deployments for multiple use cases.

We simplify implementing and expanding your Hadoop capabilities with certified architectures, custom solution design, and hardware and software deployment, coupled with ongoing support and training. Please visit our site and take us on the journey with you!

 

Nicholas Wakou

Nicholas Wakou is a Senior Principal Performance Engineer with the Dell EMC Open Source Solutions team. His role, interests and activities are focused on characterizing and optimizing the performance of Dell EMC Cloud and Big Data solutions. Nicholas is actively engaged in industry efforts to define performance benchmark specifications. He is active on the SPEC (www.spec.org) Cloud committee and several committees of the TPC (www.tpc.org). Nicholas represents Dell Technologies on the Board of Directors of the TPC and on its Technical Advisory Board (TAB). Previously, he was Chair of the TPC Public Relations standing committee. Nicholas has an MS in Electrical Engineering from Oklahoma State University, an MS in Microelectronics Technology from Middlesex University, London, and a BSc in Electrical Engineering from Makerere University, Kampala, Uganda.

One Comment

  1. Betty Wakou says:

    I love the information and the flow. Thank you for your hard work and DELL in providing these solutions to customers.
