by Sandeep R. Patil, Wei G. Gong, Pallavi Galgali, Piyush Chaudhary, Muthu Muthiah, Yong ZY Zheng, Larry Coyne, IBM Redbooks · 2018
ISBN: 0738456969 9780738456966
Category: Computers / System Administration / Storage & Retrieval
Page count: 30
<p>This IBM® Redpaper™ publication provides guidance on building an enterprise-grade data lake by using IBM Spectrum™ Scale and Hortonworks Data Platform to perform in-place Hadoop or Spark-based analytics. It covers the benefits of the integrated solution, and gives guidance on the available deployment models and on considerations for implementing them.</p><p>Hortonworks Data Platform (HDP) is a leading Hadoop and Spark distribution. HDP addresses the complete needs of data-at-rest, powers real-time customer applications, and delivers robust analytics that accelerate decision making and innovation.</p><p>IBM Spectrum Scale™ is flexible and scalable software-defined file storage for analytics workloads. Enterprises around the globe have deployed IBM Spectrum Scale to form large data lakes and content repositories for high-performance computing (HPC) and analytics workloads. It can scale both performance and capacity without bottlenecks.</p>