본문으로 건너뛰기

Optimize Data Management and Analytics with NetApp Solutions for Hadoop

Mike McNamara
Mike McNamara
88 조회수

With NetApp® technology, you get flexibility and choice based on your use case. If you want to use data management by Hadoop Distributed File System (HDFS), we provide NetApp FAS and AFF systems and NetApp E-Series and EF-Series systems. FAS and AFF systems are certified with Hortonworks Data Platform (HDP) 3.0.0, and E-Series and EF-Series systems are certified with HDP 2.6.5. E-Series and EF-Series systems are also Cloudera certified.



If you need a more optimized in-place analytics solution and you don’t require HDFS, you can use the NetApp In-Place Analytics Module. You gain the following key advantages from the NetApp In-Place Analytics Module compared with competitors’ offerings:

  • A single source or copy of data versus a minimum of two or three copies
  • A reduced data footprint by using deduplication and compression; data reduction of between 5 and 10 times compared with HDFS deployments
  • A unified data lake architecture between Hadoop and artificial intelligence (AI) and deep learning (DL) workloads
  • Quick NetApp Snapshots™ copies for testing and development and easy space-efficient backups
  • Ability to run analytics on data that’s in NFS storage
  • Support for new Hadoop features without upgrading the storage
  • Ability to scale storage and compute independently
  • Support for data security by using Ranger and Kerberos
  • Identical architectures between on-premises deployments and cloud deployments by using NetApp Cloud Volumes Service

The following figure depicts in-place analytics on a Hadoop/Spark cluster.

Optimize Data Management and Analytics with NetApp - Inline Image 1

HDP 3.0.0 Certified with NetApp Cloud Volumes

In the cloud, the configuration with HDP 3.0.0 is certified with NetApp Cloud Volumes ONTAP® technology and Cloud Volumes Service running HDFS over NFS. Cloud Volumes Service is a cloud-native service that gives you high-performance file storage in the cloud, with NFS and SMB connectivity. You get rich data management, such as efficient Snapshot copies and clones, as well as an integrated backup service. You can deploy Cloud Volumes Service in seconds with three different performance tiers on AWS or on Google Cloud Platform. If you use Azure, you can deploy a similar native Azure service called Azure NetApp Files, which is also built on NetApp technology.

Optimize Data Management and Analytics with NetApp - Inline Image 2The certified configuration with HDP 3.0.0 enables you to take advantage of the performance and reliability of Cloud Volumes Service in the cloud. At the same time, you benefit from the data management capabilities that are built into HDFS.

NetApp has been supporting Hadoop for many years to deliver advanced solutions for big data analytics. With NetApp technology, you get industry-leading storage and data management that complements your Hortonworks Data Platform that’s built on Hadoop.

Find out how you can optimize data management and accelerate data analytics with NetApp. For more information, visit netapp.com/bigdata.

Mike McNamara

Mike McNamara

Mike McNamara는 NetApp의 제품 및 솔루션 마케팅 분야의 고위 경영진이며 25년이 넘는 데이터 관리 및 클라우드 스토리지 마케팅 경험을 보유하고 있습니다. 10년 전 NetApp에 입사하기에 앞서, McNamara는 Adaptec, Dell EMC, HPE에서 근무했습니다. McNamara는 자사 클라우드 스토리지 오퍼링 및 업계 최초의 클라우드 연결형 AI/ML 솔루션(NetApp), 유니파이드 스케일아웃 및 하이브리드 클라우드 스토리지 시스템 및 소프트웨어(NetApp), iSCSI 및 SAS 스토리지 시스템 및 소프트웨어(Adaptec), 파이버 채널 스토리지 시스템(EMC CLARiiON)의 출시를 이끈 핵심 팀 리더입니다.McNamara는 Fibre Channel Industry Association에서 마케팅 의장을 역임한 경력 외에도 Ethernet Technology Summit Conference Advisory Board와 Ethernet Alliance에서 회원으로 활동하고 있으며, 업계 저널의 고정 기고자로 활동하며 여러 행사에서 연설을 맡기도 했습니다. McNamara는 또한 FriesenPress에서 'Scale-Out Storage - The Next Frontier in Enterprise Data Management'라는 책을 출간했으며, Kapos가 선정한 눈 여겨 볼 상위 50대 B2B 제품 마케터에 이름을 올렸습니다.Mike McNamara의 모든 게시물 보기

다음 단계

Optimize Data Management & Analytics with NetApp Solutions for Hadoop | NetApp Blog