Best Big Data Processing and Distribution Software

Products Buyer's Guide
8.6
Google Cloud Dataprep
★★★★★

Google Cloud Dataprep

Google Cloud Dataprep is an intelligent data service for visually exploring, cleaning, and preparing structured and unstructured data for analysis. Cloud Dataprep is serverless and works at any scale.

Use this program daily, saves tons of time - Nathan L.

Ease of use
9.1
Support
8.5
Ease of Setup
8.7
7.4
Apache Storm
★★★★★

Apache Storm

Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing.

Data Processing Engine - YOGESH B.

Ease of use
6.0
Support
7.4
Ease of Setup
7.7
8.8
HVR
★★★★★

HVR

HVR is designed to move large volumes of data fast and efficiently in complex environments for real-time updates.

Awesome powerful product - David K.

Ease of use
8.3
Support
8.8
Ease of Setup
0.0
8.0
Apache Spark for Azure HDInsight
★★★★★

Apache Spark for Azure HDInsight

Apache Spark for Azure HDInsight is an open source processing framework that runs large-scale data analytics applications.

How well Apache Spark can be efficient in the project - Consultant in Information Technology and Services

Ease of use
8.5
Support
7.5
Ease of Setup
0.0
8.2
Hazelcast IMDG
★★★★★

Hazelcast IMDG

A New Lightweight, Distributed Data Processing Engine

Hazlecast IMDG helped us to reduce service transaction response times by an order of magnitude. - Tharanga H.

Ease of use
8.8
Support
7.8
Ease of Setup
0.0
9.0
Apache Falcon
★★★★★

Apache Falcon

Apache Falcon is a feed processing and feed management system designed to make it easier for end consumers to onboard their feed processing and feed management on hadoop clusters.

Good Product - Internal Consultant in Banking

Ease of use
Support
Ease of Setup
9.0
Apache Bahir
★★★★★

Apache Bahir

Apache Bahir provides extensions to multiple distributed analytic platforms, extending their reach with a diversity of streaming connectors and SQL data sources.

Curate extensions and plugins with ease! - Jared H.

Ease of use
Support
Ease of Setup
9.0
AWS Lake Formation
★★★★★

AWS Lake Formation

AWS Lake Formation is a service that makes it easy to set up a secure data lake in days. A data lake is a centralized, curated, and secured repository that stores all your data, both in its original form and prepared for analysis.

My AWS Lake Formation Review - User in Information Technology and Services

Ease of use
Support
Ease of Setup
9.0
Oracle Enterprise Management
★★★★★

Oracle Enterprise Management

Oracle Big Data Cloud at Customer delivers the complete value of Oracle Big Data Cloud Service to customers who require their Big Data platform to be located on-premises.

Oracle Big Data Cloud at Customer - Aditya S.

Ease of use
Support
Ease of Setup
8.0
Apache Fluo
★★★★★

Apache Fluo

Apache Fluo is an open source implementation of Percolator (which populates Google's search index) for Apache Accumulo.

WorkFluos with Apache Fluo - Internal Consultant in Civil Engineering

Ease of use
Support
Ease of Setup
8.0
Qlik Catalog
★★★★★

Qlik Catalog

Qlik Data Catalyst accelerates the transition towards modern data management by providing essential capabilities in four areas.

Podium for data quality - User in Information Technology and Services

Ease of use
Support
Ease of Setup
10.0
FlinkML
★★★★★

FlinkML

FlinkML is the Machine Learning (ML) library for Flink it has a growing list of algorithms and contributors that aim to provide scalable ML algorithms, an intuitive API, and tools that help minimize glue code in end-to-end ML systems.

Very good software for worke - Marvin P.

Ease of use
Support
Ease of Setup
10.0
Apache Chukwa
★★★★★

Apache Chukwa

Apache Chukwa is an open source data collection system for monitoring large distributed systems.

Good performance - Consultant in Telecommunications

Ease of use
Support
Ease of Setup
10.0
Alibaba MaxCompute
★★★★★

Alibaba MaxCompute

Alibaba MaxCompute (previously known as ODPS) is a general purpose, fully managed, multi-tenancy data processing platform for large-scale data warehousing. MaxCompute supports various data importing solutions and distributed computing models, enabling users to effectively query massive datasets, reduce production costs, and ensure data security

An excellent solution for large scale data storage - Alexa T.

Ease of use
Support
Ease of Setup
10.0
Bright Cluster Manager
★★★★★

Bright Cluster Manager

Bright Computing provides comprehensive software solutions for provisioning and managing HPC clusters, Hadoop clusters, and OpenStack private clouds in your data center or in the cloud.

Clustered efficiency at its best - Administrator in Information Technology and Services

Ease of use
Support
Ease of Setup
10.0
IBM BigInsights
★★★★★

IBM BigInsights

IBMВ® BigInsightsВ® is an enterprise platform that combines Hadoop and Spark for fast analysis and processing of data. The solution includes Spark, SQL, text analytics and more to help you easily integrate and analyze big data. With IBM, spend less time creating an enterprise-ready Hadoop infrastructure, and more time gaining valuable insights.

IBM Biginsights for begineers and research projects. - User in Public Policy

Ease of use
Support
Ease of Setup
10.0
PHEMI Health DataLab
★★★★★

PHEMI Health DataLab

PHEMI Health DataLab is a cloud-based, big data management system that enables organizations to generate value from healthcare data by simplifying ingestion and de-identification of data with military-grade governance, privacy, and security built-in. This enables responsible access to more information that advances innovation by researchers, scientists, and clinicians.

Very reliable - User in Health, Wellness and Fitness

Ease of use
Support
Ease of Setup

Languages supported: English

B2B Software Guide