Best Big Data Processing and Distribution Software

Products Buyer's Guide
8.6
Hadoop HDFS
★★★★★

Hadoop HDFS

Hadoop HDFS is a distributed, scalable, and portable filesystem written in Java.

If you are from Bigdata or Analytics background, you need to use HDFS. My experience is very good. - AAKASH C.

Ease of use
7.0
Support
0.0
Ease of Setup
0.0
9.4
Upsolver
★★★★★

Upsolver

A Data Preparation Platform that lets you prepare and deliver data at massive scale in a matter of minutes.

A good solution handled by capable and available engineers - Assaf L.

Ease of use
8.8
Support
9.6
Ease of Setup
0.0

Languages supported: English

Platforms: Mac, Win, Linux

Price: $$$$$

Business Size: 1

8.8
ASG Enterprise Data Intelligence
★★★★★

ASG Enterprise Data Intelligence

ASG Technologies' Enterprise Data Intelligence Solution delivers a tool-agnostic solution that supports the creation of custom metadata interfaces for your enterprise sources, providing a complete data lineage knowledge base. The range and flexibility offered by ASG includes discovery of mainframe, distributed and other ETL code, analyzing to ensure there are no gaps in your end-to-end lineage.

Nice data governance and data lineage tool - Consultant in Insurance

Ease of use
6.3
Support
0.0
Ease of Setup
0.0
8.4
IBM Analytics Engine
★★★★★

IBM Analytics Engine

Build and deploy clusters within minutes with simplified user experience, scalability, and reliability. Custom configure the environment. Administer through multiple interfaces. Scale on demand.

Very Useful Software & You Can Easily Analyze Your Analytics - Preeti G.

Ease of use
Support
Ease of Setup
8.0
Amazon EMR
★★★★★

Amazon EMR

Amazon EMR is a web-based service that simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective to distribute and process vast amounts of data across dynamically scalable Amazon EC2 instances.

Fast Processing - Rishab S.

Ease of use
7.9
Support
7.9
Ease of Setup
8.1
9.8
TIMi Suite
★★★★★

TIMi Suite

The TIMi Suite: a complete and integrated suite of datamining tools that are covering all your analytical needs for your enterprise!

Highly optimized tool for data processing, predictive analysis and even data storage - loic M.

Ease of use
9.2
Support
0.0
Ease of Setup
0.0

Languages supported: English, French, Spanish

9.0
Apache Storm for HDInsight
★★★★★

Apache Storm for HDInsight

Apache Storm is a distributed, fault-tolerant, open-source, real-time event processing solution for large, fast streams of data.

this software is really good - Executive Sponsor in Higher Education

Ease of use
Support
Ease of Setup
9.6
Dremio
★★★★★

Dremio

Dremio is a data analysis software. It is self-service data platform provided that users discover, accelerate and share data at any time.

Made us rethink our whole architecture! - Mark Z.

Ease of use
9.2
Support
9.8
Ease of Setup
8.9

Languages supported: English

9.6
DNIF – BIG DATA ANALYTICS
★★★★★

DNIF – BIG DATA ANALYTICS

DNIF allows you to partition one data infrastructure and enable multiple teams to solve many challenges. DNIF makes it easy to have multiple users working and solving different problems using the same data layer

Powerful Data Analysis Software. - Annette M.

Ease of use
Support
Ease of Setup

Languages supported: English

8.6
Ataccama One
★★★★★

Ataccama One

Ataccama One is a master data management software that combines data profiling and analysis, data quality alteration and MDM capabilities.

One tool for everything - Consultant in Music

Ease of use
Support
Ease of Setup

Languages supported: German, English, French, Spanish

8.4
Druid
★★★★★

Druid

Open source streaming data store for interactive analytics at scale.

Druid, Kafka and your favourite Dashboard - Shashank N.

Ease of use
8.1
Support
8.3
Ease of Setup
0.0
8.2
Google Cloud Dataflow
★★★★★

Google Cloud Dataflow

Cloud Dataflow is a fully-managed service for transforming and enriching data in stream (real time) and batch (historical) modes with equal reliability and expressiveness.

Great tool to build both Batch and Stream BigData pipelines - Consultant in Airlines/Aviation

Ease of use
8.0
Support
8.1
Ease of Setup
7.4
9.2
Snowflake
★★★★★

Snowflake

Snowflake's cloud data platform shatters the barriers that have prevented organizations of all sizes from unleashing the true value from their data. Thousands of customers deploy Snowflake to advance their organizations beyond what was possible by deriving all the insights from all their data by all their business users. Snowflake equips organizations with a single, integrated platform that offers the only data warehouse built for the cloud; ...

Using actively Snowflake as base cloud warehouse solution. - valentin c.

Ease of use
9.0
Support
9.5
Ease of Setup
8.3

Languages supported: English

8.8
Google BigQuery
★★★★★

Google BigQuery

Analyze Big Data in the cloud with BigQuery. Run fast, SQL-like queries against multi-terabyte datasets in seconds. Scalable and easy to use, BigQuery gives you real-time insights about your data.

BigQuery is an amazing tool that has simplified my whole workflow - Nick B.

Ease of use
8.2
Support
8.9
Ease of Setup
7.8
8.0
Qubole
★★★★★

Qubole

Qubole delivers a Self-Service Platform for Big Data Analytics built on Amazon, Microsoft and Google Clouds

Great tool to manage Big Data. - Ferdinand P.

Ease of use
7.8
Support
7.9
Ease of Setup
6.2

Languages supported: English

9.2
Snowplow Analytics
★★★★★

Snowplow Analytics

Snowplow is a data delivery platform that collects and operationalizes behavioral data at scale. We empower you and your team to rise above the difficulties of data delivery and organization, enabling you to focus on your data journey.

Great tracking tool to own your own data at a granular level with less cost - Jason Y.

Ease of use
8.0
Support
0.0
Ease of Setup
0.0

Languages supported: English

8.2
Cloudera
★★★★★

Cloudera

Cloudera Enterprise Core provides a single Hadoop storage and management platform that natively combines storage, processing and exploration for the enterprise.

The Best hadoop Aplication. - Administrator in Telecommunications

Ease of use
7.8
Support
8.0
Ease of Setup
0.0
8.4
Databricks
★★★★★

Databricks

Making big data simple

Very powerful yet easy to use distributed computing and data warehousing platform - Prashidha K.

Ease of use
8.2
Support
8.2
Ease of Setup
7.7
8.4
Apache Ambari
★★★★★

Apache Ambari

Apache Ambari is a software project designed to enable system administrators to provision, manage and monitor a Hadoop cluster, and also to integrate Hadoop with the existing enterprise infrastructure.

Would recommend if you are towards BigData - Minhaj B.

Ease of use
8.9
Support
8.0
Ease of Setup
8.7
9.2
Azure Data Lake Store
★★★★★

Azure Data Lake Store

Azure Data Lake Store is secured, massively scalable, and built to the open HDFS standard, allowing you to run massively-parallel analytics.

Astonishing performance when processing data - Internal Consultant in Information Technology and Services

Ease of use
8.7
Support
9.3
Ease of Setup
0.0
8.0
Apache Apex
★★★★★

Apache Apex

Apache Apex is an enterprise grade native YARN big data-in-motion platform designed to unify stream processing as well as batch processing.

Apache Apex; An Open Source Streaming Analytics Solution - Sanskriti G.

Ease of use
Support
Ease of Setup
10.0
GI Big Data
★★★★★

GI Big Data

Mutualized Cloud Data Warehouse, Complete Big Data Platform

Great company, very responsive and adaptable - Pete W.

Ease of use
Support
Ease of Setup

Languages supported: English, French

8.8
Microsoft SQL
★★★★★

Microsoft SQL

SQL Server 2017 brings the power of SQL Server to Windows, Linux and Docker containers for the first time ever, enabling developers to build intelligent applications using their preferred language and environment. Experience industry-leading performance, rest assured with innovative security features, transform your business with AI built-in, and deliver insights wherever your users are with mobile BI.

A very powerful data management tool that is easier to utilize than you might expect. - David M.

Ease of use
8.5
Support
8.8
Ease of Setup
0.0

Languages supported: German, English, French, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Chinese (Simplified)

8.8
Pepperdata Cloud Performance
★★★★★

Pepperdata Cloud Performance

Pepperdata automatically optimizes system resources while providing a detailed, correlated understanding of each application using hundreds of application and infrastructure metrics collected in real-time. It highlights applications that need attention, automatically identifies bottlenecks, and alerts on duration, failure conditions, and resource usage. In the cloud or on-premises, this automated approach gives you complete observability and ...

Pepperdata - Complete Spark Cluster performance monitoring - Uddipan M.

Ease of use
8.0
Support
9.2
Ease of Setup
8.3

Languages supported: English

7.8
Azure HDInsight
★★★★★

Azure HDInsight

HDInsight is a fully-managed cloud Hadoop offering that provides optimized open source analytic clusters for Spark, Hive, MapReduce, HBase, Storm, Kafka, and R Server backed by a 99.9% SLA.

Still Learning - Lisa H.

Ease of use
7.8
Support
7.5
Ease of Setup
8.7
9.0
GigaSpaces InsightEdge
★★★★★

GigaSpaces InsightEdge

GigaSpaces InsightEdge is an always-on platform for your mission-critical applications across cloud, on-premise or hybrid. The platform operationalizes machine learning and transactional processing, at scale; analyzing data as it's born, enriching it with historical context, for instant insight to action.

InsightEdge used with success in a major european car manufacturer to provide WLTP results - FrГ©dГ©ric W.

Ease of use
8.3
Support
9.2
Ease of Setup
7.0
8.0
Apache Beam
★★★★★

Apache Beam

Apache Beam is an open source unified programming model designed to define and execute data processing pipelines, including ETL, batch and stream processing.

Experience with Apache Beam ---> So far so Good. - Consultant in Automotive

Ease of use
7.9
Support
7.9
Ease of Setup
7.5
8.6
Pentaho Data Integration
★★★★★

Pentaho Data Integration

Enable users to ingest, blend, cleanse and prepare diverse data from any source. With visual tools to eliminate coding and complexity, Pentaho puts the best quality data at the fingertips of IT and the business.

ETL for Dashboards - Consultant in Information Technology and Services

Ease of use
8.6
Support
8.4
Ease of Setup
0.0
8.0
Oracle Big Data Cloud Service
★★★★★

Oracle Big Data Cloud Service

Oracle Big Data Cloud Service offers an integrated portfolio of products to help organize and analyze diverse data sources alongside existing data.

Helped Streamline our Data - Joseph S.

Ease of use
7.4
Support
7.8
Ease of Setup
0.0
B2B Software Guide