Peterindia logo

Big Data Analytics (BDA)
Platforms and Tools

18 Platforms & Tools

01
Apache Spark

A unified analytics engine for large-scale data processing.

Visit →
02
Apache Storm

Free and open source distributed realtime computation system. Reliably processes unbounded streams of data — doing for realtime processing what Hadoop did for batch processing.

Visit →
03
Trino

Fast distributed SQL query engine for big data analytics that helps you explore your data universe.

Visit →
04
Apache Hadoop

A framework for the distributed processing of large data sets across clusters of computers.

Visit →
05
Apache Samza

Build stateful applications that process data in real-time from multiple sources including Apache Kafka.

Visit →
06
Apache Airflow

A platform created by the community to programmatically author, schedule and monitor workflows.

Visit →
08
HPCC Systems

A data lake platform for combining different types of data easier and faster.

Visit →
09
Delta Lake

Open-source storage framework enabling Lakehouse architecture with Spark, PrestoDB, Flink, Trino, and Hive. APIs for Scala, Java, Rust, Ruby, and Python.

Visit →
10
Apache Drill

Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage.

Visit →
11
Apache Druid

A real-time database to power modern analytics applications at scale.

Visit →
12
Apache Flink

Stateful computations over data streams — a framework for distributed stream and batch data processing.

Visit →
13
Apache Hive

Data warehouse software that facilitates reading, writing, and managing large datasets in distributed storage using SQL.

Visit →
14
Apache Hudi

Brings transactions, record-level updates/deletes and change streams to data lakes.

Visit →
15
Apache Iceberg

High-performance format for huge analytic tables. Brings the reliability and simplicity of SQL tables to big data.

Visit →
16
Apache Kafka

Open-source distributed event streaming platform for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.

Visit →
17
Apache Kylin

Open source, distributed Analytical Data Warehouse for Big Data, providing OLAP capability in the big data era.

Visit →
18
Presto

Open source distributed SQL query engine for interactive analytic queries against data sources of all sizes — gigabytes to petabytes.

Visit →