Data Ingestion Tools
- 01. Kafka is a popular data ingestion tool that supports streaming data.
- 02. Apache NiFi supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic
- 03. Wavefront is a high-performance streaming analytics platform and it scales to very high data ingestion rates and query loads
- 04. Amazon Kinesis - Easily collect, process, and analyze video and data streams in real time
- 05. Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.
- 06. Fluentd is an open source data collector for unified logging layer
- 07. RCG|enable Data is a solution for managing, preparing and delivering data from a vast array of sources
- 08. StreamSets Data Collector is an easy-to-use modern execution engine for fast data ingestion and light transformations
- 09. Apache Gobblin - A distributed data integration framework that simplifies data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
- 10. Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data.
- 11. Logstash is a free and open server-side data processing pipeline that ingests data from a multitude of sources, transforms it, and then sends it to your favorite "stash."
- 12. Equalum - A performant and scalable data ingestion platform, effectively combining batch and streaming
data pipeline