Try our new research platform with insights from 80,000+ expert users
Apache Spark Streaming Logo

Apache Spark Streaming pros and cons

Vendor: Apache
3.9 out of 5

Pros & Cons summary

Buyer's Guide

Get pricing advice, tips, use cases and valuable features from real users of this product.
Get the report

Prominent pros & cons

PROS

Apache Spark Streaming offers efficiency and stability, making it a reliable solution.
It provides near real-time analytics and supports easy API development for coding streaming pipelines.
Integration with other stock services and open-source nature makes it a cost-effective choice.
The platform's scalability and full-featured nature enhance data freshness rates and lower latency.
Micro-batching capabilities, along with different window types, optimize data processing.

CONS

Apache Spark Streaming could improve in the area of user configuration, making it less developer-focused and more business-user friendly.
The initial setup of Apache Spark Streaming is complex and resource-intensive, even for small-scale applications.
Apache Spark Streaming faces issues with memory management, latency, and lacks real-time analytics capabilities, which restricts its use for use cases like IOT-based or anomaly detection with millisecond latency.
The cost-related and load-related optimizations are areas where Apache Spark Streaming needs enhancement.
Monitoring and troubleshooting Apache Spark Streaming can be challenging due to the generation of numerous logs that become unmanageable over time.
 

Apache Spark Streaming Pros review quotes

Venkata Phaneendra Reddy Janga - PeerSpot reviewer
Aug 18, 2025
By integrating Apache Spark Streaming, the data freshness rate, and latency have significantly improved from 24-hour batch processing to less than one minute, facilitating faster communication to downstream systems, aiding marketing campaigns.
Himansu Jena - PeerSpot reviewer
Aug 19, 2025
With Apache Spark Streaming's integration with Anaconda and Miniconda with Python, I interact with databases using data frames or data sets in micro versions and create solutions based on business expectations for decision-making, logistic regression, linear regression, or machine learning which provides image or voice record and graphical data for improved accuracy.
Kuldeep Pal - PeerSpot reviewer
Aug 22, 2025
With Apache Spark Streaming, you can have multiple kinds of windows; depending on your use case, you can select either a tumbling window, a sliding window, or a static window to determine how much data you want to process at a single point of time.
Learn what your peers think about Apache Spark Streaming. Get advice and tips from experienced pros sharing their opinions. Updated: September 2025.
867,676 professionals have used our research since 2012.
Shahzad Munir - PeerSpot reviewer
Aug 25, 2025
I appreciate Apache Spark Streaming's micro-batching capabilities; the watermarking functionality and related features are quite good.
DR
Jun 8, 2023
Apache Spark Streaming was straightforward in terms of maintenance. It was actively developed, and migrating from an older to a newer version was quite simple.
RK
Jun 3, 2024
Apache Spark's capabilities for machine learning are quite extensive and can be used in a low-code way.
Prashast Tripathi - PeerSpot reviewer
Jul 24, 2023
Apache Spark Streaming has features like checkpointing and Streaming API that are useful.
Ajay Hiremath - PeerSpot reviewer
Sep 9, 2025
For Apache Spark Streaming, the feature I appreciated most is that it provides live data delivery; additionally, it provides the capability to send a larger amount of data in parallel.
Oscar Estorach - PeerSpot reviewer
Aug 18, 2021
The solution is very stable and reliable.
SB
Nov 21, 2022
It's the fastest solution on the market with low latency data on data transformations.
 

Apache Spark Streaming Cons review quotes

Venkata Phaneendra Reddy Janga - PeerSpot reviewer
Aug 18, 2025
One improvement I would expect is real-time processing instead of micro-batch or near real-time.
Himansu Jena - PeerSpot reviewer
Aug 19, 2025
When dealing with various data types including COBOL, Excel, JSON, video, audio, and MPG files, challenges can arise with incomplete or missing values.
Kuldeep Pal - PeerSpot reviewer
Aug 22, 2025
While it is reliable, there are some issues with Apache Spark Streaming as it is not 100% reliable.
Learn what your peers think about Apache Spark Streaming. Get advice and tips from experienced pros sharing their opinions. Updated: September 2025.
867,676 professionals have used our research since 2012.
Shahzad Munir - PeerSpot reviewer
Aug 25, 2025
Monitoring is an area where they could definitely improve Apache Spark Streaming. When you have a streaming application, it generates numerous logs. After some time, the logs become meaningless because they're quite large and impossible to open.
DR
Jun 8, 2023
It was resource-intensive, even for small-scale applications.
RK
Jun 3, 2024
The debugging aspect could use some improvement.
Prashast Tripathi - PeerSpot reviewer
Jul 24, 2023
The cost and load-related optimizations are areas where the tool lacks and needs improvement.
Ajay Hiremath - PeerSpot reviewer
Sep 9, 2025
The downside is when you have this the other way around in the columns, it becomes really hard to use.
Oscar Estorach - PeerSpot reviewer
Aug 18, 2021
The solution itself could be easier to use.
SB
Nov 21, 2022
The initial setup is quite complex.