

Find out what your peers are saying about Amazon Web Services (AWS), Apache, Spot - A Flexera company and others in Compute Service.
Thanks to improvements on both our side in how we run processes and enhancements to Apache NiFi, we have reduced the time commitment to almost not needing to interact with Apache NiFi except for minor queue-clearance tasks, allowing it to run smoothly.
It supports not just ETL but also ELT, allowing us to save significant time.
There may be return on investment based on the technology and easily moving our workloads onto Apache NiFi from our previous system.
The customer support is really good, and they are helpful whenever concerns are posted, responding immediately.
Customer support for Apache NiFi has been excellent, with minimal response times whenever we raise cases that cannot be directly addressed by logs.
I would rate the customer support of Apache NiFi a 10 on a scale of 1 to 10.
The fact that no interaction is needed shows their great support since I don't face issues.
Google's support team is good at resolving issues, especially with large data.
Whenever we have issues, we can consult with Google.
Depending on the workload we process, it remains stable since at the end of the day, it is just used as an orchestration tool that triggers the job while the heavy lifting is done on Spark servers.
Scaling up is fairly straightforward, provided you manage configurations effectively.
Based on the workload, more nodes can be added to make a bigger cluster, which enhances the cluster whenever needed.
Google Cloud Dataflow has auto-scaling capabilities, allowing me to add different machine types based on pace and requirements.
As a team lead, I'm responsible for handling five to six applications, but Google Cloud Dataflow seems to handle our use case effectively.
Google Cloud Dataflow can handle large data processing for real-time streaming workloads as they grow, making it a good fit for our business.
I have seen Apache NiFi crashing at times, which is one of the issues we have faced in production.
Apache NiFi is stable in most cases.
I have not encountered any issues with the performance of Dataflow, as it is stable and backed by Google services.
The job we built has not failed once over six to seven months.
The automatic scaling feature helps maintain stability.
Apache NiFi should have APIs or connectors that can connect seamlessly to other external entities, whether in the cloud or on-premises, creating a plug-and-play mechanism.
The history of processed files should be more readable so that not only the centralized teams managing Apache NiFi but also application folks who are new to the platform can read how a specific document is traversing through Apache NiFi.
The initial error did not indicate it was related to memory or size limitations but appeared as a parsing error or something similar.
Outside of Google Cloud Platform, it is problematic for others to use it and may require promotion as an actual technology.
I feel there could be something that they can introduce, such as when we have data in the tables, a feature that creates a unique persona of the user automatically, so we do not have to do that manually.
Dealing with a huge volume of data causes failure due to array size.
The pricing in Italy is considered a little bit high, but the product is worth it.
It is part of a package received from Google, and they are not charging us too high.
Apache NiFi has positively impacted my organization by definitely bridging the gap between the on-premises and cloud interaction until we find a solution to open the firewall for cloud components to directly interact with on-premises services.
Development has improved with a reduction in time spent being the main benefit; before we needed a matter of days to create the ingestion flows, but now it only takes a couple of hours to configure.
The ease of use in Apache NiFi has helped my team because anyone can learn how to use it in a short amount of time, so we were able to get a lot of work done.
It supports multiple programming languages such as Java and Python, enabling flexibility without the need to learn something new.
The integration within Google Cloud Platform is very good.
Google Cloud Dataflow's features for event stream processing allow us to gain various insights like detecting real-time alerts.
| Product | Mindshare (%) |
|---|---|
| Apache NiFi | 8.2% |
| AWS Lambda | 14.2% |
| Amazon EC2 | 13.6% |
| Other | 64.0% |
| Product | Mindshare (%) |
|---|---|
| Google Cloud Dataflow | 3.7% |
| Apache Flink | 8.9% |
| Databricks | 8.1% |
| Other | 79.3% |
| Company Size | Count |
|---|---|
| Small Business | 5 |
| Midsize Enterprise | 1 |
| Large Enterprise | 18 |
| Company Size | Count |
|---|---|
| Small Business | 3 |
| Midsize Enterprise | 2 |
| Large Enterprise | 11 |
Apache NiFi offers a flexible platform for data orchestration, transformation, and ingestion, catering to both low and high-code customization needs. It streamlines data movement with a powerful visual interface and robust scalability, facilitating seamless integration with diverse data sources.
With Apache NiFi's drag-and-drop capabilities and extensive built-in processors, users can easily simplify complex workflows. Its open-source framework promises cost savings and increased productivity, enabling efficient pipeline development and real-time data handling. While it's valued for data integration and external tool compatibility, there's a need for improvements in logging clarity, local development integration, and cloud-native features.
What are the key features of Apache NiFi?In industries like finance, healthcare, and logistics, Apache NiFi is often implemented for data orchestration and transformation tasks, enhancing workflows through integration with tools like Spark and Elasticsearch. It supports data migration and ETL processes, enabling seamless management of large-scale data operations across systems.
Google Cloud Dataflow provides scalable batch and streaming data processing with Apache Beam integration, supporting Python and Java. It's designed for efficient data transformations, analytics, and machine learning, featuring cost-effective serverless operations.
Google Cloud Dataflow is a robust tool for handling large-scale data processing tasks with flexibility in processing batch and streaming workloads. It integrates seamlessly with other Google Cloud services like Pub/Sub for real-time messaging and BigQuery for advanced analytics. The platform supports a wide array of data transformation and preparation needs, making it suitable for complex data workflows and machine learning applications. Despite its advantages, users have noted challenges such as incomplete error logs, longer job startup times, and some limitations in the Python SDK.
What are the key features of Google Cloud Dataflow?Industries, especially in retail and eCommerce, implement Google Cloud Dataflow for effective batch job execution, data transformation, and event stream processing. It aids in constructing distributed data pipelines for handling extensive analytics tasks, supporting effective large-scale data-driven decisions.
We monitor all Compute Service reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.