Excellent speeds for publishing messages faster.
Java Architect at a tech vendor with 51-200 employees
The speed at which it publishes messages is valuable.
Pros and Cons
- "Excellent speeds for publishing messages faster."
- "Too much dependency on the zookeeper and leader selection is still the bottleneck for Kafka implementation."
What is most valuable?
What needs improvement?
Too much dependency on the zookeeper and leader selection is still the bottleneck for Kafka implementation.
What do I think about the scalability of the solution?
RESTful API implementation actually uses the Kafka Broker to publish the messages but I am not able to find it becoming scalable. Partially, the reason might be there is no load balancer for the RESTful API web server.
How was the initial setup?
Setup is very much straightforward for development, and cluster setup is also easy. I am not aware of the production setup yet.
Buyer's Guide
Apache Kafka
June 2025

Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
857,028 professionals have used our research since 2012.
What about the implementation team?
I implemented it on my own.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.

Enterprise Architect at a logistics company with 1,001-5,000 employees
We use it for reactive architecture, track and trace, mail and parcel.
What is most valuable?
- Supports more than 10,000 events/second.
- Scalability
- Replication
It is a good product for event-driven architecture.
How has it helped my organization?
We use Kafka for reactive architecture, track and trace, mail and parcel.
What needs improvement?
A good free monitor tool would be great for Apache Kafka (from Apache foundation).
For how long have I used the solution?
We used Kafka 0.8 for 2 years and Kafka 0.10 for 3 months.
What do I think about the stability of the solution?
We have not encountered any stability issues.
What do I think about the scalability of the solution?
We have not encountered any scalability issues.
How are customer service and technical support?
We haven’t used technical support.
Which solution did I use previously and why did I switch?
Apache MQ is different. It is a message bus (log rotate) than can manage more than 10,000 events/sec.
How was the initial setup?
The basic configuration is quite good. We have built a Hadoop cluster and the Kafka service was included.
What's my experience with pricing, setup cost, and licensing?
We use a community version.
What other advice do I have?
Kafka processes asynchronous exchanges, so there are no transactional interactions.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Apache Kafka
June 2025

Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
857,028 professionals have used our research since 2012.
Technical Lead/Project Manager(Consulting Apple Inc) at a tech services company with 1,001-5,000 employees
Topic-based eventing, scalability, and retention periods are valuable.
What is most valuable?
The most valuable features are topic-based eventing, scalability, and retention periods.
How has it helped my organization?
My organization is transforming by using the new SOA/eventing-based architecture. The application depends on the employees’ information events. Kafka is very helpful in implementing this. It increases the performance and gives the details to multiple external/internal teams using Kafka topics in an asynchronous manner.
For example, if someone is moving from one office to another one, we have to update the software. While updating it, the system puts that event in a topic so that all other consumers can update that person’s new location. This can include the payroll team, the insurance team, and the hospital network.
The retention period helps us retain the data in the topic for the configured number of days. In this example, if any of the consumers fail to consume the message from the topic, then that message will be there until the retention period ends.
What needs improvement?
I would like to see a more user-friendly GUI.
For how long have I used the solution?
We have used this solution since December, 2015.
What do I think about the stability of the solution?
If you are using the same group ID for multiple topics, it may shut down the application. We have faced this issue before.
What do I think about the scalability of the solution?
We have not had any scalability issues.
How are customer service and technical support?
I would give technical support a rating of 6 out of 10.
Which solution did I use previously and why did I switch?
We were using ActiveMQ, which is just a messaging system. We are changing because of Kafka’s added value of scalability, retention, and high payload support.
How was the initial setup?
The installation was somewhat straightforward.
What's my experience with pricing, setup cost, and licensing?
The solution is worth the money.
What other advice do I have?
This is the best tool I have ever used for asynchronous, event-based solutions.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Lead Engineer at a retailer with 10,001+ employees
We use the product for high-scale distributed messaging. Multiple consumers can sync with it and fetch messages.
What is most valuable?
We use the product for high-scale distributed messaging. The processing capability of the product is enormous. Being a distributed platform, multiple consumers can sync with it and fetch messages.
Another great feature is the consumer offset log which tells you where the consumer left and where he needs to start again. Consumers aren’t required to code and put extra effort to maintain the offset.
How has it helped my organization?
We were using another commercial messaging engine, which was not scalable unless you paid more. Each hub that we provisioned was expensive. This solution is open source, which is much easier to use and doesn’t cost us anything.
What needs improvement?
This product guarantees at-least-once delivery. We have asked JIRA to provide features such as at-most-once delivery to remove duplicate message consumption.
What do I think about the stability of the solution?
We haven’t faced any issues so far. Some of the clusters churn millions of records per seconds with ease.
What do I think about the scalability of the solution?
We have clustered environments and we haven’t seen any scalability issues. We can provision a new node in as little as 45 minutes.
How are customer service and technical support?
It is open source, so support is in our own hands. The only option is to make a new feature request through JIRA. When multiple people in the community make a request for similar feature, it gets priority.
Which solution did I use previously and why did I switch?
We switched from a previous solution mainly to reduce costs and to have a more scalable solution.
How was the initial setup?
The initial setup was a bit complex in terms of how to manage it across data centers. But once it was setup, we never faced issues.
Which other solutions did I evaluate?
We evaluated multiple options, such as ActiveMQ and RabbitMQ. We leaned towards this solution.
What other advice do I have?
I would advise others to start with non-SSL implementations and try to do PoCs. Afterwards, they should move towards more secure features.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Java Developer at a media company with 10,001+ employees
It provides safety for data in case of node failure or data center outage. Partitioning is useful for parallelizing processing.
What is most valuable?
The most valuable features to me are replication, partitioning and easy integration with Apache Spark, which we use quite a bit for distributed processing.
Replication is good for high availability. It provides additional safety for data in case of node failure or data center outage. Partitioning is a really useful feature for parallelizing processing. We use Apache Spark to process data from a Kafka queue, and Spark is able to assign one executor to each Kafka partition. The more partitions we have, the more threads we can use to process data in parallel. This helps us achieve really good throughput.
How has it helped my organization?
It will help us build a scalable platform. This will allow the company to provide better customer service.
What needs improvement?
It’s pretty easy to use for now. I haven’t had any difficulty or problems that I can complain about. Maybe they can add a UI to the configure queues and to display statistics about data stores.
For how long have I used the solution?
I have used Kafka for about a year.
What do I think about the stability of the solution?
So far, we have not encountered any stability issues.
What do I think about the scalability of the solution?
We have not had any scalability issues. The product is horizontally scalable, so adding extra hardware is all that is needed.
How are customer service and technical support?
We haven’t needed technical support with the product yet.
Which solution did I use previously and why did I switch?
I think performance-wise, the product is very good and fits in our use case. We used other distributed message queues, but all products have their own use case
How was the initial setup?
Initial setup wasn’t really complex. We use Kafka through Hortonworks Suite, which comes with many other big data tools. Ambari makes it easy to setup
What's my experience with pricing, setup cost, and licensing?
Licensing and pricing was handled by my management, so I don’t have much knowledge there.
What other advice do I have?
Give it a try. It’s a valuable, high-performance, distributed processing tool.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Building Event-centric Data processing Architectures at a tech services company with 51-200 employees
The product is scalable and provides good connectors, but the ability to connect the producers and consumers must be improved
Pros and Cons
- "The connectors provided by the solution are valuable."
- "The ability to connect the producers and consumers must be improved."
What is our primary use case?
We use the solution for analytics for streaming. We also use it for fraud detection.
What is most valuable?
The Kafka Streams library gives quite a bit of functionality. The connectors provided by the solution are valuable.
What needs improvement?
The ability to connect the producers and consumers must be improved. It's still a pain point because a lot of development goes into it.
For how long have I used the solution?
I have been using the solution for seven to eight years.
What do I think about the stability of the solution?
For what it does, the tool is very stable. It is a message broker. It receives the messages and holds them for producers and consumers. It's usually everything around Kafka that has stability problems because Kafka does exactly what it's supposed to do.
What do I think about the scalability of the solution?
Scalability is one of the main selling points of the tool. The additional nodes we add give us the additional storage capacity we need. I rate the scalability a ten out of ten. The solution is used across multiple domains in our organization. I use the product daily. It’s a continuously growing platform.
How are customer service and support?
Apache doesn't provide support. There are sites we can go to for information, but there's no support team for Apache. There are companies like Confluent and HPE that provide support for the solution.
Which solution did I use previously and why did I switch?
We also use Flink and other streaming tools. We use Apache Kafka in addition to other technologies because of the requirement and the business use cases.
How was the initial setup?
It is super easy to set up. I rate the ease of setup a ten out of ten. However, building and administration get quite difficult. It takes three months to make things production-ready.
What about the implementation team?
The deployment was done in-house. We used the tools that we have in our CI/CD pipeline. We needed three people for the deployment. The infrastructure team maintains the tool. The infrastructure team has three to ten members.
What was our ROI?
We see an ROI on the product. If we don't have a tool to buffer the amount of traffic coming in from high-traffic sites, we cannot use the data. Apache Kafka gives us a resting area where we can push as much information as we want to. It’s picked up by consumers when they need it.
It’s a huge return on investment. Otherwise, we must have a system tied to the producer waiting for the consumer to consume before we can do anything with the rest of the messages. A solution like Kafka provides us with a buffer to consume the data as we choose to.
What's my experience with pricing, setup cost, and licensing?
The price depends on who we are getting the product from. If we buy it from Confluent, we always have to try to negotiate the price. The price is always negotiable.
What other advice do I have?
Overall, I rate the product a six out of ten.
Which deployment model are you using for this solution?
Hybrid Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Co-Founder at Attaika
A great streaming platform with good functionality
Pros and Cons
- "A great streaming platform."
- "Observability could be improved."
What is our primary use case?
We are a service implementer and we supply this solution to our customers. I'm a company co-founder and we are customers of Apache.
What is most valuable?
The solution has improved our functionality, it's one of the best streaming platforms I've used.
What needs improvement?
I'd like to see improvement in terms of observability.
For how long have I used the solution?
I've been using this product for the last five years on and off.
What do I think about the stability of the solution?
The solution is stable.
What do I think about the scalability of the solution?
The solution is scalable.
How was the initial setup?
The initial setup is straightforward, it's not complicated.
What's my experience with pricing, setup cost, and licensing?
This is an open-source product.
What other advice do I have?
I rate this solution nine out of 10.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Software Support & Development Engineer at a computer software company with 501-1,000 employees
Scalable and free to use
Pros and Cons
- "Apache Kafka is scalable. It is easy to add brokers."
- "Apache Kafka can improve by making the documentation more user-friendly. It would be beneficial if we could explain to customers in more detail how the solution operates but the documentation get highly technical quickly. For example, if they had a simple page where we can show the customers how it works without the need for the customer to have a computer science background."
What is our primary use case?
Apache Kafka is used for connecting components between each other in the same application. The use is quite limited, but I was curious about its filtering capability of it.
How has it helped my organization?
We implemented the notification system between our components, and we found that Apache Kafka performs well in scalability. It has improved our organization because of the scalability and the comfort of a fail-safe or disaster recovery it provides.
What needs improvement?
Apache Kafka can improve by making the documentation more user-friendly. It would be beneficial if we could explain to customers in more detail how the solution operates but the documentation get highly technical quickly. For example, if they had a simple page where we can show the customers how it works without the need for the customer to have a computer science background.
For how long have I used the solution?
I have been using Apache Kafka for approximately two years.
What do I think about the scalability of the solution?
Apache Kafka is scalable. It is easy to add brokers.
We have approximately 30 people using this solution in my organization. They use the solution daily.
Which solution did I use previously and why did I switch?
I have only used Apache Kafka.
How was the initial setup?
The initial setup of Apache Kafka took some time but after it was easy.
I rate the initial setup of Apache Kafka a three out of five.
What about the implementation team?
We set up the solution in-house.
What's my experience with pricing, setup cost, and licensing?
This is an open-source solution and is free to use.
What other advice do I have?
We have not used the solution in production. We do not have a lot of data at the moment.
I would recommend this solution to others.
I rate Apache Kafka an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.

Buyer's Guide
Download our free Apache Kafka Report and get advice and tips from experienced pros
sharing their opinions.
Updated: June 2025
Product Categories
Streaming AnalyticsPopular Comparisons
Databricks
Confluent
Azure Stream Analytics
Spring Cloud Data Flow
PubSub+ Platform
Informatica Data Engineering Streaming
Buyer's Guide
Download our free Apache Kafka Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which ETL tool would you recommend to populate data from OLTP to OLAP?
- What are the differences between Apache Kafka and IBM MQ?
- How do you select the right cloud ETL tool?
- What is the best streaming analytics tool?
- What are the benefits of streaming analytics tools?
- What features do you look for in a streaming analytics tool?
- When evaluating Streaming Analytics, what aspect do you think is the most important to look for?
- Why is Streaming Analytics important for companies?