We performed a comparison between Apache Hadoop and Microsoft Parallel Data Warehouse based on real PeerSpot user reviews.Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
"The scalability of Apache Hadoop is very good."
"Apache Hadoop can manage large amounts and volumes of data with relative ease, which is a feature that is beneficial."
"The most important feature is its ability to handle large volumes. Some of our customers have really large volumes, and it is capable of handling their data in terms of the core volume and daily incremental volume. So, its processing power and speed are most valuable."
"Hadoop is designed to be scalable, so I don't think that it has limitations in regards to scalability."
"We selected Apache Hadoop because it is not dependent on third-party vendors."
"I liked that Apache Hadoop was powerful, had a lot of tools, and the fact that it was free and community-developed."
"Hadoop is extensible — it's elastic."
"One valuable feature is that we can download data."
"I like Data Warehouse's data integrity features. Data integrity is what databases are made for as opposed to spreadsheets."
"The solution has been reliable."
"We have complete control over our data."
"The price could be better. I think we would use it more, but the company didn't want to pay for it. Hortonworks doesn't exist anymore, and Cloudera killed the free version of Hadoop."
"Real-time data processing is weak. This solution is very difficult to run and implement."
"The integration with Apache Hadoop with lots of different techniques within your business can be a challenge."
"It requires a great deal of learning curve to understand. The overall Hadoop ecosystem has a large number of sub-products. There is ZooKeeper, and there are a whole lot of other things that are connected. In many cases, their functionalities are overlapping, and for a newcomer or our clients, it is very difficult to decide which of them to buy and which of them they don't really need. They require a consulting organization for it, which is good for organizations such as ours because that's what we do, but it is not easy for the end customers to gain so much knowledge and optimally use it."
"Hadoop's security could be better."
"From the Apache perspective or the open-source community, they need to add more capabilities to make life easier from a configuration and deployment perspective."
"I mentioned it definitely, and this is probably the only feature we can improve a little bit because the terminal and coding screen on Hadoop is a little outdated, and it looks like the old C++ bio screen. If the UI and UX can be improved slightly, I believe it will go a long way toward increasing adoption and effectiveness."
"I think more of the solution needs to be focused around the panel processing and retrieval of data."
"I would like the ability to do more real-time type updates instead of batch-oriented updates."
"We'd like to see it be a bit more compatible with other solutions."
"The query is slow if we don't optimize it."
The traditional structured relational data warehouse was never designed to handle the volume of exponential data growth, the variety of semi-structured and unstructured data types, or the velocity of real time data processing. Microsoft's SQL Server data warehouse solution integrates your traditional data warehouse with non-relational data and it can handle data of all sizes and types, with real-time performance.
Apache Hadoop is ranked 6th in Data Warehouse with 9 reviews while Microsoft Parallel Data Warehouse is ranked 14th in Data Warehouse with 3 reviews. Apache Hadoop is rated 8.0, while Microsoft Parallel Data Warehouse is rated 7.4. The top reviewer of Apache Hadoop writes "Has good processing power and speed and is capable of handling large volumes of data and doing online analysis". On the other hand, the top reviewer of Microsoft Parallel Data Warehouse writes "The admission tools, BI tools, and data warehousing tools are great". Apache Hadoop is most compared with Microsoft Azure Synapse Analytics, Snowflake, Azure Data Factory, Oracle Exadata and Teradata, whereas Microsoft Parallel Data Warehouse is most compared with Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake, SAP BW4HANA and Vertica. See our Apache Hadoop vs. Microsoft Parallel Data Warehouse report.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.