Apache Hadoop vs Teradata comparison

Cancel
You must select at least 2 products to compare!
Apache Logo
2,630 views|2,223 comparisons
89% willing to recommend
Teradata Logo
6,154 views|5,080 comparisons
87% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Apache Hadoop and Teradata based on real PeerSpot user reviews.

Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Apache Hadoop vs. Teradata Report (Updated: March 2024).
768,415 professionals have used our research since 2012.
Q&A Highlights
Question: Which data catalog can provide support for BI data sources such as SAP BO and Tableau?
Answer: Dear Community, Many thanks for yor support and help!
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing.""The performance is pretty good.""The scalability of Apache Hadoop is very good.""What comes with the standard setup is what we mostly use, but Ambari is the most important.""Hadoop File System is compatible with almost all the query engines.""Hadoop is designed to be scalable, so I don't think that it has limitations in regards to scalability.""As compared to Hive on MapReduce, Impala on MPP returns results of SQL queries in a fairly short amount of time, and is relatively fast when reading data into other platforms like R.""The most valuable feature is the database."

More Apache Hadoop Pros →

"It handles large amounts of information with a linear performance increase, in relation to a HW investment.""Auto-partitioning and indexing, and resource allocation on the fly are key features.""The most valuable features are the Shared-nothing architecture and data protection functionality.""The functionality of the solution is excellent.""It is a solid database a lot of different tools to move data.""The two types of partitioning have been very significant for us - row and columnar partitioning.""Improved performance of ETL procedures, reporting.""There are several features of Teradata that I like. One of the most basic is the indexes. I also like that it provides lower TCO. It also has the optimizer feature which is a good feature and isn't found in other legacy systems. Parallelism is also another feature I like in Teradata because when you are running or hosting on multiple systems, you have this shared-nothing architecture that helps. Loading and unloading in Teradata are also really helpful compared to other systems."

More Teradata Pros →

Cons
"It requires a great deal of learning curve to understand. The overall Hadoop ecosystem has a large number of sub-products. There is ZooKeeper, and there are a whole lot of other things that are connected. In many cases, their functionalities are overlapping, and for a newcomer or our clients, it is very difficult to decide which of them to buy and which of them they don't really need. They require a consulting organization for it, which is good for organizations such as ours because that's what we do, but it is not easy for the end customers to gain so much knowledge and optimally use it.""The solution is very expensive.""The key shortcoming is its inability to handle queries when there is insufficient memory. This limitation can be bypassed by processing the data in chunks.""The load optimization capabilities of the product are an area of concern where improvements are required.""Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them.""We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it.""The stability of the solution needs improvement.""The price could be better. I think we would use it more, but the company didn't want to pay for it. Hortonworks doesn't exist anymore, and Cloudera killed the free version of Hadoop."

More Apache Hadoop Cons →

"Sometimes the large injestion takes days to load data, and some of our stored procedures take two to three days.""The setup is not straightforward.""​I think the UI is not there yet. It could be improved by being more user-friendly.​""GUI of administrative tools is really outdated.""Teradata is a bit late for the cloud.""The following could be better: licensing, architecture openness, integration with other tools.""Teradata needs to expand the kind of training that's available to customers. Teradata only offers training directly and doesn't delegate to any third-party companies. As a result, it's harder to find people trained on Teradata in our market relative to Oracle.""I would like to see an improved Knowledge Base on the web."

More Teradata Cons →

Pricing and Cost Advice
  • "Do take into consider that data storage and compute capacity scale differently and hence purchasing a "boxed" / 'all-in-one" solution (software and hardware) might not be the best idea."
  • "​There are no licensing costs involved, hence money is saved on the software infrastructure​."
  • "This is a low cost and powerful solution."
  • "The price of Apache Hadoop could be less expensive."
  • "If my company can use the cloud version of Apache Hadoop, particularly the cloud storage feature, it would be easier and would cost less because an on-premises deployment has a higher cost during storage, for example, though I don't know exactly how much Apache Hadoop costs."
  • "We don't directly pay for it. Our clients pay for it, and they usually don't complain about the price. So, it is probably acceptable."
  • "The price could be better. Hortonworks no longer exists, and Cloudera killed the free version of Hadoop."
  • "We just use the free version."
  • More Apache Hadoop Pricing and Cost Advice →

  • "Teradata is not cheap, but you get what you pay for."
  • "Make sure you have the in-house skills to design and support the solution, as relying on external sources is extremely costly and tends to lock you into specific platforms, tools, and paradigms."
  • "In the past, it turned out that other solutions, in order to provide the full range of abilities that the Teradata platform provides plus the migration costs, would end up costing more than Teradata does."
  • "The initial cost may seem high, but the TCO is low."
  • "Teradata is currently making improvements in this area."
  • "It is still a very expensive solution. While I very much like the pure technological supremacy of the software itself, I believe Teradata as a company needs to become more affordable. They are already losing the market to more flexible or cheaper competitors."
  • "Teradata is expensive but gives value for money, especially if you don't want to move your data to the cloud."
  • "Price is quite high, so if it is really possible to use other solutions (e.g. you do not have strict requirements for performance and huge data volumes), it might be better to look at alternatives from the RDBMS world."
  • More Teradata Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
    768,415 professionals have used our research since 2012.
    Comparison Review
    Anonymous User
    Answers from the Community
    Tomasz Rabong
    InitZero - PeerSpot reviewerInitZero
    Real User

    Hi Tomasz,


    Collibra can scan all these sources. See this link: https://marketplace.collibra.c...


    Also, Erwin Data Intelligence Suite can harvest most (if not all) of these sources:


    https://www.erwin.com/products...

    Leandro Sodré - PeerSpot reviewerLeandro Sodré
    User

    Hi Tomasz Rabong


    I believe that if you have a developer team in Amundsen it would be possible. 


    Alternatively, you can look at Informatica EDC or at Data Virtualization Data Catalog (from Denodo).

    Ritesh Misra - PeerSpot reviewerRitesh Misra
    User

    @Tomasz Rabong, it depends upon the actual requirements of the data catalog. 


    As far as we have experienced SAP BO 4.0 is way ahead in solving architectural, clustering, warehousing and mining complex problems whereas Tableau server 2022.1 is really awesome and has recently included features to solve complex problems. 


    As a team, we prefer SAP BO for billions of data.

    Delmar Assis - PeerSpot reviewerDelmar Assis
    Real User

    Hi @Tomasz Rabong, I hope you're well and safe. 


    Specifically, if you need any help regarding Infogix Data360 Govern, please let me know. 


    Cheers.

    Questions from the Community
    Top Answer:Tools like Apache Hadoop are knowledge-intensive in nature. Unlike other tools in the market currently, we cannot understand knowledge-intensive products straight away. To use Apache Hadoop, a person… more »
    Top Answer: I have spoken to my colleagues about this comparison and in our collective opinion, the reason why some people may declare Teradata better than Oracle is the pricing. Both solutions are quite… more »
    Top Answer: Before my organization implemented this solution, we researched which big brands were using Teradata, so we knew if it would be compatible with our field. According to the product's site, the… more »
    Top Answer:Teradata is not a difficult product to work with, especially since they offer you technical support at all levels if you just ask. There are some features that may cause difficulties - for example… more »
    Ranking
    5th
    out of 34 in Data Warehouse
    Views
    2,630
    Comparisons
    2,223
    Reviews
    11
    Average Words per Review
    532
    Rating
    8.0
    3rd
    out of 34 in Data Warehouse
    Views
    6,154
    Comparisons
    5,080
    Reviews
    21
    Average Words per Review
    469
    Rating
    7.8
    Comparisons
    SQL Server logo
    Compared 19% of the time.
    Snowflake logo
    Compared 11% of the time.
    Oracle Exadata logo
    Compared 11% of the time.
    MySQL logo
    Compared 10% of the time.
    Teradata IntelliFlex logo
    Compared 2% of the time.
    Learn More
    Overview
    The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

    Teradata is a multi-cloud data platform company that provides data warehouse and relational database tools. The brand has an extensive portfolio of big data analytic solutions, integrated marketing applications, and services. The products that Teradata offers can be categorized in the following ways:

    • Software: The products in this category include Teradata Vantage, an advanced SQL engine and Teradata database.

    • Cloud: The solutions can work with popular cloud services such as Amazon AWS, Microsoft Azure, Google Cloud Platform, and VMware. Teradata Cloud and customer cloud are also included in this product category.

    • Ecosystem management: The products in this section include IntelliSphere, business continuity manager, data lab, data mover, data stream, QueryGrid, and viewpoint.

    • Hardware: This category includes backup, archive, and restore (BAR) and IntelliFlex.

    • Applications: Master data management (MDM) and Teradata analytics for enterprise applications are the two products in this category.

    One of the most popular and commonly used products by Teradata is Teradata Vantage. This is a connected multi-cloud data platform for enterprise analytics. The product unifies data lakes, analytics, and data warehouses, as well as data sources and types. The solution offers its customers scalability, which allows them to scale dimensions to handle massive workloads of data. Teradata Vantage utilizes artificial intelligence (AI) and machine learning (ML) to power more models and enhance quality.

    Through Vantage Console, businesses can benefit from role-based, no-code software to organize and manage their data. The product offers deployment on many popular public clouds, on premises, and on commodity hardware. Teradata Vantage unifies and integrates all data types from sources and provides companies with a single source of information. It achieves this by supporting all common data types and formats. The product also provides organizations with tools for monitoring, analyzing, and connecting data throughout the organizations.

    Part of this product is also VantageCloud, which offers a modern cloud-native architecture as well as hybrid and multi-cloud deployment options. This solution offers new ways for clients to deploy their platforms. These include the next-generation, cloud-native architecture of Teradata VantageCloud Lake.

    Teradata Features

    The different products that Teradata offers have various features which facilitate data management for customers. Some of the key capabilities of the solutions offered by Teradata include:

    • Connectivity: Users can benefit from connections to the mainframe or network-attached systems. The product provides its own extension for interaction with data stored in the tables. It also supports SQL.

    • Linear scalability: Teradata solutions are highly scalable and linear. The solution can handle large volumes of data effectively while also supporting the ability to scale up to 2048 nodes.

    • Load and unload utilities: The product offers features to move data in and out of the product's system effectively.

    • Mature optimizer: Part of this feature is Teradata Optimizer, which is an advanced product that can handle up to 64 joins in a single query.

    • Robust utilities: The solution includes multiple robust utilities in this category of features that facilitate data handling in and out of the product's systems. The tools include MultiLoad, FastLoad, and FastExport.

    • Shared nothing architecture: This set of features, connected to the architecture of Teradata, is known as “shared nothing architecture” because the nodes, processors, and disks all work independently. These capacities ensure better value for a given task.

    • Unlimited parallelism: These features divide large volumes of data into smaller processes that are executed in parallel. This contributes to the execution of complex tasks in a timely manner.

    Teradata Benefits

    Through its multiple solutions, Teradata offers various benefits for its clients. Some of these include:

    • The solution facilitates prototype creation and offers faster times for completion.

    • Through Teradata, users are able to use faster, simpler, and easier solutions for data-related tasks.

    • The product eliminates the need for unnecessary data movement, which contributes to performance improvements.

    • Teradata offers developers the ability to decide where in the architecture different parts of an application run.

    • Teradata allows database administrators to manage databases from a single point of control.

    • The solution provides the option to get the same data on multiple deployment options.

    • The product supports Online Analytical Programming (OLAP) functions, which allows it to perform a complex analytical process of data.

    • Teradata uses the Service Workstation to provide a single operation view for the multi-node system of the product.

    Reviews from Real Users

    Martin P., a services manager at Bytes Systems Integration, describes Teradata as a product that is very fast with good database control and excellent support.

    Blaine V., principal at Insight Data Consulting, rates Teradata highly because of its excellent native features, highly stable, and impressive automation.

    Sample Customers
    Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
    Netflix
    Top Industries
    REVIEWERS
    Financial Services Firm38%
    Comms Service Provider25%
    Hospitality Company6%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm27%
    Computer Software Company10%
    Comms Service Provider6%
    University6%
    REVIEWERS
    Comms Service Provider26%
    Computer Software Company22%
    Financial Services Firm9%
    Energy/Utilities Company9%
    VISITORS READING REVIEWS
    Financial Services Firm26%
    Computer Software Company10%
    Manufacturing Company8%
    Healthcare Company7%
    Company Size
    REVIEWERS
    Small Business34%
    Midsize Enterprise23%
    Large Enterprise43%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise75%
    REVIEWERS
    Small Business33%
    Midsize Enterprise11%
    Large Enterprise56%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise9%
    Large Enterprise77%
    Buyer's Guide
    Apache Hadoop vs. Teradata
    March 2024
    Find out what your peers are saying about Apache Hadoop vs. Teradata and other solutions. Updated: March 2024.
    768,415 professionals have used our research since 2012.

    Apache Hadoop is ranked 5th in Data Warehouse with 32 reviews while Teradata is ranked 3rd in Data Warehouse with 54 reviews. Apache Hadoop is rated 7.8, while Teradata is rated 8.2. The top reviewer of Apache Hadoop writes "A file system for data collection that contains needed information and files". On the other hand, the top reviewer of Teradata writes "Offers seamless integration capabilities and performance optimization features, including extensive indexing and advanced tuning capabilities". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and BigQuery, whereas Teradata is most compared with SQL Server, Snowflake, Oracle Exadata, MySQL and Teradata IntelliFlex. See our Apache Hadoop vs. Teradata report.

    See our list of best Data Warehouse vendors.

    We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.