AWS Glue vs Talend Open Studio comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
12,051 views|8,582 comparisons
Talend Logo
13,379 views|10,123 comparisons
Comparison Buyer's Guide
Executive Summary
Updated on May 22, 2022

We performed a comparison between AWS Glue vs Talend Open Studio based on our users’ reviews in five categories. After reading all of the collected data, you can find our conclusion below.

  • Ease of Deployment: Users report that the initial setup and deployment of both solutions is straightforward and easy.
  • Features: Reviewers of both products are happy with their stability and scalability. AWS Glue users like its user interface and say it is easy to implement ETL processes with, but they feel it is slow when starting up. Talend Open Studio reviewers say it is user-friendly and has excellent data integration but consumes a high amount of memory.
  • Pricing: Each of these products received mixed reviews in the pricing category. Some users of each feel that the price is too high.
  • ROI: AWS Glue users do not mention ROI. Talend Open Studio report seeing an ROI.
  • Service and Support: Users of both solutions report being satisfied with the level of support they receive.

Comparison Results: AWS Glue has a slight edge in this comparison since it is more lightweight than Talend Open Studio.

To learn more, read our detailed AWS Glue vs. Talend Open Studio Report (Updated: March 2024).
765,234 professionals have used our research since 2012.
Q&A Highlights
Question: How does Talend Open Studio compare with AWS Glue?
Answer: We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in the ETL console. This solution has a lot of useful features. For example, you can point AWS Glue to data in AWS, and Glue stores the associated metadata in the data catalog. As a result, the data becomes searchable and available in the ETL. The UI is very friendly and easy to use. We also like how fast it is for ETL processes. If your company handles large quantities of sensitive data, AWS Glue provides a scheduler, and the data deployment is very straightforward. Still, AWS Glue leaves some room for improvement. The setup can be pretty tricky if you are not already an AWS user. Also, the sample code is not available for many use cases. Talend Open Studio is a data integration solution designed for ETL, Big Data, and data integration. It is also easy to use. The feature we like the most is that It allows us to integrate the different data sources. It has the added advantage of being open-source. It provides a lot of features out of the box, which is nice to have. The pre-built widgets help you streamline integrations with databases and web services. We use it for small loads, but for large amounts of data it stalls sometimes. Conclusions: If you are already an AWS user, AWS Glue is probably the best ETL tool for you. If that's not the case, deployment can be quite difficult. For small loads, Talend is a better option.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs.""What I like best about AWS Glue is its real-time data backup feature. Last week, there was a production push, and what used to take almost ten days to send out around fifty-six thousand emails now takes only two hours.""I appreciate AWS Glue for its cost-effectiveness.""I like the fact that AWS Glue works with Python scripts.""The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs.""Transformations are valuable because you can modify or override complex data logic from an open source or Spark to solve issues.""It is a stable and scalable solution.""AWS Glue is fast and managed by AWS. Hence, you don't have to worry about capacity and the performance of Glue jobs. It has integrations with other data stores of AWS. The product offers metadata management, logging, and ETL processing capabilities. It comes with a powerful feature, Glue Studio, which helps to do queries interactively within the community. It is a managed service and very secure. Another popular and mature service is S3."

More AWS Glue Pros →

"The API integration and big data approach are very good because of how you extract data from JSP files or big data web repositories like MongoDB.""The most valuable feature of Talend Open Studio is the tMap component. There is a lot of functionality in one component.""The rapidity of integration with data may be one of the valuable features.""We have contacted their technical support. They are great. They offer very professional help. If I need some technical answer, they are very professional. They are quick, professional, and very accurate.""Talend is safe to use because it is very restrictive. It is easy to use when one learns how to manipulate data with SQL.""Talend can connect to multiple data sources, including relational data sources, ERP, CRM, and others.""The drag-and-drop feature in the interface is very good.""There are many architectures: hybrid, cloud, and on-prem."

More Talend Open Studio Pros →

Cons
"The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS.""There should be more connectors for different databases.""I would like to see a more robust interface on the no-code side. This would be nice to be able to split cells.""I have encountered challenges with multi-region support.""We face performance issues when using AWS Glue for data transformation and integration.""The interface for AWS Glue could improve, they do not put a lot of details. You can write the code, in PySpark or in Scala, which is a big advantage, it is only easy to use for a developer. It will be difficult for new users to enter the cloud environment.""The monitoring is not that good.""Glue could perform better. It sometimes takes too long to test a Glue job. Google Cloud Platform offers more Python scripts than AWS."

More AWS Glue Cons →

"It doesn't have the ability to keep the repository of the source code (visual pipeline). It can be integrated with Git.""Having additional training materials, such as a video tutorial, would be an improvement.""We need more components to be more efficient. We use a lot of components, such as Salesforce, and it's not easy to use. There's are minor bugs and it's not easy to use some of the features.""In the next release, Open Studio should include cloud storage as an input.""As for improvement or additional features, I would like to know how to use Java in Talend and also how to use Talend in the cloud or in big data. I would prefer to have storage directly on Talend.""Talend should improve the log and error handling to better track the errors you find during development. Sometimes it's challenging to see what's causing an issue, and tracking that on Talend is complicated.""The user interface could be made simpler.""If I compare Open Studion to other solutions, their interface is robust but not so fancy looking. This is an area that could use improvement. I would like to see an updated interface."

More Talend Open Studio Cons →

Pricing and Cost Advice
  • "The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
  • "It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
  • "Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
  • "Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
  • "This solution is affordable and there is an option to pay for the solution based on your usage."
  • "AWS Glue is quite costly, especially for small organizations."
  • "AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
  • "The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
  • More AWS Glue Pricing and Cost Advice →

  • "Pricing and licensing are fairly straightforward. It is reasonably priced and managed."
  • "Talend is free and you can download it."
  • "The paid version of this solution has a very high price, but even with the limitations, the Community version works fine."
  • "Price could be lower. It is getting too expensive when compared to some other solutions, which is actually a little bit concerning."
  • "There are many versions available and one is open-sourced which is free."
  • "The cost for one year for the ETL tools, not for the big data, is 6K per year. It is a good price."
  • "It does the job well for nothing — without cost. That's the advantage of this product."
  • "Talend Open Studio is priced too high."
  • More Talend Open Studio Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
    765,234 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in… more »
    Top Answer:AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or… more »
    Top Answer:The product's initial setup phase was easy.
    Top Answer:For Talend Open Studio, there is a need to make yearly payments towards the licensing cost. Talend Open Studio is a bit expensive, in my opinion.
    Top Answer:The high price of the solution is an area of concern where improvements are required.
    Ranking
    1st
    Views
    12,051
    Comparisons
    8,582
    Reviews
    30
    Average Words per Review
    398
    Rating
    7.9
    5th
    out of 94 in Data Integration
    Views
    13,379
    Comparisons
    10,123
    Reviews
    14
    Average Words per Review
    573
    Rating
    7.8
    Comparisons
    Also Known As
    Open Studio
    Learn More
    Overview

    AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for running jobs, implementing business workflows, and authoring.

    AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.

    The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.

    AWS Glue Features

    AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:

    • Automatic schema discovery: AWS Glue crawlers connect to the organization's source or target data source through a prioritized list of classifiers to determine the schema for users' data. This feature creates metadata in companies' AWS Glue Data Catalog.

    • Schemas for data stream management: The AWS Glue Schema Registry enables users to validate and control the evolution of streaming data through registered Apache Avro schemas for no additional charge.

    • Automatic scaling based on workload: This feature dynamically scales resources up and down based on workload. The feature controls job resources, removing them depending on how much the workload can be split up.

    • FindMatches: This feature is for machine learning-based data deduplication and cleansing, and works by finding records that are imperfect matches of each other to remove useless data copies.

    • Edit, debug, and test ETL code: This feature helps users who have chosen to interactively develop their ETL code by providing development endpoints for editing, debugging, and testing the code it generates for them.

    • AWS Glue DataBrew: An interactive, point-and-click visual interface for specialists to clean and normalize data without the need to write any code.

    • AWS Glue Interactive Sessions: This feature simplifies the development of data integration jobs by enabling data engineers to interactively prepare and explore data.

    • AWS Glue Studio Job Notebooks: This AWS Glue feature provides serverless notebooks with minimal setup, allowing developers to start working in a timely manner.

    • Complex ETL pipeline building: This feature allows the product to be invoked on a schedule, on demand, or based on an event, allowing users to start multiple jobs in parallel or specify dependencies to build complex ETL pipelines.

    • AWS Glue Studio: This AWS Glue feature allows users to visually transform data through a drag-and-drop interface. The product automatically generates the code for ETL processes for users' data.

    AWS Glue Benefits

    AWS Glue offers a wide range of benefits for its users. These benefits include:

    • Users of other AWS products can easily onboard with AWS Glue, as it is integrated across a wide range of the company's services.

    • The solution is serverless, which allows for a lower total cost of ownership.

    • AWS Glue offers more power for users, as it automates much of the effort in building, maintaining, and running ETL jobs.

    • The product allows customers to easily discover and search across all their AWS datasets through AWS Glue Data Catalog.

    • AWS Glue does not require additional payment for managing and enforcing schemas for data streams.

    • The solution facilitates the authority of scalable ETL jobs for beginners and non-coding experts through a drag-and-drop interface.

    Reviews from Real Users

    Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.

    Liana I., CEO at Quark Technologies SRL, describes AWS Glue as a highly scalable, reliable, and beneficial pay-as-you-go pricing model.

    Talend Open Studio is a free, open source ETL tool for data integration and Big Data. The solution enables you to extract diverse datasets and normalize and transform them into a consistent format which can be loaded into a number of third-party databases and applications.

    Talend Open Studio Features

    Talend Open Studio has many valuable key features. Some of the most useful ones include:

    • Automatic identification of data types and potential errors
    • tMap module
    • Graphical conversion tools
    • Charts
    • Database SCD Tools
    • Business intelligence formats (Jasper, OLAP, SPSS, Splunk)
    • ETL and ELT support
    • Eclipse-based development tooling
    • Versioning
    • Large library of connectors
    • Data flow orchestration
    • File management without scripting
    • Data transformations

    Talend Open Studio Benefits

    There are several benefits to implementing Talend Open Studio. Some of the biggest advantages the solution offers include:

    • Reduces the time taken to develop the integration.
    • Provides a wide selection of source and target connectors.
    • Monitor and manage problematic deployments with ease.
    • Allows developers to have the lowest cost of ownership for any solution.
    • Improves collaboration between different teams who need access to data.
    • Automated data integration process synchronizes the data and eases real time and periodic reporting, which would be time-consuming if done manually.
    • Achieve better data quality because data matures and improves over time.

    Reviews from Real Users

    Below are some reviews and helpful feedback written by PeerSpot users currently using the Talend Open Studio solution.

    Elio B., Data Integration Specialist/CTO at Asset messages, says, "The solution has a good balance between automated items and the ability for a developer to integrate and extend what he needs. Other competing tools do not offer the same grade of flexibility when you need to go beyond what is provided by the tool. Talend, on the other hand, allows you to expand very easily."

    A Practice Head, Analytics at a tech services company mentions, “The data integration aspect of the solution is excellent. The product's data preparation features are very good. There's very useful data stewardship within the product. From a technical standpoint, the solution itself is pretty good. There are very good pre-built connectors in Talend, which is good for many clients or businesses, as, in most cases, companies are dealing with multiple data sources from multiple technologies. That is where a tool like Talend is extremely helpful.”

    Prerna T., Senior System Executive at a tech services company, comments, “The best thing I have found with Talend Open Studio is their major support for the lookups. With Salesforce, when we want to relate our child objects to their parent object, we need to create them via IDs. Then the upsert operation, which will allow you to relate a child object to the event, will have an external ID. That is the best thing which keeps it very sorted. I like that.”

    An Implementation Specialist, Individual Contributor at a computer software company, states, “I can connect with different databases such as Oracle Database or SQL Server. It allows you to extract the data from one database to another. I can structure the data by filtering and mapping the fields.” He also adds, “It is very user-friendly. You need to know the basics of SQL development or SQL queries, and you can use this tool.”

    PeerSpot user Badrakh V., Information System Architect at Astvision, explains, "The most valuable features are the ETL tools."

    Sample Customers
    bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
    Almerys, BF&M, Findus
    Top Industries
    REVIEWERS
    Computer Software Company47%
    Financial Services Firm18%
    Pharma/Biotech Company12%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm20%
    Computer Software Company13%
    Insurance Company7%
    Manufacturing Company7%
    REVIEWERS
    Computer Software Company29%
    Insurance Company12%
    Financial Services Firm12%
    University12%
    VISITORS READING REVIEWS
    Computer Software Company16%
    Financial Services Firm14%
    Manufacturing Company7%
    Government6%
    Company Size
    REVIEWERS
    Small Business29%
    Midsize Enterprise13%
    Large Enterprise58%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise12%
    Large Enterprise73%
    REVIEWERS
    Small Business42%
    Midsize Enterprise25%
    Large Enterprise33%
    VISITORS READING REVIEWS
    Small Business20%
    Midsize Enterprise14%
    Large Enterprise66%
    Buyer's Guide
    AWS Glue vs. Talend Open Studio
    March 2024
    Find out what your peers are saying about AWS Glue vs. Talend Open Studio and other solutions. Updated: March 2024.
    765,234 professionals have used our research since 2012.

    AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews while Talend Open Studio is ranked 5th in Data Integration with 47 reviews. AWS Glue is rated 7.8, while Talend Open Studio is rated 8.0. The top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". On the other hand, the top reviewer of Talend Open Studio writes "An open-source ETL tool, when deployed on-premises, requiring an easy installation phase". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, SSIS, Informatica Cloud Data Integration and Confluent, whereas Talend Open Studio is most compared with SSIS, IBM InfoSphere DataStage, Azure Data Factory, Talend Data Fabric and Oracle Data Integrator (ODI). See our AWS Glue vs. Talend Open Studio report.

    See our list of best Cloud Data Integration vendors.

    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.