AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for authoring and running jobs and for implementing business workflows.
Product | Market Share (%) |
---|---|
AWS Glue | 17.2% |
AWS Database Migration Service | 13.4% |
MuleSoft Anypoint Platform | 6.5% |
Other | 62.9% |
Type | Title | Date |
---|---|---|
Category | Cloud Data Integration | Aug 29, 2025 |
Product | Reviews, tips, and advice from real users | Aug 29, 2025 |
Comparison | AWS Glue vs AWS Database Migration Service | Aug 29, 2025 |
Comparison | AWS Glue vs Informatica Intelligent Data Management Cloud (IDMC) | Aug 29, 2025 |
Comparison | AWS Glue vs MuleSoft Anypoint Platform | Aug 29, 2025 |
Title | Rating | Mindshare | Recommending | Interviews |
---|---|---|---|---|
Informatica Intelligent Data Management Cloud (IDMC) | 4.0 | 5.4% | 93% | 186 |
MuleSoft Anypoint Platform | 4.0 | 6.5% | 92% | 59 |
Company Size | Count |
---|---|
Small Business | 11 |
Midsize Enterprise | 6 |
Large Enterprise | 29 |
Company Size | Count |
---|---|
Small Business | 321 |
Midsize Enterprise | 216 |
Large Enterprise | 1197 |
AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.
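As a rough illustration of working with the centralized data catalog from code, the sketch below walks every table in one catalog database and summarizes its columns. It assumes a boto3-style Glue client whose `get_tables` call returns `TableList` and an optional `NextToken`. The helper names `iter_catalog_tables` and `summarize_columns` and the database name `sales_db` are hypothetical and chosen only for this example.

```python
from typing import Any, Dict, Iterator, List


def iter_catalog_tables(glue_client: Any, database: str) -> Iterator[Dict[str, Any]]:
    """Yield every table definition in one Data Catalog database.

    `glue_client` is assumed to behave like boto3.client("glue"):
    get_tables(DatabaseName=..., NextToken=...) returns a dict with
    "TableList" and, when more pages exist, "NextToken".
    """
    kwargs: Dict[str, Any] = {"DatabaseName": database}
    while True:
        page = glue_client.get_tables(**kwargs)
        yield from page.get("TableList", [])
        token = page.get("NextToken")
        if not token:
            break
        # Ask for the next page on the following iteration.
        kwargs["NextToken"] = token


def summarize_columns(table: Dict[str, Any]) -> List[str]:
    """Return 'name: type' strings for a table's columns (empty if none)."""
    columns = table.get("StorageDescriptor", {}).get("Columns", [])
    return [f"{col['Name']}: {col.get('Type', 'unknown')}" for col in columns]


# Usage with real AWS credentials (not executed here):
#   import boto3
#   glue = boto3.client("glue")
#   for table in iter_catalog_tables(glue, "sales_db"):  # hypothetical database
#       print(table["Name"], summarize_columns(table))
```

Passing the client in as an argument keeps the helper independent of any particular account or region and makes it easy to exercise with a stub.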
The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.
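To make the job-run monitoring described above more concrete, here is a minimal sketch that starts a job and polls until it reaches a final state. It assumes a boto3-style Glue client exposing `start_job_run` and `get_job_run`. The function name `run_job_and_wait`, the polling defaults, and the job name and argument in the usage comment are assumptions made for illustration, not part of the product description.

```python
import time
from typing import Any, Dict, Optional

# Job run states treated as final in this sketch.
TERMINAL_STATES = {"SUCCEEDED", "FAILED", "STOPPED", "TIMEOUT", "ERROR", "EXPIRED"}


def run_job_and_wait(
    glue_client: Any,
    job_name: str,
    arguments: Optional[Dict[str, str]] = None,
    poll_seconds: float = 30.0,
    max_polls: int = 120,
) -> str:
    """Start a Glue job run and return its final state.

    `glue_client` is assumed to behave like boto3.client("glue").
    Raises TimeoutError if no final state is seen within `max_polls` checks.
    """
    kwargs: Dict[str, Any] = {"JobName": job_name}
    if arguments:
        kwargs["Arguments"] = arguments
    run_id = glue_client.start_job_run(**kwargs)["JobRunId"]

    for _ in range(max_polls):
        run = glue_client.get_job_run(JobName=job_name, RunId=run_id)["JobRun"]
        state = run["JobRunState"]
        if state in TERMINAL_STATES:
            return state
        time.sleep(poll_seconds)

    raise TimeoutError(f"Job {job_name} run {run_id} did not finish in time")


# Usage with real AWS credentials (not executed here):
#   import boto3
#   state = run_job_and_wait(
#       boto3.client("glue"),
#       "nightly-etl",                         # hypothetical job name
#       {"--target_date": "2025-08-29"},       # hypothetical job argument
#   )
```

Polling is the simplest option for a short script; for production monitoring, event-based notifications are usually a better fit than a loop like this one.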
AWS Glue Features
AWS Glue groups its features into four categories: discover, prepare, integrate, and transform.
AWS Glue Benefits
AWS Glue offers a wide range of benefits for its users. These benefits include:
Reviews from Real Users
Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it works well for serverless data transformations.
Liana I., CEO at Quark Technologies SRL, describes AWS Glue as highly scalable and reliable, with a beneficial pay-as-you-go pricing model.
bp, Cerner, Expedia, FINRA, Hess, Intuit, Kellogg's, Philips, TIME, Workday
Author info | Rating | Review Summary |
---|---|---|
Principal Consultant at a retailer with 1,001-5,000 employees | 4.0 | I use AWS Glue for ETL processes, including data transformation and cleansing for our data warehouse. Its serverless nature and excellent performance are beneficial, though the UI and version upgrades need improvement. Despite some challenges, Glue remains my preferred solution. |
Application Security Engineer at Hyperspace IT India | 4.0 | No summary available |
Data Architect at a financial services firm with 10,001+ employees | 2.5 | I use AWS Glue primarily for data ingestion, curation, and transformation, and I benefit from its compatibility with Python for big data. Though clunky and code-heavy, it suits a limited number of pipelines well, especially within AWS, although I prefer GUI-based tools. |
Python AWS & AI Expert at a tech consulting company | 4.0 | I use AWS Glue primarily for serverless integration across various services. Its valuable features include robust transformation capabilities and seamless data preparation, though the deployment process could be simplified. It's integrated with AWS for comprehensive data workflow management. |
Offshore Delivery, AWS Architect, Manager - Projects at Cognizant | 4.0 | I use AWS Glue for efficient data transformation and integration with Apache Airflow, enabling smooth orchestration without cold starts like those in AWS Lambda. Although the management of environment variables could improve, Glue's extended session capability suits our needs better than other solutions. |
Senior Developer for cloud services at Coforge Growth Agency | 4.5 | I primarily use AWS Glue for data ingestion and extraction from multiple sources. The Glue Crawler efficiently updates schemas for large datasets, and the orchestration of ETL pipelines is effective. Improvements are needed in Lambda functions' resource allocation and timeout management. |
Site Reliability Engineer (AWS) at KFin Technologies Ltd | 5.0 | I use AWS Glue for data-intensive tasks such as data lake creation and machine learning pipelines. Its valuable features include the Data Catalog for metadata management and easy data transformation. The platform could benefit from improved speed and reliability in processing. |
Senior Vice President & Global Head AWS BU at a tech services company with 10,001+ employees | 4.0 | As the global lead for AWS solutions, we utilize AWS Glue for its serverless data integration and seamless AWS service compatibility in cloud migrations. Its cost-effectiveness and security are valuable, though improved metadata management and integration with legacy products are needed. |