AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for running jobs, implementing business workflows, and authoring.
| Product | Market Share (%) |
|---|---|
| AWS Glue | 10.9% |
| AWS Database Migration Service | 8.5% |
| Informatica Intelligent Data Management Cloud (IDMC) | 7.0% |
| Other | 73.6% |
| Type | Title | Date | |
|---|---|---|---|
| Category | Cloud Data Integration | Dec 29, 2025 | Download |
| Product | Reviews, tips, and advice from real users | Dec 29, 2025 | Download |
| Comparison | AWS Glue vs AWS Database Migration Service | Dec 29, 2025 | Download |
| Comparison | AWS Glue vs Informatica Intelligent Data Management Cloud (IDMC) | Dec 29, 2025 | Download |
| Comparison | AWS Glue vs MuleSoft Anypoint Platform | Dec 29, 2025 | Download |
| Title | Rating | Mindshare | Recommending | |
|---|---|---|---|---|
| Informatica Intelligent Data Management Cloud (IDMC) | 4.0 | 7.0% | 92% | 214 interviewsAdd to research |
| MuleSoft Anypoint Platform | 4.0 | 5.9% | 92% | 60 interviewsAdd to research |
| Company Size | Count |
|---|---|
| Small Business | 11 |
| Midsize Enterprise | 6 |
| Large Enterprise | 29 |
| Company Size | Count |
|---|---|
| Small Business | 265 |
| Midsize Enterprise | 162 |
| Large Enterprise | 851 |
AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.
The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.
AWS Glue Features
AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:
AWS Glue Benefits
AWS Glue offers a wide range of benefits for its users. These benefits include:
Reviews from Real Users
Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.
Liana I., CEO at Quark Technologies SRL, describes AWS Glue as a highly scalable, reliable, and beneficial pay-as-you-go pricing model.
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
| Author info | Rating | Review Summary |
|---|---|---|
| Principal Consultant at a retailer with 1,001-5,000 employees | 4.0 | I use AWS Glue for ETL processes, including data transformation and cleansing for our data warehouse. Its serverless nature and excellent performance are beneficial, though the UI and version upgrades need improvement. Despite some challenges, Glue remains my preferred solution. |
| application security engineer at Hyperspace IT India | 4.0 | No summary available |
| Data Architect at a financial services firm with 10,001+ employees | 2.5 | I use AWS Glue primarily for data ingestion, curation, and transformation, benefiting from its compatibility with Python for big data. Though clunky and code-heavy, it suits limited pipelines well, especially with AWS, despite preferring GUI-based tools. |
| Python AWS & AI Expert at a tech consulting company | 4.0 | I use AWS Glue primarily for serverless integration across various services. Its valuable features include robust transformation capabilities and seamless data preparation, though the deployment process could be simplified. It's integrated with AWS for comprehensive data workflow management. |
| AVP at a manufacturing company with 10,001+ employees | 3.0 | I use AWS Glue in my company for building data lakes and processing data from various sources like Oracle and MongoDB. It's valuable for managing large data volumes serverlessly, but its high cost, especially if systems are poorly designed, poses significant challenges. |
| Offshore Delivery | AWS architect | Manager - Projects at Cognizant | 4.0 | I use AWS Glue for efficient data transformation and integration with Apache Airflow, enabling smooth orchestration without cold starts like AWS Lambda. Although managing environment variables could improve, Glue's extended session capability suits our needs better than other solutions. |
| Engineering Manager at Milestone Technologies | 3.5 | We use AWS Glue to build tables from CSV data, appreciating its versatile crawlers but facing challenges with output limitations and configuration. Despite past use of Spark, AWS Glue's managed service benefits us, though we're considering Databricks for future needs. |
| Technology Specialist at a consultancy with 10,001+ employees | 3.5 | I use AWS Glue for microservices triggered by events and for data computing. It's easy to run without frameworks, but its complex syntax and limited integration resources with third-party services pose challenges. I've worked with other languages like Python and .NET. |