We performed a comparison between AWS Glue and IBM InfoSphere Information Server based on real PeerSpot user reviews.
Find out in this report how the two Cloud Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
"It is AWS-integrated. There is end-to-end integration with the other AWS services. It is also user-friendly."
"The key role for Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients client have been very satisfied with it."
"Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you."
"I like that it's flexible, powerful, and allows you to write your own queries and scripts to get the needed transformations."
"The solution is stable and reliable."
"The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
"AWS Glue is a good solution for developers, they have the ability to write code in different languages and other software."
"AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code."
"IBM InfoSphere Information Server is stable."
"Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background."
"The monitoring is not that good."
"The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS."
"There should be more connectors for different databases."
"The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data."
"It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options."
"There is a learning curve to this tool."
"If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data."
"IBM InfoSphere Information Server should be more scalable. It should have the option to change the configuration to run on a single, non-multiple node, or multi-threading processing."
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL.
AWS Glue is ranked 2nd in Cloud Data Integration with 10 reviews while IBM InfoSphere Information Server is ranked 39th in Data Integration Tools with 1 review. AWS Glue is rated 8.2, while IBM InfoSphere Information Server is rated 7.0. The top reviewer of AWS Glue writes "Easy to perform ETL on multiple data sources, and easy to use after you learn it". On the other hand, the top reviewer of IBM InfoSphere Information Server writes "Prompt support, reliable, but lacking scalability". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, Talend Open Studio, Informatica Cloud Data Integration and SSIS, whereas IBM InfoSphere Information Server is most compared with IBM InfoSphere DataStage, Oracle GoldenGate, Qlik Replicate and IBM Watson Knowledge Catalog.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.