

IBM InfoSphere DataStage and Azure Data Factory are competitors in the ETL and data integration market. Azure Data Factory appears to have the upper hand due to its cloud-native design and integration with Azure services.
Features: IBM InfoSphere DataStage is known for supporting high-volume data processing with a range of transformations, parallel processing, and integration options with IBM solutions. It offers strong metadata management and data quality capabilities. Azure Data Factory provides a cloud-native design with numerous connectors, seamless Azure services integration, and a drag-and-drop interface for ease of use.
Room for Improvement: IBM InfoSphere DataStage users face challenges with high costs, complex setup, and an outdated UI, along with calls for better cloud integration. Azure Data Factory users seek clearer documentation, simplified pricing, and improved integration with non-Microsoft products. Enhanced transformation features and better monitoring tools are also desired.
Ease of Deployment and Customer Service: IBM InfoSphere DataStage is primarily on-premises and struggles with cloud adoption, receiving mixed technical support reviews. Azure Data Factory, as a cloud-based service, is easy to deploy within the Azure ecosystem but needs better support for integration issues.
Pricing and ROI: IBM InfoSphere DataStage is expensive due to licensing and maintenance fees, making it suitable for large enterprises but a barrier for smaller firms. Azure Data Factory uses a pay-as-you-go model, offering cost flexibility, though its pricing can be unpredictable. Users report reduced data integration expenses and improved ROI.
Our stakeholders and clients have expressed satisfaction with Azure Data Factory's efficiency and cost-effectiveness.
The technical support from Microsoft is rated an eight out of ten.
The technical support is responsive and helpful
They are not slow on responding or very informative.
We also have the flexibility to submit a feature request to be included as part of the wishlist, potentially becoming a product feature in subsequent releases.
I rate their support as nine on a scale from one to ten.
IBM tech support has allocated dedicated resources, making it satisfactory.
Azure Data Factory is highly scalable.
If the job provided suggestions about running this kind of parallel processing and how many virtual nodes are required, it would help.
The solution has a high level of stability, roughly a nine out of ten.
Incorporating more dedicated API sources to specific services like HubSpot CRM or Salesforce would be beneficial.
Sometimes, the compute fails to process data if there is a heavy load suddenly, and it doesn't scale up automatically.
There is a problem with the integration with third-party solutions, particularly with SAP.
If the job itself gave some guidance, such as running this parallel processing with this many nodes, it would help; I think that is missing.
I wonder if it supports other areas, such as cloud environments with open source support, or EdgeShift.
The solution needs improvement in connectivity with big data technologies such as Spark.
The pricing is cost-effective.
It is considered cost-effective.
Pricing for IBM InfoSphere DataStage is moderate and not much expensive.
It connects to different sources out-of-the-box, making integration much easier.
The platform excels in handling major datasets, particularly when working with Power BI for reporting purposes.
Regarding the integration feature in Azure Data Factory, the integration part is excellent; we have major source connectors, so we can integrate the data from different data sources and also perform basic transformation while transforming, which is a great feature in Azure Data Factory.
It is straightforward from a design and development perspective, and also for deployment.
As we are a financial organization, security is our main concern, so we prefer enterprise tools.
I have leveraged IBM InfoSphere DataStage's integration with IBM's Information Server suite, and it is indeed beneficial.
| Product | Mindshare (%) |
|---|---|
| Azure Data Factory | 2.4% |
| IBM InfoSphere DataStage | 1.6% |
| Other | 96.0% |

| Company Size | Count |
|---|---|
| Small Business | 31 |
| Midsize Enterprise | 20 |
| Large Enterprise | 57 |
| Company Size | Count |
|---|---|
| Small Business | 23 |
| Midsize Enterprise | 4 |
| Large Enterprise | 26 |
Azure Data Factory efficiently manages and integrates data from various sources, enabling seamless movement and transformation across platforms. Its valuable features include seamless integration with Azure services, handling large data volumes, flexible transformation, user-friendly interface, extensive connectors, and scalability. Users have experienced improved team performance, workflow simplification, enhanced collaboration, streamlined processes, and boosted productivity.
IBM InfoSphere DataStage offers powerful ETL capabilities focusing on data transformation and integration, ensuring seamless data processing and management in complex environments. It is particularly valued for handling extensive data volumes with robust transformation features and scalability options.
IBM InfoSphere DataStage is renowned for its strength in data extraction, transformation, and loading, making it a preferred choice for businesses handling large datasets. It provides extensive database connectors, integrates efficiently with existing systems, and facilitates complex data transformations. Users appreciate its scalability, metadata management, and effectiveness in applying business rules. Despite this, areas for improvement include enhanced cloud integration, better error messaging, and expanded connectivity with modern databases. Its pricing scheme and deployment complexity also present considerations for potential users.
What are the key features of IBM InfoSphere DataStage?Businesses in sectors like telecommunications, banking, and insurance commonly implement IBM InfoSphere DataStage for ETL processes. It's used for integrating data from multiple sources into data warehouses, supporting business intelligence initiatives, and managing data quality. Known for efficiently handling integration of mainframes and Oracle databases, it supports complex data projects tailored to industry needs.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.