IBM InfoSphere DataStage and Azure Data Factory are both competitors in the data integration tools category. User feedback suggests Azure Data Factory has an edge due to its scalability and integration features.
Features: IBM InfoSphere DataStage offers high scalability, robust metadata management, and error logging capabilities, making it ideal for handling large data sets. Azure Data Factory provides ease of integration, flexibility, and extensive cloud capabilities, simplifying data pipeline management.
Room for Improvement: IBM InfoSphere DataStage needs a more user-friendly UI, enhanced cloud integration, and better connectivity with modern data sources. Azure Data Factory could benefit from refining data transformation features, improved integration with other Azure services, and a simplified pricing structure.
Ease of Deployment and Customer Service: IBM InfoSphere DataStage is predominantly on-premises or hybrid, presenting integration strengths but cloud adaptability challenges. Azure Data Factory's alignment with modern cloud deployments and Microsoft's global support network results in generally positive user satisfaction, though room for enhanced technical assistance exists.
Pricing and ROI: IBM InfoSphere DataStage is considered expensive, with comprehensive solutions for large enterprises providing potential long-term ROI benefits. Azure Data Factory's pay-as-you-go model is more accessible but poses unpredictability in cost estimation; it remains generally affordable with caution advised regarding scaling costs.
Our stakeholders and clients have expressed satisfaction with Azure Data Factory's efficiency and cost-effectiveness.
The technical support is responsive and helpful
The technical support from Microsoft is rated an eight out of ten.
The technical support for Azure Data Factory is generally acceptable.
We also have the flexibility to submit a feature request to be included as part of the wishlist, potentially becoming a product feature in subsequent releases.
IBM tech support has allocated dedicated resources, making it satisfactory.
Azure Data Factory is highly scalable.
The solution has a high level of stability, roughly a nine out of ten.
I suggest integrating some AI functionality to analyze data during the transition itself, providing insights such as null records, common records, and duplicates without running a separate pipeline or job.
The inability to connect local VMs and local servers into the data flow is a limitation that prevents giving Azure Data Factory a perfect score.
There is a problem with the integration with third-party solutions, particularly with SAP.
I wonder if it supports other areas, such as cloud environments with open source support, or EdgeShift.
The solution needs improvement in connectivity with big data technologies such as Spark.
The pricing is cost-effective.
It is considered cost-effective.
Pricing for IBM InfoSphere DataStage is moderate and not much expensive.
The orchestration features in Azure Data Factory are definitely useful, as it is not only for Azure Data Factory; we can also include DataBricks and other services for integrating the data solution, making it a very beneficial feature.
The platform excels in handling major datasets, particularly when working with Power BI for reporting purposes.
It connects to different sources out-of-the-box, making integration much easier.
It is straightforward from a design and development perspective, and also for deployment.
IBM InfoSphere DataStage is very scalable, allowing us to extend it according to our processing needs.
Product | Market Share (%) |
---|---|
Azure Data Factory | 5.6% |
IBM InfoSphere DataStage | 3.7% |
Other | 90.7% |
Company Size | Count |
---|---|
Small Business | 31 |
Midsize Enterprise | 19 |
Large Enterprise | 55 |
Company Size | Count |
---|---|
Small Business | 23 |
Midsize Enterprise | 4 |
Large Enterprise | 25 |
Azure Data Factory efficiently manages and integrates data from various sources, enabling seamless movement and transformation across platforms. Its valuable features include seamless integration with Azure services, handling large data volumes, flexible transformation, user-friendly interface, extensive connectors, and scalability. Users have experienced improved team performance, workflow simplification, enhanced collaboration, streamlined processes, and boosted productivity.
IBM InfoSphere DataStage is a high-quality data integration tool that aims to design, develop, and run jobs that move and transform data for organizations of different sizes. The product works by integrating data across multiple systems through a high-performance parallel framework. It supports extended metadata management, enterprise connectivity, and integration of all types of data.
The solution is the data integration component of IBM InfoSphere Information Server, providing a graphical framework for moving data from source systems to target systems. IBM InfoSphere DataStage can deliver data to data warehouses, data marts, operational data sources, and other enterprise applications. The tool works with various types of patterns - extract, transform and load (ETL), and extract, load, and transform (ELT). The scalability of the platform is achieved by using parallel processing and enterprise connectivity.
The solution has various versions, catering to different types of companies, which include the Server Edition, the Enterprise Edition, and the MVS Edition. Depending on which version a company has bought, different goals can be achieved. They include the following:
IBM InfoSphere DataStage can be deployed in various ways, including:
IBM InfoSphere DataStage Features
The tool has various features through which users can integrate and utilize their data effectively. The components of IBM InfoSphere DataStage include:
IBM InfoSphere DataStage Benefits
This solution offers many benefits for the companies that utilize it for data integration. Some of these benefits include:
Reviews from Real Users
A data/solution architect at a computer software company says the product is robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data.
Tirthankar Roy Chowdhury, team leader at Tata Consultancy Services, feels the tool is user-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.