Architect at Elections Canada | Elections Canada
Real User
Top 20
A stable solution that is easy to install and works well so far, but the error-handling is not user-friendly and it needs a built-in tool for geospatial data
Pros and Cons
  • "I didn't have many problems installing it. It seemed very straightforward to me."
  • "I think my biggest problem with the tool is that the errors are very hard to debug."

What is our primary use case?

I use the open-source version of this solution. It's data integration-related. Right now, I'm doing some legacy TL into a new enterprise data repository.

What needs improvement?

When the tool has an error, the error-handling is not that user-friendly. It might just be my inexperience with the tool, but I struggle a lot with finding my layers sometimes. I think my biggest problem with the tool is that the errors are very hard to debug.

Our special need is geospatial data, so that could be included as part of the tool and not only as an add-on. It's always hard to do add-ons and tools when it's a third party, and every time there's going to be an upgrade, you don't know if it will keep working. It would be great if they were included as part of the main tool, like FME. That would be my main concern about the Talend tool right now.

For how long have I used the solution?

I have been using this solution for about a month. 

What do I think about the stability of the solution?

The solution is stable. 

Buyer's Guide
Talend Open Studio
March 2024
Learn what your peers think about Talend Open Studio. Get advice and tips from experienced pros sharing their opinions. Updated: March 2024.
765,234 professionals have used our research since 2012.

What do I think about the scalability of the solution?

The solution has add-ons that you can add. I'm dealing with geospatial data, and you have to have add-ons to do that, but I haven't gotten a chance to add them yet. The first add-on I tried didn't work, but I think it might just be outdated and not compatible with the version of the product that I have.

There are two of us using this solution, my colleague and myself, but in the next month or so we are going to get the Professional version in a private cloud environment. We're exploring the tool to decide if it's going to become our tool of choice for the data repository and data warehouse. Right now, we're working locally with the open-source version of it, but we're trying to decide if Talend is the right tool for us and in order to do that, we want to have the proper version of Talend in the proper environment and then do all the tests.

Which solution did I use previously and why did I switch?

FME has been used for a long time at my company, strictly for managing geospatial data. Now, we're creating an enterprise repository, and we need to deal with the regular data plus the geospatial data together, so we're trying to find a tool that will be able to handle both of these types of data, and manage all of the enterprise requirements that we have. We have used solutions like FME, but I don't believe there was any other tool that was used on the scale that we're trying to use Talend now.

How was the initial setup?

The initial setup was pretty straightforward. I didn't have many problems installing it. It seemed very straightforward to me. 

Deployment can take a few hours, depending on your environment, because it requires some things that you may need to install if you're missing them. I think it took me probably under an hour to do it.

What about the implementation team?

I did the deployment myself on my local workstation. 

What's my experience with pricing, setup cost, and licensing?

Right now, because we're using the open-source version, there's no cost. However, down the road when we use the Professional version, there will be costs. I don't know what the cost will be because I'm not involved in that, but I know that there is a license at that point.

What other advice do I have?

I would rate this solution as a seven out of ten. 

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Developer with 1-10 employees
Real User
Top 20
A complete product with good integrations and excellent flexibility
Pros and Cons
  • "The solution has a good balance between automated items and the ability for a developer to integrate and extend what he needs. Other competing tools do not offer the same grade of flexibility when you need to go beyond what is provided by the tool. Talend, on the other hand, allows you to expand very easily."
  • "The server-side should be completely revamped."

What is our primary use case?

I primarily use the solution for integration. I consolidate data from several different databases and spreadsheets and merge systems into Amazon Redshift.

What is most valuable?

The solution has a good balance between automated items and the ability for a developer to integrate and extend what he needs. Other competing tools do not offer the same grade of flexibility when you need to go beyond what is provided by the tool. Talend, on the other hand, allows you to expand very easily.

We have a good integration with Artifactory and a good integration with GitHub. 

I don't see the need for anything. At the moment, Talend is a good complete product while at the same time not being overwhelming.

What needs improvement?

What I really don't like is the TAC, which is the Talent Administration Console. It's currently slow, old technology. It's obsolete and ugly to use. The studio is great, whereas TAC is on the other side of the spectrum. Actually, in AWS, in many cases, we are ditching the TAC and we are using Amazon-provided services, like Lambda functions, to code the Talend files produced by Talend Studio.

The server-side should be completely revamped.

For how long have I used the solution?

I've been dealing with the solution for more than three years.

What do I think about the scalability of the solution?

There are two pairs of version keys that parallelize, which is not available in the free version. However, when you can parallelize your jobs you can achieve a good degree of scalability. This is one of the points where Talend shines compared to, for example, Informatica. Informatica is designed to do one record at a time. It requires expert support and general competence in the product.

Talend, by default, can be very efficient, especially using the bulk load and bulk insert components, which, at the moment we are using. We are moving to approximately 10 terabytes of data daily. Notwithstanding the load, we had to tweak very, very little. Using standard components we could achieve our overall needs.

How are customer service and technical support?

The fact that the technical support is out of China means that sometimes the engineers don't have a good command of the English language. That could hinder the overall experience. Apart from that, most of the incidents were resolved in 48 hours. Therefore, we're pretty satisfied with the level of service we've received.

Which solution did I use previously and why did I switch?

I have previously worked with Informatica and TIBCO. I still believe Talent is superior. 

How was the initial setup?

Obviously, being on AWS, we're using Linux AMI machines or vertical machines, using two machines. The setup was very easy. The only difficult part was to set up to the current time zone, however, this is nothing to do with Talend. We were having issues when Talend jobs were reporting the wrong date when they arrived because the set up was ETC rather than the local time zone.

Setting up Talend and setting up the administration console at the Tac server was very, very easy for somebody with a Linux System Administration skillset.

What about the implementation team?

I set up the solution on my own.

What's my experience with pricing, setup cost, and licensing?

I don't deal with corporate and pricing. I am just an IT Consultant. I honestly don't know the costs to run the solution. I don't know the price. I know people in the finance department are often complaining about the price, however, I don't know what the exact number is. 

It's all relative, after all. I have a very strong Oracle background and the Oracle price is totally outrageous. I don't believe that Talend prices are in the same ballpark as Oracle. I don't understand why people are complaining. That said, I don't deal with signing contracts and processes so I really don't know.

What other advice do I have?

I am an IT Consultant. I use it currently in my job. I'm providing services to a company in Australia, using Talend.

I would advise others to use the Studio. If you have to pay for some license to use the key parallelize component, it's a good idea to do it. However, don't use the TAC. Use the other orchestration services like Control-M or AWS functions such as AWS Lambda. Don't use the TAC. The TAC is really ugly.

Overall, I'd rate the solution eight out of ten. I'd rate it higher, however, the TAC is unreliable. It's a big part of the solution, and, while I do really appreciate Talend Studio and the ability to link into a producer to make a Java code, which is ugly, I don't care because the job is done anyway. The fact that the orchestration is creating execution plans, from the TAC it's fine. However, when the running of the execution plan, sometimes there are issues. The job remains stuck on the actual server. I've had a lot of issues with TAC. 

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Talend Open Studio
March 2024
Learn what your peers think about Talend Open Studio. Get advice and tips from experienced pros sharing their opinions. Updated: March 2024.
765,234 professionals have used our research since 2012.
Real User
Top 20
Good for data operations but outdated with poor support
Pros and Cons
  • "Open Studio's best features are that it's user-friendly, even for beginners, and very easy to implement."
  • "In the next release, Open Studio should include cloud storage as an input."

What is our primary use case?

I use Open Studio for the day-to-day management of our data architecture warehouse. 

What is most valuable?

Open Studio's best features are that it's user-friendly, even for beginners, and very easy to implement.

What needs improvement?

Open Studio's pipelines could be improved. It's also lagging behind as the market moves toward the cloud and needs to provide better serviceability. In the next release, Open Studio should include cloud storage as an input.

For how long have I used the solution?

I've been using Open Studio for four to five years.

What do I think about the stability of the solution?

Open Studio's stability isn't great - it seems to need a patch every month and has a lot of security issues.

How are customer service and support?

Talend's technical support isn't very good and not worth the money we spend on it.

How would you rate customer service and support?

Neutral

How was the initial setup?

The initial setup is pretty easy.

What's my experience with pricing, setup cost, and licensing?

Open Studio has a basic license and additional costs for services, including customer support and technical assistance.

What other advice do I have?

I would recommend Open Studio for data operations, but potential customers should be aware of the lack of support and technical expertise Talend provides. I rate Open Studio as six out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Solution Leader at HORSA
Real User
Top 5
The solution works with several architectures but occasionally has problems with stability
Pros and Cons
  • "There are many architectures: hybrid, cloud, and on-prem."
  • "I rate Talend Open Studio's stability an eight out of ten. Talend has some problems sometimes."

What is our primary use case?

Talend Open Studio is mainly with data warehouses. It is also used for data masks. There are so many types of projects we can do with Talend.

What is most valuable?

There are many things I like about Talend since I'm a Talend enthusiast. There are many architectures: hybrid, cloud, and on-prem. We can use the solution on a Linux or Windows architecture. There are many solutions for the same problem when we get experience and build a relationship with the tool. The tool also has a strong community.

For how long have I used the solution?

I've used Talend Open Studio since 2016.

What do I think about the stability of the solution?

I rate Talend Open Studio's stability an eight out of ten. Talend has some problems sometimes.

What do I think about the scalability of the solution?

The tool is not scalable, but the solutions you create with the tool are.

How was the initial setup?

Open Studio's initial setup is pretty easy. When installing the solution, you get installation on the cloud console, and you get the installation with Java integrated. The easy configuration allows a quick start.

However, you need some lessons. I give lessons to customer companies. The course has four lessons per year. It's quite simple to learn how to use Talend.

What's my experience with pricing, setup cost, and licensing?

Open Studio is free to use. Without a license, you cannot use versioning, and you have to use an external scheduler. The product is not so expensive compared to other alternatives. Talend Open Studio costs about 11,000 a year.

What other advice do I have?

I rate Talend Open Studio a seven out of ten. I recommend this solution.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
Flag as inappropriate
PeerSpot user
Clarence Gitahi - PeerSpot reviewer
Data engineer at Zealtechdata.co.ke
Real User
Top 5
It has a straightforward initial setup process and provides efficient data integration
Pros and Cons
  • "The product is easy to install and configure. It is one of the best tools for data integration."
  • "It is complicated to understand the configuration process for email components."

What is most valuable?

The product is easy to install and configure. It is one of the best tools for data integration.

What needs improvement?

It is complicated to understand the configuration process for email components.

For how long have I used the solution?

We have been using Talend Open Studio for two to three months.

What do I think about the stability of the solution?

I rate the product’s stability a nine out of ten.

What do I think about the scalability of the solution?

I rate the product’s scalability a nine out of ten. Currently, we have one user in our organization. We plan to add more users in the future.

How was the initial setup?

The initial setup is straightforward. It takes a couple of hours to complete.

What's my experience with pricing, setup cost, and licensing?

It is an open-source product.

What other advice do I have?

I rate Talend Open Studio a nine out of ten. It is an interesting tool to learn. It has many components compared to other products. It supports multiple operating systems and platforms.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Flag as inappropriate
PeerSpot user
Archan Chatterje - PeerSpot reviewer
Consultant at Keyrus
Real User
A solution that offers good scalability and stability with a responsive technical support team
Pros and Cons
  • "The initial setup was quite straightforward. The deployment took between two and three days."
  • "The profiling perspective needs improvement. Instead of using it in the studio, we are using a different tool which is also provided by Talend. It's redundant."

What is our primary use case?

We primarily use the solution just to pull the data from the source to the landing zone. We are using the administrative console as well.

What needs improvement?

The profiling perspective needs improvement. Instead of using it in the studio, we are using a different tool which is also provided by Talend. It's redundant. They should remove that from studio to make it more lightweight or improve upon its interface.

The two things that Talend lacks is an MDM and the CDC. I know Talend has both of them, but both of them are not exactly usable in actual scenarios. I know that Talend has some integrations with other companies for MDM, but I'm not sure what they're doing for CDC. Maybe Talend can work something out regarding the CDC feature. I know the solution has its own CDC, but they're not focused on it as much as people would like them to be. I would like to know what their plans for it are or if they plan to partner with another organization to market it.

For how long have I used the solution?

I've been using the solution for four or five years.

What do I think about the stability of the solution?

Currently, the solution is stable. However, three or four years back, it used to be a bit unstable. The software is quite stable as well.

What do I think about the scalability of the solution?

The solution has quite a good range of scalability. The jobs are quite scalable since we have now integrated them with Docker so we can follow all the principles of microservices. 

We have a license for five users. We have someone focused on data quality. Someone else is looking into the solution as a tester, and testers are working on the stewardship tool to correct the data and verify it with the business. Then there are two free developers who are solely working on the Open Studio to turn out code. 

We do have plans to increase usage.

How are customer service and technical support?

I don't have any complaints with technical support. They've been very responsive.

Which solution did I use previously and why did I switch?

We previously used to use a wide variety of solutions, including Informatica.

How was the initial setup?

The initial setup was quite straightforward. The deployment took between two and three days. 

We deployed it in our Unix boxes using a third-party tool called Jenkins.

The whole deployment model is automated and it happens every three weeks. There are just two people needed for maintenance.

What about the implementation team?

We are the consultants. We handled the implementation ourselves for our clients.

What's my experience with pricing, setup cost, and licensing?

There are costs above the standard licensing fee, for example, if you need storage space.

Which other solutions did I evaluate?

We evaluated other options, such as Azure Data Factory.

What other advice do I have?

We are using the AWS public cloud deployment model.

I would recommend the product. As long as you follow the best practices you will get what you want out of it.

I would rate the solution eight out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
VP Bigdata and Analytics at Pharos Analytics Lab
Real User
Top 20
An easy-to-install product that supports ETL processes
Pros and Cons
  • "The initial setup of the product was very easy."
  • "Talend Open Studio is in Java language, and right now, you can only use the debug functionality in Java. I see that people who know programming languages other than Java currently face difficulties."

What is our primary use case?

My company uses Talend Open Studio to support ETL processes.

What is most valuable?

The most valuable feature of the solution is that it is an open-source tool. For all the components of the tool, you can work on the desktop version, and later on, once the component and ETL transformation are completed, we can move to the cloud. With other tools, trial versions are available only for one month. Talend makes available an open-source community version, which is helpful for ETL transformation.

What needs improvement?

Talend Open Studio is in Java language, and right now, you can only use the debug functionality in Java. I see that people who know programming languages other than Java currently face difficulties. The aforementioned area can be of concern where improvements can be considered to help non-Java users.

For how long have I used the solution?

I have been using Talend Open Studio for a year. I use the solution's latest version. I work as a system integrator in my company.

What do I think about the stability of the solution?

Stability-wise, I rate the solution an eight out of ten.

What do I think about the scalability of the solution?

Scalability-wise, I rate the solution an eight out of ten.

Four people in my company, consisting of data engineers, data analysts, and data scientists, use the solution.

How are customer service and support?

I haven't communicated with the solution's technical support since one can quickly grab any information from the product's documentation.

Which solution did I use previously and why did I switch?

Informatica and Azure are some of the products I have used in the past. Based on clients' requirements, my company chose Talend Open Studio over Informatica.

How was the initial setup?

The initial setup of the product was very easy.

The solution is deployed on an on-premises model.

Based on the log file and how the transformation and aggregation work, within five three to five minutes, the transformation happens.

What other advice do I have?

There is no maintenance required for the product in my company.

I recommend the product to those who plan to use it.

I rate the overall solution a seven out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer:
Flag as inappropriate
PeerSpot user
PeerSpot user
(2IC) Senior System Analyst at a insurance company with 10,001+ employees
Real User
Creates a job stream that connects to multiple data sources, but needs better installation configuration for other databases
Pros and Cons
  • "The Talend Studio connected to the Talend MDM (Master Data Management) is the most valuable feature. Talend Studio is used to create a job stream that connects to multiple data sources, matches, compares or creates a golden record for overall identification. It also has a good catalogue of objects that can be dragged and dropped for building models."
  • "It needs better installation configuration for other databases. Although the installation allows you to select another database, this doesn't mean that all connection points in the application point to the database selected. You actually need to do a search through the entire install to locate the configuration settings and change them."
  • "In version 6.2 we did encounter issues with the job servers and specifically with ESB. Version 6.3 is better but large jobs can cause the MDM server to fall over, requiring a reboot."

How has it helped my organization?

By being able to cross-match records across multiple data sources and create a logical dataflow with options to place rejected records in a separate table, we are able to cleanse and create golden records in multiple categories. Rejected records, once identified, can be assessed for repair. This also means that we can identify how and where the rejected record occurred.

What is most valuable?

The Talend Studio connected to the Talend MDM (Master Data Management) is the most valuable feature. Talend Studio is used to create a job stream that connects to multiple data sources, matches, compares or creates a golden record for overall identification. It also has a good catalogue of objects that can be dragged and dropped for building models.

What needs improvement?

It needs better installation configuration for other databases. Although the installation allows you to select another database, this doesn't mean that all connection points in the application point to the database selected. You actually need to do a search through the entire install to locate the configuration settings and change them.

For how long have I used the solution?

One to three years.

What do I think about the stability of the solution?

In version 6.2 we did encounter issues with the job servers and specifically with ESB. Version 6.3 is better but large jobs can cause the MDM server to fall over, requiring a reboot.

We've built in some self-healing scripts to detect a loss of connectivity and force a restart of the services.

What do I think about the scalability of the solution?

Our Talend installation has been deployed onto Red Hat OpenStack, separating out MDM, TAC, DQ, and thee job servers. I made a point of determining data storage requirements for each server, and a memory ulimit setting to match the resource profile of the components. It was trial and error but it paid off by allowing the Talend system to process large jobs of 200-300 million records over a number of hours, rather than days.

How are customer service and technical support?

Support tends to be good for the usual types of issues, but once a problem gets more complex and deeply into the nuts and bolts of the product, support struggles.

Which solution did I use previously and why did I switch?

Initially we used Pentaho, however, it was determined that this was not as feature rich as Talend.

How was the initial setup?

The initial setup out of the box is straightforward. However, it becomes more complex as you start to distribute the components and get forced down a path of connecting to one type of database for all the components. In my case, I had to deploy Talend using RedHat Ansible and use only a PostgreSQL database.

I needed to first install the software, search for all references to H2 or PostgreSQL, change the configuration files, and then do it all over again for the distributed installs; then translate this into Ansible scripts. So although it's not directly Talend that made this complex, the installation by Talend gives the option to install to PostgreSQL but doesn't use PostgreSQL for all database repositories.

What's my experience with pricing, setup cost, and licensing?

Pricing and licensing are fairly straightforward. It is reasonably priced and managed. It's a good solution overall.

Which other solutions did I evaluate?

Pentaho, and prior to that SAS MDM which was similar but it was harder to create models. We also ran a PoC for IBM Infosphere MDM, but the cost was considered unacceptable.

What other advice do I have?

Make sure you have someone with technical skills and patience to install in a distributed deployment. Learn the product well and build in your own log shipping with either Splunk or Elastic or Telegraf to ease your diagnostic pains.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Download our free Talend Open Studio Report and get advice and tips from experienced pros sharing their opinions.
Updated: March 2024
Product Categories
Data Integration
Buyer's Guide
Download our free Talend Open Studio Report and get advice and tips from experienced pros sharing their opinions.