What is our primary use case?
Amazon Redshift is our primary analytics solution: we receive large batches of data every day from different restaurant sites. We use it as a cloud data warehouse for storing and analyzing large data sets, loading data from S3 and other sources into Redshift and then using SQL queries and reporting tools to provide insights to business users.
In our analytics and reporting process, S3 is the source. We load the data into Redshift and SQL Server, which are our primary targets, and we use Informatica PowerCenter and SnapLogic to perform the transformations while we transition from on-premises to cloud platforms.
What is most valuable?
Amazon Redshift offers massively parallel processing (MPP), which lets it handle millions of records, along with integration with other AWS services such as S3 and QuickSight. Its columnar storage supports high performance and fast data retrieval compared to other tools.
The integration with S3 has helped my team immensely: S3 is our starting source, and the direct integration with Amazon Redshift simplifies the transformation and loading process. Columnar storage has also improved our performance significantly, because we can read specific columns from tables holding years of data much faster than with row-oriented tools.
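To show why a column layout helps analytic queries, here is a toy sketch in plain Python. It is not how Redshift stores data internally; the table contents and the helper names (`to_columns`, `sum_row_store`, `sum_column_store`) are made up for illustration. The point is only that an aggregate over one field has to walk every full record in a row layout, while a column layout touches just the one list it needs.

```python
# Toy illustration only: Redshift's real storage engine is far more involved.
# All data and function names here are hypothetical.

rows = [
    {"site_id": 1, "order_date": "2024-01-01", "total": 25.0},
    {"site_id": 2, "order_date": "2024-01-01", "total": 40.0},
    {"site_id": 1, "order_date": "2024-01-02", "total": 15.5},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


def sum_row_store(records, column):
    """Row layout: every full record is visited to read one field."""
    return sum(record[column] for record in records)


def sum_column_store(columns, column):
    """Column layout: only the requested column's values are touched."""
    return sum(columns[column])


columns = to_columns(rows)
row_total = sum_row_store(rows, "total")
col_total = sum_column_store(columns, "total")
```

Both functions return the same total; the difference is how much data each one has to read to get there, which is what matters on tables with many columns and years of history.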
Amazon Redshift has positively impacted our organization by significantly improving query performance and speed. Features such as easy integration with S3 allow us to process millions of records efficiently, saving nearly one hour per day on our project.
What needs improvement?
Amazon Redshift could improve by enhancing its UI to be more user-friendly for non-technical users and by offering better cost management, as similar tools tend to be less expensive. Additionally, integration with AI features could elevate its capabilities and popularity.
For how long have I used the solution?
I have been using Amazon Redshift for the past eight months.
What do I think about the stability of the solution?
Amazon Redshift has been stable with no crashes or reliability issues since I joined this organization.
What do I think about the scalability of the solution?
Amazon Redshift's scalability has met my expectations, especially with the recently added data volume from new restaurant sites. We increased our storage space successfully, and the process was smooth, with no server crashes before or after scaling.
We recently scaled Amazon Redshift storage by 500 GB to accommodate the additional data from approximately 20 to 25 new restaurants. This demonstrated that the backup feature and scalability in Amazon Redshift were crucial during this process.
How are customer service and support?
My experience with Amazon Redshift's customer support has been positive. When I needed setup assistance and credentials, they quickly helped me and provided documentation that allows anyone with prior knowledge of Redshift or SQL to resolve technical issues.
Which solution did I use previously and why did I switch?
I have not used a previous solution in this organization, but I also work with SQL Server, Oracle SQL Developer, and GCP BigQuery. In comparison, Amazon Redshift excels in query performance, scalability, and security, and I rank it second only to GCP BigQuery.
How was the initial setup?
When I first joined my team a year ago, they had been using Amazon Redshift for the past five to six years. While I did not face any initial issues, I am uncertain if my colleagues experienced any difficulties during the initial setup.
What was our ROI?
We earned back our investment in Amazon Redshift within the first year. Initially, there were 25 employees for development, but now only four maintain Amazon Redshift daily, saving significant resources and allowing us to use the saved time for other tasks.
What's my experience with pricing, setup cost, and licensing?
My experience with pricing and setup cost is satisfactory, as we have been loyal clients of Amazon since the beginning of our organization. However, I am not aware of the exact costs associated with our project since I am not part of the management team.
Which other solutions did I evaluate?
My team did not evaluate other options before choosing Amazon Redshift, as it integrates seamlessly with our source, Amazon S3, making it a straightforward choice compared to other cloud tools.
What other advice do I have?
One example from our main use case: I used Amazon Redshift to resolve an issue with bundle data in our restaurants' profit sheet. I queried a few tables in Redshift and discovered that although coupons were being applied, they had not yet been added to our main base tables, which we later corrected.
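A check like this can be sketched as an anti-join: list the applied coupons that have no matching entry in the base table. The example below is only a sketch under assumptions. It uses Python's built-in `sqlite3` as a stand-in for Redshift, and the table names (`coupon_redemptions`, `base_sales`), columns, and rows are hypothetical, not our actual schema.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for Redshift.
# Table names, columns, and rows are hypothetical examples.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE coupon_redemptions (
        order_id INTEGER, coupon_code TEXT, discount REAL
    );
    CREATE TABLE base_sales (
        order_id INTEGER, coupon_code TEXT, net_total REAL
    );
    INSERT INTO coupon_redemptions VALUES
        (101, 'BUNDLE5', 5.0),
        (102, 'BUNDLE5', 5.0),
        (103, 'LUNCH10', 10.0);
    INSERT INTO base_sales VALUES
        (101, 'BUNDLE5', 20.0),
        (102, NULL, 30.0);
    """
)

# Anti-join: keep redemptions with no matching row in the base table.
MISSING_COUPONS_SQL = """
    SELECT r.order_id, r.coupon_code
    FROM coupon_redemptions AS r
    LEFT JOIN base_sales AS b
      ON b.order_id = r.order_id
     AND b.coupon_code = r.coupon_code
    WHERE b.order_id IS NULL
    ORDER BY r.order_id
"""

missing = conn.execute(MISSING_COUPONS_SQL).fetchall()
conn.close()
```

Here `missing` holds the orders whose coupon was applied but never reached the base table: one because the base row has no coupon recorded, and one because the order is absent from the base table altogether.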
The performance of Amazon Redshift when handling large data sets or complex queries is impressive. Thanks to its columnar storage and parallel processing, we can quickly extract data from tables with millions of rows.
Amazon Redshift integrates well with ETL tools such as Informatica PowerCenter and SnapLogic, which transform the data before loading it into the target tables. From those tables, we use Looker dashboards or FreeForm and Essbase to report insights to business users.
My advice for others considering Amazon Redshift: choose appropriate distribution and sort keys, since these are crucial design steps; use the COPY command for bulk loading from S3; review the documentation for a better understanding of the features; and consider Redshift Spectrum, which queries data in place in S3, to reduce storage costs and free resources for other tasks.
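To make the first two points concrete, here is a minimal sketch that only builds the SQL text for a table with a distribution key and sort keys, plus a COPY statement for a bulk load from S3. It does not connect to a cluster. The helper functions, table and column names, bucket path, and IAM role ARN are all placeholders I made up for illustration; check the exact options you need (file format, compression, region) against the Redshift documentation.

```python
# Sketch only: builds SQL strings and does not run them.
# All identifiers, the S3 path, and the role ARN are hypothetical placeholders.
# Values are interpolated directly, so never pass untrusted input.


def build_create_table(table, columns, dist_key, sort_keys):
    """Return a CREATE TABLE statement with DISTKEY and SORTKEY attributes."""
    col_defs = ",\n    ".join(f"{name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE TABLE {table} (\n    {col_defs}\n)\n"
        f"DISTKEY ({dist_key})\n"
        f"SORTKEY ({', '.join(sort_keys)});"
    )


def build_copy(table, s3_uri, iam_role_arn):
    """Return a COPY statement that bulk loads CSV files from an S3 prefix."""
    return (
        f"COPY {table}\n"
        f"FROM '{s3_uri}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"FORMAT AS CSV\n"
        f"IGNOREHEADER 1;"
    )


ddl = build_create_table(
    table="restaurant_sales",
    columns=[
        ("site_id", "INTEGER"),
        ("order_date", "DATE"),
        ("total", "DECIMAL(12,2)"),
    ],
    dist_key="site_id",                   # a column commonly used in joins
    sort_keys=["order_date", "site_id"],  # columns commonly used in filters
)

copy_sql = build_copy(
    table="restaurant_sales",
    s3_uri="s3://example-bucket/sales/2024/",
    iam_role_arn="arn:aws:iam::123456789012:role/ExampleRedshiftCopyRole",
)

print(ddl)
print(copy_sql)
```

Which columns make good keys depends on your own queries. As a general rule of thumb, the distribution key is a column that large tables are joined on, and the sort keys are columns that appear most often in range filters such as dates.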
I have covered all the important aspects, including features, disadvantages, and what I appreciate about Amazon Redshift. I would rate this product an 8 out of 10 based on my overall experience.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?