Our company uses the solution as a big data lake for storage and cherry-picking data sets using multiple languages. We create ETL pipelines, run them on a schedule, and export the data to visualize it.
We perform functional tests on the data sets using Excel in a Fusion Sheet. A schema is created that shows all data in columns and can be manipulated to extract meaningful information.
The Code Workbook is used to import data and write code using R, Python, Spark SQL, or PySpark. From there, you can perform calculations and create data sets.
Contour is the graphical user interface that gives us the available basic or automatic operations. You do not need a technical grasp because it is easy to use with knowledge of the basics and filters.
Across our company, there are 3,000 users who access our data lake.
The Code Workbook gives you the option to switch across built-in languages such as Spark SQL, PySpark, R, or Python.
Live video sessions enhance the available documentation and allow you to ask questions directly. There are a multitude of sessions within each framework that occur weekly. At the end of a session, you have the option to read other user's questions or ask questions yourself.
The GUI is easy to use and does not require advanced technical knowledge.
There is not a wide user base for the solution's online documentation so it is sometimes difficult to find answers. It is easy to find answers for code issues because Spark SQL and Python have wide user bases. There is a certain probability you won't find a solution-specific answer if you search for it. For example, there are certain errors that are specific to the solution. The more you use the solution, the more you understand it. The learning curve could be reduced with online documentation that includes the meaning of and troubleshooting for error codes.
Predefined code templates or informational prompts would help with writing syntaxes.
I have been using the solution for fourteen months.
The solution is very stable. On occasion, we receive an error but it is rectified within a few hours.
We create use cases that do not have processing limits. The solution is a big data tool so should handle any scalability.
I have not needed technical support.
I previously used Microsoft SQL which is a traditional database. The solution is an advancement because it is a direct jump to a big data source.
Comparing the solution to traditional databases is liking comparing an apple to a banana.
A different team handles the solution and our data lake so I don't have knowledge of the setup. Our team accesses the solution via a web link and creates use cases.
The solution has many features but I am only using the Code Workbook and Contour. Another feature called Slate allows you to create websites or record user data.
Based on my current usage, I rate the solution a seven out of ten.