Amazon SageMaker vs Databricks comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
11,742 views|9,310 comparisons
83% willing to recommend
Databricks Logo
28,975 views|18,474 comparisons
96% willing to recommend
Comparison Buyer's Guide
Executive Summary
Updated on Mar 6, 2024

We compared Databricks and Amazon SageMaker based on our user's reviews in several parameters.

Databricks offers seamless integration with various data sources, advanced analytics capabilities, and efficient customer service. Users appreciate the collaborative features and positive ROI. On the other hand, Amazon SageMaker is praised for its ease of use, comprehensive ML capabilities, and robust monitoring tools. Users find the pricing transparent and support team responsive.

Features: Databricks is known for its seamless integration with various data sources and platforms, collaborative capabilities, advanced analytics, and machine learning capabilities. On the other hand, Amazon SageMaker offers ease of use, comprehensive machine learning capabilities, seamless integration with other AWS services, customizable workflows, efficient model training and deployment, automated data labeling, and robust monitoring and troubleshooting tools.

Pricing and ROI: Databricks users have reported positive feedback on pricing, setup cost, and licensing. The setup cost is straightforward and hassle-free, while the license terms offer flexibility. Similarly, Amazon SageMaker users find the pricing reasonable, setup cost hassle-free, and licensing process clear and transparent., Users have reported positive outcomes and returns on investment with Databricks, appreciating its impact on efficiency, productivity, and data analysis capabilities. Similarly, Amazon SageMaker delivers positive ROI, providing value and benefits for businesses.

Room for Improvement: Databricks has room for improvement in aspects such as data visualization, monitoring and debugging tools, integration with external data sources and services, documentation and tutorials, and pricing flexibility. In comparison, users have identified areas for enhancement in Amazon SageMaker.

Deployment and customer support: Based on user reviews, there are varying durations required for deploying, setting up, and implementing a new tech solution on both Databricks and Amazon SageMaker. While some users mentioned spending three months on deployment and a week on setup for both products, it is important to evaluate the context to determine if these terms refer to the same period or should be considered separately., Customers have reported positive experiences with both Databricks and Amazon SageMaker customer service. Databricks is praised for its efficiency and proactive approach, while SageMaker is commended for its attentiveness and commitment to customer needs.

The summary above is based on 56 interviews we conducted recently with Databricks and Amazon SageMaker users. To access the review's full transcripts, download our report.

To learn more, read our detailed Amazon SageMaker vs. Databricks Report (Updated: March 2024).
768,578 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The solution is easy to scale...The documentation and online community support have been sufficient for us so far.""Feature Store, CodeCommit, versioning, model control, and CI/CD pipelines are the most valuable features in Amazon SageMaker.""The few projects we have done have been promising.""The most valuable feature of Amazon SageMaker is that you don't have to do any programming in order to perform some of your use cases.""The deployment is very good, where you only need to press a few buttons.""We were able to use the product to automate processes.""The most valuable feature of Amazon SageMaker is its integration. For example, AWS Lambda. Additionally, we can write Python code.""The tool makes our ML model development a bit more efficient because everything is in one environment."

More Amazon SageMaker Pros →

"When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks.""Databricks has improved my organization by allowing us to transform data from sources to a different format and feed that to the analytics, business intelligence, and reporting teams. This tool makes it easy to do those kinds of things.""The most valuable feature of Databricks is the integration with Microsoft Azure.""Databricks has helped us have a good presence in data.""Imageflow is a visual tool that helps make it easier for business people to understand complex workflows.""The setup is quite easy.""Easy to use and requires minimal coding and customizations.""The load distribution capabilities are good, and you can perform data processing tasks very quickly."

More Databricks Pros →

Cons
"AI is a new area and AWS needs to have an internship training program available.""The solution is complex to use.""Lacking in some machine learning pipelines.""In general, improvements are needed on the performance side of the product's graphical user interface-related area since it consumes a lot of time for a user.""The payment and monitoring metrics are a bit confusing not only for Amazon SageMaker but also for the range of other products that fall under AWS, especially for a new user of the product.""The pricing of the solution is an issue...In SageMaker, monitoring could be improved by supporting more data types other than JSON and CSV.""Creating notebook instances for multiple users is pretty expensive in Amazon SageMaker.""I would say the IDE is quite immature, but it is still in its infancy, so I expect it to get better over time."

More Amazon SageMaker Cons →

"I would like it if Databricks adopted an interface more like R Studio. When I create a data frame or a table, R Studio provides a preview of the data. In R Studio, I can see that it created a table with so many columns or rows. Then I can click on it and open a preview of that data.""The solution has some scalability and integration limitations when consolidating legacy systems.""There should be better integration with other platforms.""The stability of the clusters or the instances of Databricks would be better if it was a much more stable environment. We've had issues with crashes.""When I used the support, I had communication problems because of the language barrier with the agent. The accent was difficult to understand.""It should have more compatible and more advanced visualization and machine learning libraries.""I have had some issues with some of the Spark clusters running on Databricks, where the Spark runtime and clusters go up and down, which is an area for improvement.""I believe that this product could be improved by becoming more user-friendly."

More Databricks Cons →

Pricing and Cost Advice
  • "The pricing is complicated as it is based on what kind of machines you are using, the type of storage, and the kind of computation."
  • "The support costs are 10% of the Amazon fees and it comes by default."
  • "SageMaker is worth the money for our use case."
  • "Databricks solution is less costly than Amazon SageMaker."
  • "I would rate the solution's price a ten out of ten since it is very high."
  • "There is no license required for the solution since you can use it on demand."
  • "I rate the pricing a five on a scale of one to ten, where one is the lowest price, and ten is the highest price. The solution is priced reasonably. There is no additional cost to be paid in excess of the standard licensing fees."
  • "You don't pay for Sagemaker. You only pay for the compute instances in your storage."
  • More Amazon SageMaker Pricing and Cost Advice →

  • "Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful."
  • "I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly."
  • "Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery."
  • "We find Databricks to be very expensive, although this improved when we found out how to shut it down at night."
  • "The pricing depends on the usage itself."
  • "I am based in South Africa, where it is expensive adapting to the cloud, and then there is the price for the tool itself."
  • "The price is okay. It's competitive."
  • "Databricks uses a price-per-use model, where you can use as much compute as you need."
  • More Databricks Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
    768,578 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:We researched AWS SageMaker, but in the end, we chose Databricks Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It… more »
    Top Answer:The tool makes our ML model development a bit more efficient because everything is in one environment.
    Top Answer:The pricing is comparable. It is not very cheap. I rate the pricing an eight out of ten. The main reason why we're using it is because of its cost. We are aiming at keeping the costs at $100 per… more »
    Top Answer:Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with… more »
    Top Answer:Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their… more »
    Top Answer:Databricks is hosted on the cloud. It is very easy to collaborate with other team members who are working on it. It is production-ready code, and scheduling the jobs is easy.
    Ranking
    5th
    Views
    11,742
    Comparisons
    9,310
    Reviews
    11
    Average Words per Review
    536
    Rating
    7.2
    1st
    Views
    28,975
    Comparisons
    18,474
    Reviews
    47
    Average Words per Review
    441
    Rating
    8.3
    Comparisons
    Also Known As
    AWS SageMaker, SageMaker
    Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
    Learn More
    Overview

    Amazon SageMaker is a fully-managed platform that enables developers and data scientists to quickly and easily build, train, and deploy machine learning models at any scale. Amazon SageMaker removes all the barriers that typically slow down developers who want to use machine learning.

    Databricks is an industry-leading data analytics platform which is a one-stop product for all data requirements. Databricks is made by the creators of Apache Spark, Delta Lake, ML Flow, and Koalas. It builds on these technologies to deliver a true lakehouse data architecture, making it a robust platform that is reliable, scalable, and fast. Databricks speeds up innovations by synthesizing storage, engineering, business operations, security, and data science.

    Databricks is integrated with Microsoft Azure, Amazon Web Services, and Google Cloud Platform. This enables users to easily manage a colossal amount of data and to continuously train and deploy machine learning models for AI applications. The platform handles all analytic deployments, ranging from ETL to models training and deployment.

    Databricks deciphers the complexities of processing data to empower data scientists, engineers, and analysts with a simple collaborative environment to run interactive and scheduled data analysis workloads. The program takes advantage of AI’s cost-effectivity, flexibility, and cloud storage.

    Databricks Key Features

    Some of Databricks key features include:

    • Cloud-native: Works well on any prominent cloud provider.
    • Data storage: Stores a broad range of data, including structured, unstructured, and streaming.
    • Self-governance: Built-in governance and security controls.
    • Flexibility: Flexible for small-scale jobs as well as running large-scale jobs like Big Data processing because it’s built from Spark and is specifically optimized for Cloud environments.
    • Data science tools: Production-ready data tooling, from engineering to BI, AI, and ML.
    • Familiar languages: While Databricks is Spark-based, it allows commonly used programming languages like R, SQL, Scala, and Python to be used.
    • Team sharing workspaces: Creates an environment that provides interactive workspaces for collaboration, which allow multiple members to collaborate for data model creation, machine learning, and data extraction.
    • Data source: Performs limitless Big Data analytics by connecting to Cloud providers AWS, Azure, and Google, as well as on-premises SQL servers, JSON and CSV.

    Reviews from Real Users

    Databricks stands out from its competitors for several reasons. Two striking features are its collaborative ability and its ability to streamline multiple programming languages.

    PeerSpot users take note of the advantages of these features. A Chief Research Officer in consumer goods writes, “We work with multiple people on notebooks and it enables us to work collaboratively in an easy way without having to worry about the infrastructure. I think the solution is very intuitive, very easy to use. And that's what you pay for.”

    A business intelligence coordinator in construction notes, “The capacity of use of the different types of coding is valuable. Databricks also has good performance because it is running in spark extra storage, meaning the performance and the capacity use different kinds of codes.”

    An Associate Manager who works in consultancy mentions, “The technology that allows us to write scripts within the solution is extremely beneficial. If I was, for example, able to script in SQL, R, Scala, Apache Spark, or Python, I would be able to use my knowledge to make a script in this solution. It is very user-friendly and you can also process the records and validation point of view. The ability to migrate from one environment to another is useful.”

    Sample Customers
    DigitalGlobe, Thomson Reuters Center for AI and Cognitive Computing, Hotels.com, GE Healthcare, Tinder, Intuit
    Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
    Top Industries
    REVIEWERS
    Computer Software Company22%
    Manufacturing Company11%
    Logistics Company11%
    Transportation Company11%
    VISITORS READING REVIEWS
    Financial Services Firm17%
    Educational Organization13%
    Computer Software Company11%
    Manufacturing Company7%
    REVIEWERS
    Computer Software Company25%
    Financial Services Firm16%
    Retailer9%
    Manufacturing Company9%
    VISITORS READING REVIEWS
    Financial Services Firm15%
    Computer Software Company12%
    Manufacturing Company8%
    Healthcare Company6%
    Company Size
    REVIEWERS
    Small Business16%
    Midsize Enterprise42%
    Large Enterprise42%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise17%
    Large Enterprise68%
    REVIEWERS
    Small Business27%
    Midsize Enterprise14%
    Large Enterprise59%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise11%
    Large Enterprise71%
    Buyer's Guide
    Amazon SageMaker vs. Databricks
    March 2024
    Find out what your peers are saying about Amazon SageMaker vs. Databricks and other solutions. Updated: March 2024.
    768,578 professionals have used our research since 2012.

    Amazon SageMaker is ranked 5th in Data Science Platforms with 18 reviews while Databricks is ranked 1st in Data Science Platforms with 78 reviews. Amazon SageMaker is rated 7.2, while Databricks is rated 8.2. The top reviewer of Amazon SageMaker writes "Easy to use and manage, but the documentation does not have a lot of information". On the other hand, the top reviewer of Databricks writes "A nice interface with good features for turning off clusters to save on computing". Amazon SageMaker is most compared with Azure OpenAI, Google Vertex AI, Domino Data Science Platform, Microsoft Azure Machine Learning Studio and Dataiku Data Science Studio, whereas Databricks is most compared with Informatica PowerCenter, Dataiku Data Science Studio, Microsoft Azure Machine Learning Studio, Dremio and Azure Stream Analytics. See our Amazon SageMaker vs. Databricks report.

    See our list of best Data Science Platforms vendors.

    We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.