2021-11-25T06:01:00Z

What are the main storage requirements to support Artificial Intelligence and Deep Learning applications?


What should a storage vendor implement to support them? 

What should a customer be aware of?

PeerSpot user
Guest
24 Answers

TjeerdSaijoen - PeerSpot reviewer
Top 5, Real User

Storage requirements depend on several factors.


For example: the language you will use, the data you will receive, whether you use RAID, and so on.

2021-11-29T08:56:18Z
Shibu Babuchandran - PeerSpot reviewer
ExpertModerator, Real User

Hi @Evgeny Belenky,


HERE ARE THE STORAGE REQUIREMENTS FOR DEEP LEARNING


Deep learning workloads are a special kind of beast: all DL data is considered hot data, which makes it hard to employ any sort of tiered storage management solution. The SSDs normally used for hot data under conventional conditions simply cannot keep up with the millions, billions, or even trillions of metadata transfers an ML training model needs in order to classify an unknown input from only a limited number of examples.
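
To see why DL data tends to count as hot, here is a minimal Python sketch. It assumes a typical training loop that reshuffles and reads the full data set once per epoch; the function name `access_counts` is made up for this illustration and does not come from any product.

```python
import random
from collections import Counter

def access_counts(num_samples: int, epochs: int, seed: int = 0) -> Counter:
    """Count how often each sample index is read across shuffled epochs.

    Assumption: every epoch reads the whole data set once, in random order.
    """
    rng = random.Random(seed)
    counts = Counter()
    indices = list(range(num_samples))
    for _ in range(epochs):
        rng.shuffle(indices)    # a new random read order each epoch
        counts.update(indices)  # every sample is read exactly once per epoch
    return counts

counts = access_counts(num_samples=1000, epochs=5)
hottest = max(counts.values())
coldest = min(counts.values())
print(hottest, coldest)  # both equal the number of epochs
```

Under that assumption every sample is read equally often, so there is no rarely used subset that a tiering policy could push to slower media.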


Below are a few of the storage requirements needed to avoid the dreaded curse of dimensionality.


COST EFFICIENCY


Enormous AI data sets become an even bigger burden if they don’t fit within the budget set aside for storage. Anyone who has managed enterprise data for any length of time knows that highly scalable systems have always cost more per unit of capacity. The ultimate deep learning storage system must be both affordable and scalable to make sense.


PARALLEL ARCHITECTURE


To avoid the dreaded choke points that stunt a deep learning machine’s ability to learn, it’s essential for data sets to have a parallel-access architecture.
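
As a rough illustration of parallel access from the client side, the Python sketch below reads several data shards concurrently instead of one after another. The shard files, their names, and the helper functions are assumptions made up for this example; a real setup would point at a parallel file system or object store.

```python
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor

def read_shard(path: str) -> bytes:
    """Read one shard file fully into memory."""
    with open(path, "rb") as f:
        return f.read()

def read_shards_parallel(paths, max_workers: int = 8):
    """Read all shards concurrently; results come back in input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(read_shard, paths))

# Demo with four small temporary shard files (hypothetical data).
with tempfile.TemporaryDirectory() as tmp:
    paths = []
    for i in range(4):
        p = os.path.join(tmp, f"shard_{i}.bin")
        with open(p, "wb") as f:
            f.write(bytes([i]) * 1024)  # 1 KiB per shard
        paths.append(p)
    shards = read_shards_parallel(paths)
    total_bytes = sum(len(s) for s in shards)

print(total_bytes)
```

Threads only help when the storage behind them can serve the requests in parallel; if every read queues behind a single controller or disk, the choke point just moves there.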


DATA LOCALITY


While many organizations may opt to keep some of their data in the cloud, most of it should remain on-site in a data center. There are at least three reasons for this: regulatory compliance, cost efficiency, and performance. For this reason, on-site storage must rival the cost of keeping the data in the cloud.
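
One simple way to sanity-check the cost side is a break-even calculation. The Python sketch below is only a back-of-the-envelope model: it assumes a one-time on-site purchase plus flat monthly running costs, compared against a flat monthly cloud bill. The function name and all figures are placeholders, not real pricing.

```python
import math

def break_even_months(onprem_upfront: float,
                      onprem_monthly: float,
                      cloud_monthly: float):
    """Return the first month in which cumulative on-site cost is no higher
    than cumulative cloud cost, or None if on-site never catches up.

    Assumption: flat monthly costs, no growth, discounts, or egress fees.
    """
    saving = cloud_monthly - onprem_monthly
    if saving <= 0:
        return None  # cloud is cheaper (or equal) every month
    return math.ceil(onprem_upfront / saving)

# Placeholder figures for illustration only.
months = break_even_months(onprem_upfront=120_000,
                           onprem_monthly=2_000,
                           cloud_monthly=7_000)
print(months)  # 24
```

A real comparison would also need data growth, egress charges, staffing, and hardware refresh cycles, none of which this sketch models.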


HYBRID ARCHITECTURE


As touched on above, different types of data have unique performance requirements. Thus, storage solutions should offer the right mixture of storage technologies instead of an asymmetrical strategy that will eventually fail. It’s all about meeting ML storage performance and scalability requirements at the same time.


SOFTWARE-DEFINED STORAGE


Not all huge data sets are the same, especially in terms of DL and ML. While some of them can get by with the simplicity of pre-configured machines, others need hyper-scale data centers featuring purpose-built server architectures that are already in place. This is what makes software-defined storage solutions the best option.


Our X-AI Accelerated is an any-scale DL and ML solution that offers unmatched versatility for any organization’s needs. X-AI Accelerated was engineered from the ground up and optimized for “ingest, training, data transformations, replication, metadata, and small data transfers.” Not only that, but RAID Inc. meets all the aforementioned requirements with platforms such as the all-flash NVMe X2-AI/X4-AI or the X5-AI, which are hybrid flash and hard drive storage platforms.


Both the NVMe X2-AI/X4-AI and the X5-AI support parallel access to flash and deeply expandable HDD storage as well. Furthermore, the X-AI Accelerated storage platform permits one to scale out from only a few TBs to tens of PBs.

2021-11-26T03:34:49Z
Ariful Mondal (Wipro Ltd) - PeerSpot reviewer
ExpertModerator, Real User

@Shibu Babuchandran thank you for your detailed answers. This is very helpful. 

Shibu Babuchandran - PeerSpot reviewer
ExpertModerator, Real User

@Ariful Mondal,
Thanks for your response.
