Securing AI data supply chains with AltaStata Fortified Data Lakes

AltaStata protects data in multi-cloud environments and AI/ML ecosystems from malicious users and accidental misconfigurations without compromising utility or speed.

Overview
AltaStata Fortified Data Lake

An AI platform provider to a network of hospitals provides the infrastructure for secure data collaboration and processing. With AltaStata, hospitals upload encrypted datasets (e.g., genomic data, MRI scans, clinical patient records) to the AltaStata Fortified Data Lake, hosted within the Ceph storage environment. This setup enables model training on NVIDIA GPUs and fosters collaborative research, all while maintaining strict protection against data breaches and poisoning. Platform administrators, who also manage the storage infrastructure, are not considered insider threats, as they cannot view or alter the encrypted data. Meanwhile, hospital administrators retain full control over their own data within the AltaStata Fortified Data Lake, with the ability to grant or revoke access permissions for personnel and applications.

AltaStata addresses the critical need for robust data security for AI. Our patented Fortified Data Lake solution ensures the AI supply chain remains protected throughout its lifecycle, from AI model training to data processing with AI in real-time.

By seamlessly integrating with various data sources and platforms, AltaStata secures OpenShift AI without requiring code changes — keeping data compliant, breach-resistant, and tamper-proof.

Secure, Multi-Party AI Research Collaboration Utilizing Remote Infrastructure

Using AltaStata’s Fortified Data Lake, the AI platform provider provides a protected environment that enables hospitals, clinical research organizations, and other data partners to access NVIDIA GPUs and scalable storage—eliminating the need for their own data centers, power management, or specialized hardware. 

End-to-End Data Privacy and Integrity

Hospitals upload encrypted datasets—such as genomic data, MRI scans, and clinical records—to the Fortified Data Lake at the AI platform provider and use OpenShift AI for training and running models. Data remains encrypted at all times, ensuring that even platform provider administrators cannot view or tamper with datasets or models.

Full Data Control at Third-Party Facilities

Hospital administrators retain full control over their data—even when stored within the AI platform provider facility. They can grant or revoke access permissions at the file level for other research participants, platform staff, and applications running on OpenShift. 

Seamless Integration with Leading Data Science Platforms

AltaStata’s Fortified Data Lake seamlessly supports Databricks, PyTorch, TensorFlow, and other platforms—enabling the processing of vast amounts of encrypted data without compromising training or inference speed.

Advanced Data Compression

AltaStata’s data compression technology reduces storage costs by up to 63% and boosts processing speeds by up to 3×.

Get started with OpenShift

A container platform to build, modernize, and deploy applications at scale.

Try itDeployment options
ResourcesFAQs

Does AltaStata Fortified Data Lake have a pre-built Jupyter notebook?

Yes. AltaStata Fortified Data Lake has a Jupyter notebook ready to get you started quickly.

How does AltaStata Fortified Data Lake protect data?

AltaStata Fortified Data Lake protects data by enforcing user access at the file level as opposed to accessing the entire storage container and end-to-end encryption, both in transit and at rest.

How does AltaStata Fortified Data Lake reduce storage costs?

AltaStata Fortified Data Lake uses compression to reduce storage requirements and costs. This also provides optimized performance through faster downloads.
Red Hat logoLinkedInYouTubeFacebookTwitter

Platforms

Products & services

Try, buy, sell

Help

About Red Hat Ecosystem Catalog

The Red Hat Ecosystem Catalog is the official source for discovering and learning more about the Red Hat Ecosystem of both Red Hat and certified third-party products and services.

We’re the world’s leading provider of enterprise open source solutions—including Linux, cloud, container, and Kubernetes. We deliver hardened solutions that make it easier for enterprises to work across platforms and environments, from the core datacenter to the network edge.