AWS Storage Blog

Category: Learning Levels

How SkyWatch built its satellite imagery solution using AWS Lambda and Amazon EFS

SkyWatch is on a mission to democratize remote sensing data through a simple user experience. Every day, trillions of pixels of Earth observation imagery are captured by satellites orbiting our planet. New applications for this data are developed every week, with demand increasing across many industries. Examples include commercial applications, such as construction, finance, and […]

AWS Elastic Disaster Recovery

Creating a scalable disaster recovery plan with AWS Elastic Disaster Recovery

IT disruptions can occur for many reasons, including human error, weather, or a cyber attack. Enterprises need to have a solution in place that will get them up and running quickly with minimal downtime. When orchestrating disaster recovery at scale, it is important to automate recovery plans as much as possible. This allows for a […]

AWS DataSync Featured Image 2020

Using AWS DataSync to move data from Hadoop to Amazon S3

You want to leverage cloud scalability, increase cost efficiency by paying only for utilized storage, decouple big data storage from processing, and increase capabilities for data analytics and machine learning using AWS. But how do you move your Hadoop cluster? To accelerate this transition, AWS DataSync recently launched support for moving data between Hadoop Distributed […]

Amazon EBS

How Cohesity uses Amazon EBS direct APIs to accelerate application backup and recovery times

When backing up applications, AWS Backup and Restore Partners seek methods that minimize complexity and reduce costs for their customers. Most backup applications protect Amazon Elastic Block Store (EBS) volumes using EBS snapshots as a part of Amazon EC2 protection feature. For backups with long term retention, backup applications offer additional streaming backup capabilities that […]

AWS Backup 2021 blog image

Automate and centrally manage data protection for Amazon S3 with AWS Backup

Customers globally, especially in regulated industries, require centralized protection and demonstrable compliance for their application data. Centralized data protection and enhanced visibility across backup operations can reduce the risks of costly disasters and accidents, improve business continuity, and simplify the auditing process. With AWS Backup for Amazon S3 now being generally available, you can centralize […]

Site-Merch_Amazon-FSx-for-NetApp_Blog

Caching data using Amazon FSx for NetApp ONTAP

Accessing network attached storage (NAS) over long distances can introduce latency that can potentially impact business processes, slow down engineering jobs, and increase costs. Often only a small percent of the dataset needs to be accessed at any given time, allowing caching data locally to solve these challenges without replicating the full dataset. This is […]

A gene-editing prediction engine with iterative learning cycles built on AWS

NRGene develops cutting-edge genomic analytics products that are reshaping agriculture worldwide. Among our customers are some of the biggest and most sophisticated companies in seed-development, food and beverages, paper, rubber, cannabis, and more. In the middle of 2020, NRGene joined a consortium of companies and academic institutions to build the best-in-class gene-editing prediction platform to […]

Amazon EBS

Restoring on-premises applications to AWS from Amazon EBS Snapshots created by EBS direct APIs

Incremental, point-in-time copies of data can be a secure and cost effective tool anchoring disaster recovery, data migration, and compliance solutions. Amazon EBS Snapshots are how EBS customers leverage point-in-time copies of their data stored on AWS, and you can use Snapshots on premises too. In December 2019, AWS introduced Amazon EBS direct APIs, providing […]

Amazon S3

MemQ by Pinterest: An efficient, scalable, cloud-native publish/subscribe system

The Logging Platform at Pinterest powers all data ingestion and transportation at Pinterest. At the heart of the Pinterest Logging Platform are distributed pub/sub systems that help our customers transport, buffer, and consume data asynchronously. Pub/sub messaging, is a form of asynchronous service-to-service communication used in serverless and microservices architectures. In a pub/sub model, any […]

AWS DataSync Featured Image 2020

Simplify data migrations using an AWS DataSync agent on Linux KVM Hypervisor

UPDATE (1/19/2023): Some readers who followed the steps in this blog post to deploy an AWS DataSync agent on the KVM platform ran into issues, either because the hypervisor host does not support virtualization or it is not enabled on the platform. Therefore, I have added the steps to verify whether the hypervisor host supports […]