AWS Storage Blog
Tag: data lake
AWS re:Invent recap: Break down data silos with a data lake on Amazon S3
When you have datasets in different places controlled by different groups, you are dealing with data silos, which inherently obscure data. In contrast, a data lake can serve as your central repository of data regardless of source or format. At re:Invent 2020, we had the opportunity to present a session on building a data lake […]
NEW Amazon S3 sessions at AWS re:Invent are coming on Jan 12-14
We are into week two of AWS re:Invent, and a lot of the Amazon S3 sessions we posted about are now available on-demand, with a few more to be broadcast over the next two weeks. Hopefully, you also heard about some of the major announcements for Amazon S3, including strong read-after-write consistency, replication to multiple […]
How Bristol Myers Squibb uses Amazon S3 and AWS Storage Gateway to manage scientific data
Bristol Myers Squibb develops and discovers innovative medicines to help treat, manage, and cure serious diseases. We use many AWS services to help us manage our scientific data, lab workflows, and large computations for analyzing molecular, cellular, and clinical datasets. Genomics and clinical data, generated in Bristol Myers Squibb labs, is growing at an exponential […]
Migrate HDFS files to an Amazon S3 data lake with AWS Snowball Edge
The need to store newly connected data grows as the sources of data increase. Enterprise customers use Hadoop Distributed File System (HDFS) as their data lake storage repository for on-premises Hadoop applications. Customers are migrating their data lakes to AWS for a more secure, scalable, agile, and cost-effective solution. For HDFS migrations where high-speed transfer […]
Free AWS Loft Events: Attend “AWS Storage Days” in San Francisco or NYC
As we gear up for AWS re:Invent 2019, December 2 – 6, we want to ensure you are up to speed on the full portfolio of AWS storage services. In San Francisco on September 10 – 11 and in NYC on September 24 – 25, we will be conducting ‘AWS Storage Days’ at the AWS Loft locations. These […]
Build a data lake on Amazon S3: Recent customer case studies
Amazon Simple Storage Service (S3) is the largest and most performant object storage service for structured and unstructured data and the storage service of choice to build a data lake. With Amazon S3, you can cost-effectively build and scale a data lake of any size in a secure environment where data is protected by 99.999999999% […]
New on the APN Blog: Building a Data Lake Foundation for Salesforce in AWS
Over on the AWS Partner Network Blog, a recent blog post caught my eye and I thought it was worth sharing with our growing storage audience. The post, Building a Data Lake Foundation for Salesforce in AWS, is written by Simon Ejsing, Director of Analytics at FinancialForce. Simon’s post outlines their approach to unlocking the potential of […]