AWS Storage Blog
Tag: Amazon Simple Storage Service (Amazon S3)
KKCompany saves 93% on data storage by migrating music streaming services to AWS
KKCompany Technologies (KKCompany) is an AI multimedia technology group based in Taipei, with offices across Asia. Its flagship music streaming platform, KKBOX, serves over 12 million users worldwide. As its user base and music catalog rapidly grew, KKCompany faced significant challenges with its on-premises storage infrastructure. These included costly hardware scaling, constrained rack space, and […]
Rapid monitoring of Amazon S3 bucket policy changes in AWS environments
Enterprises today manage hundreds of access policies across their expanding cloud environments. As they scale their cloud infrastructure, maintaining visibility into policy changes becomes increasingly important. Central security teams can enhance their cloud governance by implementing comprehensive monitoring of access policies. This approach supports compliance efforts while strengthening overall resource security across their environments. Users […]
How Uniphore achieved 30% cost savings by modernizing Windows servers on AWS
Uniphore is the first built-for-scale, AI-native company that infuses AI into every part of the enterprise experience. Uniphore’s enterprise-class multimodal AI and data platform unifies all elements of voice, video, text, and data by using generative AI, Knowledge AI, Emotion AI, and workflow automation together as a trusted co-pilot. Uniphore was running one of their core […]
Building multi-writer applications on Amazon S3 using native controls
Organizations managing data lakes often require additional infrastructure to support concurrent writes from multiple applications. Traditional approaches require external systems for coordination, adding infrastructure overhead, costs, and potential performance bottlenecks. Developers typically implement client-side locking mechanisms using databases or dedicated lock services, resulting in complex multi-step workflows. Amazon S3 offers capabilities to address these concurrent […]
Architecting scalable checkpoint storage for large-scale ML training on AWS
The exponential growth in size and complexity of foundation models (FMs) has created unprecedented infrastructure demands across compute, networking, and storage resources. Storage systems, in particular, face intense requirements for throughput, latency, and capacity. In machine learning (ML) model training, these storage demands are particularly evident in checkpointing—a critical reliability mechanism that periodically saves and […]
Query Amazon S3 Tables from open source Trino using Apache Iceberg REST endpoint
Organizations are increasingly focused on addressing the growing challenge of managing and analyzing vast data volumes, while making sure that their data teams have timely access to this data to enable rapid insights and decision-making. Data analysts and scientists need self-service analytics capabilities to build and maintain data products, often involving complex transformations and frequent […]
Automate data transfers and migrations with AWS DataSync and Terraform
In today’s data-driven world, organizations face the challenge of efficiently managing and consolidating vast amounts of information from diverse sources. Whether it’s for analytics, machine learning (ML), or other business-critical applications, the ability to seamlessly transfer and organize data is crucial. However, this process can be complex, time-consuming, and prone to errors when done manually. […]
University of California Irvine backs up petabytes of research data to AWS
Editor’s note: AWS is not responsible for UCI’s public GitHub repo linked in this post, which has been provided so that interested parties can explore the solution described in this post in more detail. The University of California, Irvine (UCI) is a public land-grant research university with troves of research data stored on servers in […]
How to consume tabular data from Amazon S3 Tables for insights and business reporting
When was the last time you found yourself looking at rows and rows of data in a spreadsheet, struggling to interpret them and draw conclusions? Many analysts and engineers experience the same challenge every day. Whether it’s analyzing sales trends, monitoring operational metrics, or understanding customer behavior, the challenge lies not just in interpreting […]
Automating paper-to-electronic healthcare claims processing with AWS
Health plans process billions of claims electronically each year. The Council for Affordable Quality Healthcare (CAQH) estimates that approximately 10% of claims still arrive as paper documents, accounting for hundreds of millions of paper submissions annually in the U.S. These paper claims create processing bottlenecks and consume a disproportionate share of operational costs and resources, with […]