Compute | AWS Storage Blog

Building self-managed RAG applications with Amazon EKS and Amazon S3 Vectors

Retrieval-Augmented Generation (RAG) is a technique that optimizes large language model (LLM) outputs by referencing authoritative knowledge bases outside of the model’s training data before generating responses. This addresses common limitations of traditional LLMs, such as outdated knowledge, hallucinated facts, and misinterpreted terminology. Organizations can implement RAG to enhance their generative AI applications with current, […]

Simplify cross-account storage management with Amazon EFS and Amazon EKS

Organizations are increasingly adopting a multi-account Amazon Web Services (AWS) strategy to achieve enhanced security, governance, and operational efficiency at scale. Implementing separate accounts for production and non-production environments enables enterprises to group workloads based on business purpose, apply distinct security postures by environments, restrict access to sensitive data, and streamline cost management. You can […]

Boost testing confidence with automated Amazon RDS data replication from production to non-production environment

Automated testing in a pre-production environment is crucial for verifying the reliability and stability of software releases in any organization. However, for many applications, writing and executing these tests necessitates the use of data from production system. This production data is valuable for testing and development because it represents real-world scenarios, usage patterns, and edge […]

Mountpoint for Amazon S3 CSI driver v2: Accelerated performance and improved resource usage for Kubernetes workloads

Amazon S3 is the best place to build data lakes because of its durability, availability, scalability, and security. In 2023, we introduced Mountpoint for Amazon S3, an open source file client that allows Linux-based applications to access S3 objects through a file API. Shortly after, we took this one step further with the Mountpoint for […]

Backup Amazon Elastic Kubernetes Service (EKS) resources using NetApp Trident Protect

Kubernetes is an open source container orchestration platform that automates the deployment, scaling, and management of containerized applications. For users running applications and workloads on Kubernetes, protecting resources and data from accidental deletion or hardware failures is crucial for maintaining business continuity and meeting compliance requirements. While Kubernetes provides high availability through its control plane […]

Faster threat detection at scale: Real-time cybersecurity graph analytics with PuppyGraph and Amazon S3 Tables

Modern cybersecurity teams are facing unprecedented challenges in data analysis by the scale, complexity, and velocity of data. Cloud environments continuously generate massive amounts information in form of access logs, configuration changes, alerts, and telemetry. Traditional analysis methods of looking at these data points in isolation can’t effectively detect threats such as lateral movement and […]

KKCompany saves 93% on data storage by migrating music streaming services to AWS

KKCompany Technologies (KKCompany) is an AI multimedia technology group based in Taipei, with offices across Asia. Its flagship music streaming platform, KKBOX, serves over 12 million users worldwide. As its user base and music catalog rapidly grew, KKCompany faced significant challenges with its on-premises storage infrastructure. These included costly hardware scaling, constrained rack space, and […]

How Uniphore achieved 30% cost savings by modernizing Windows servers on AWS

Uniphore is the first built-for-scale, AI-native company that infuses AI into every part of the enterprise experience. Uniphore’s enterprise-class multimodal AI and data platform unifies all elements of voice, video, text, and data by using generative AI, Knowledge AI, Emotion AI, and workflow automation together as trusted co-pilot. Uniphore was running one of their core […]

Protect on-premises VMware infrastructure with NetApp BlueXP Disaster Recovery, Amazon Elastic VMware Service, and Amazon FSx for NetApp ONTAP

Your VMware workloads contain critical data that drives business decisions and powers your operations. Maintaining the availability and resilience of your data is a top priority where potential disasters such as ransomware threats, catastrophic hardware failures, and natural calamities, can lead to costly downtime and data loss. To address these challenges, all businesses require strategic […]

Query Amazon S3 Tables from open source Trino using Apache Iceberg REST endpoint

Organizations are increasingly focused on addressing the growing challenge of managing and analyzing vast data volumes, while making sure that their data teams have timely access to this data to enable rapid insights and decision-making. Data analysts and scientists need self-service analytics capabilities to build and maintain data products, often involving complex transformations and frequent […]

AWS Storage Blog

Category: Compute