AWS Open Source Blog

Enabling Amazon Simple Storage Service (Amazon S3) Access Points in Apache Hadoop S3A

We’re pleased to announce that Amazon Simple Storage Service (Amazon S3) Access Points can now be used in Apache Hadoop 3.3.2 and any framework consuming the S3A connector or relying on the Hadoop Distributed File System (such as Apache Spark, Apache Hive, Apache Pig, and Apache Flink). In this post, we review what access point […]

Managing Spinnaker using Spinnaker Operator in Amazon EKS

Overview: Spinnaker enables developers to focus on writing code and deploying their applications without having to worry about the underlying infrastructure. The development team can focus on application development and leave ops provisioning to Spinnaker for automating reinforcement of business and regulatory requirements. Spinnaker, a cloud-based open source continuous delivery platform built originally by Netflix […]

Building a multi-tenant Kubeflow environment on Amazon EKS using Amazon Cognito and ADFS

NOTE: Since this blog post was written, much about Kubeflow has changed. While we are leaving it up for historical reference, more accurate information about Kubeflow on AWS can be found here. The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable, and scalable. The project’s goal is […]

Build, train, and deploy Amazon Fraud Detector models using the open source Python SDK

Companies providing digital services are looking for ways to effectively identify fraudulent activities, such as online payment fraud and fake account creation. Amazon Fraud Detector is a fully managed service that uses machine learning (ML) and builds on 20 years of fraud detection expertise from Amazon Web Services (AWS) and Amazon.com to automatically identify potentially […]

Amplify Bash: Get Started Contributing to AWS Amplify Open Source

Each year, more and more developers join open source communities and work collaboratively to create software solutions. In Matt Asay’s article, The Real Number of Open Source Developers, he states that we should see 27.7 million developers by 2023. More importantly, he points out, the pace of first-time contributors is astonishing: in 2019, there […]

Learn Amazon Simple Storage Service transfer configuration with Syne Tune

The object storage service Amazon Simple Storage Service (Amazon S3) is a foundational storage building block powering a variety of workloads, from asset backup and serving to analytics and machine learning. In this blog post, we describe how to search and find a scenario-specific optimized S3 download configuration in minutes using the open source distributed […]

Using Apollo Server on AWS Lambda with Amazon EventBridge for real-time, event-driven streaming

GraphQL is an application-level query language that helps clients and servers communicate by establishing a common protocol for queries. It represents an alternative to the REST style: unlike REST, GraphQL gives the client, not the server, the power to define what kind of data will be included in the response to its query. GraphQL allows […]

Sustainability with Rust

Rust is a programming language implemented as a set of open source projects. It combines the performance and resource efficiency of systems programming languages like C with the memory safety of languages like Java. Rust started in 2006 as a personal project of Graydon Hoare before becoming a research project at Mozilla in 2010. Rust […]

Introducing AWS Cloud Map MCS Controller for K8s

Modern applications built using microservices patterns are distributed and dynamic by nature. Deploying these applications to Kubernetes clusters tightly couples the application and cluster together. Increasingly, customers are asking for the ability to deploy applications across clusters to allow for easier upgrades and migrations and to break down isolation boundaries. However, bridging the gap between […]

New cluster-mode support in redis-py

What is Redis? Redis is an incredibly popular open source (BSD licensed) in-memory data store, generally used as a database, cache, or message broker. Developers regularly tout Redis as the most loved database, and you’ve used Redis whenever you’ve been on Twitter, GitHub, Instagram, Airbnb, and many other products. Redis Clusters enable you to scale […]