Mobile applications are an incredibly fast growing market segment given the rapid consumer device adoption and the decline in use of traditional computer equipment. Whether it be for games, commercial applications, health applications, and so on, virtually every market segment today has a mobile friendly application. From an application development perspective, building mobile apps is very similar to building any other form of application. You have the same areas of concern, your presentation tier, business tier and data tier. While your screen real estate and development tools are different, delivering a great user experience is a shared goal across all applications. With effective caching strategies, your mobile applications can deliver the performance your users expect, scale massively, and reduce your overall cost. The AWS Mobile Hub is a console that provides an integrated experience for discovering, configuring, and accessing AWS cloud services for building, testing, and monitoring usage of mobile apps.

Interactivity is a cornerstone requirement for almost any modern game. Nothing frustrates players more than a slow or unresponsive game, and those rarely become successful. The requirement on performance is even more demanding for mobile multiplayer games, where an action that any one player takes needs to be shared with others in real time. Caching plays a crucial role in keeping the game smooth by providing sub-millisecond query response for frequently accessed data. It is also helpful to alleviate hot key issues when the same data is queried multiple times, such as “who are the current top 10 players by score?” To learn more about developing games on AWS click here .

HTTP sessions contain the user data exchanged between your site users and your web applications such as login information, shopping cart lists, previously viewed items and so on. Critical to providing great user experiences on your website is managing your HTTP sessions effectively by remembering your user’s preferences and providing rich user context. With modern application architectures, utilizing a centralized session management data store is the ideal solution for a number of reasons including providing, consistent user experiences across all web servers, better session durability when your fleet of web servers is elastic and higher availability when session data is replicated across cache servers. For more information, click here .

Media companies often deal with the need to transmit a large amount of static content to their customers with a constantly changing number of readers/viewers. An example is a video streaming service such as Netflix or Amazon Video, which streams a large amount of video content to the viewers. This is a perfect fit for a Content Delivery Network , where data is stored on a globally distributed set of caching servers. Another aspect of media applications is that load tends to be spikey and unpredictable. Imagine a blog on a website that a celebrity just tweeted about, or the website of a Football team during the Super Bowl. Such a large spike of demand to a small subset of content is a challenge to most databases since they are limited in their per-key throughput. Since memory has a much higher throughput than disk, a database cache would resolve the issue by redirecting the reads to the in memory cache.

Modern eCommerce applications are becoming more sophisticated, offering personalized shopping experience, including real-time recommendations based on a user’s data and shopping history. Those often also include looking at a user’s social network and providing the recommendation based on what her friends liked or purchased. While the amount of data needed to process is increasing, customer’s patience is not. Therefore, keeping the application performing in real-time is not a luxury, but a necessity; a well-executed caching strategy is a critical aspect of the application performance, and could be the difference between an application’s success or failure, between making a sale or losing a customer.

Social Media apps have taken the world by storm. Social networks like Facebook, Twitter, Instagram and Snapchat have a vast number of users who consume ever growing amount of content. When a user opens her feed, she expects to see her latest personalized content in near real-time. That is not static content since each user has different friends, images, interests, etc., exacerbating the engineering complexity needs of the underlying platform. Social Media apps are also very prone to spikes in usage around major entertainment, sports, and political events. Such spike resiliency and real-time performance are achieved through multiple layers of caching – including Content Delivery Network for the static content such as background images, session cache for keeping track of a user’s current session data, and database cache for keeping frequently accessed data such as latest news from closest friends and the last few images handy.

When delivering web content to your viewers, much of the latency involved with retrieving web assets such as images, html documents, video, etc. can be greatly reduced by caching those artifacts and eliminating disk reads and server load. Various web caching techniques can be employed both on the server and on the client side. Server side web caching typically involves utilizing a web proxy which retains web responses from the web servers it sits in front of, effectively reducing their load and latency. Client side web caching can include browser based caching which retains a cached version of the previously visited web content. For more information on Web Caching, click here .

Accessing data from memory is orders of magnitude faster than accessing data from disk or SSD, so leveraging data in cache has a lot of advantages. For many use-cases that do not require transactional data support or disk based durability, using an in-memory key-value store as a standalone database is a great way to build highly performant applications. In addition to speed, application benefits from high throughput at a cost-effective price point. Referenceable data such product groupings, category listings, profile information, and so on are great use cases for a general cache . For more information on general cache, click here .

An integrated cache is an in-memory layer that automatically caches frequently accessed data from the origin database. Most commonly, the underlying database will utilize the cache to serve the response to the inbound database request given the data is resident in the cache. This dramatically increases the performance of the database by lowering the request latency and reducing CPU and memory utilization on the database engine. An important characteristic of an integrated cache is that the data cached is consistent with the data stored on disk by the database engine.

Home›
Caching

Caching Overview

Caching helps applications perform dramatically faster and cost significantly less at scale

Get Started with Caching

What is Caching?

In computing, a cache is a high-speed data storage layer which stores a subset of data, typically transient in nature, so that future requests for that data are served up faster than is possible by accessing the data’s primary storage location. Caching allows you to efficiently reuse previously retrieved or computed data.

How does Caching work?

The data in a cache is generally stored in fast access hardware such as RAM (Random-access memory) and may also be used in correlation with a software component. A cache's primary purpose is to increase data retrieval performance by reducing the need to access the underlying slower storage layer.

Trading off capacity for speed, a cache typically stores a subset of data transiently, in contrast to databases whose data is usually complete and durable.