Shows you how to create and manage Amazon Redshift clusters.  HTML | PDF | Kindle

Explains how to design, build, query, and maintain the databases that make up your data warehouse. It also includes syntax for Amazon Redshift SQL commands and functions.  HTML | PDF

Describes all the API operations for Amazon Redshift in detail. It also provides sample requests, responses, and errors for the supported web services protocols.  HTML | PDF

Describes the command line interface for Amazon Redshift in detail. It provides basic syntax, options, and usage examples for each command.  HTML

AWS webcast covering best practices for loading data and optimizing query performance in Amazon Redshift.  Watch Video Part 1 | Watch Video Part 2

Load your data efficiently using COPY commands, bulk inserts, and staging tables.  Learn more »

Design your database for high query performance, low storage and memory requirements, and minimal I/O operations.  Learn more »

Tune your queries for maximum performance using these tips and tactics.  Learn more »

SQL scripts for running diagnostics on your Amazon Redshift cluster using system tables.  Download here »

List of Scripts:

  • top_queries.sql - returns top 50 most time-consuming statements in the last 7 days
  • perf_alerts.sql - returns top occurrences of alerts, join with table scans
  • filter_used.sql - returns filter applied to tables on scans. To aid in choosing sortkey
  • commit_stats.sql - shows information on consumption of cluster resources through COMMIT statements
  • current_session_info.sql - shows information about sessions with currently running queries
  • missing_table_stats.sql - shows EXPLAIN plans which flagged "missing statistics" on the underlying tables
  • queuing_queries.sql - lists queries which are waiting on a WLM Query Slot
  • table_info.sql - Return Table storage information (size, skew, etc)

SQL Views for managing your Amazon Redshift Cluster and generating Schema DDL. All Views assume you have a schema called Admin.  Download here »

List of views:

  • v_check_data_distribution.sql - get data distribution across slices
  • v_constraint_dependency.sql - get the the foreign key constraints between tables
  • v_generate_group_ddl.sql - get the DDL for a group
  • v_generate_schema_ddl.sql - get the DDL for schemas
  • v_generate_tbl_ddl.sql - get the DDL for a table, contains distkey, sortkey, & constraints
  • v_generate_unload_copy_cmd.sql - generate unload and copy commands for an object
  • v_generate_user_object_permissions.sql - get the DDL for a user’s permissions to tables and views
  • v_generate_view_ddl.sql - get the DDL for a view
  • v_get_obj_priv_by_user.sql - get the table/views that a user has access to
  • v_get_schema_priv_by_user.sql - get the schema that a user has access to
  • v_get_tbl_priv_by_user.sql - get the tables that a user has access to
  • v_get_users_in_group.sql - get all users in a group
  • v_get_view_priv_by_user.sql - get the views that a user has access to
  • v_object_dependency.sql - merge the different dependency views together
  • v_space_used_per_tbl.sql - pull space used per table
  • v_view_dependency.sql - get names of views that are dependent on other tables/views
  • v_check_transaction_locks.sql - get info on locks held by open transactions
  • v_check_wlm_query_time.sql - get WLM wait time & execution time for past 7 days
  • v_check_wlm_query_trend_daily.sql - get WLM query Count, wait time, execution time by day
  • v_check_wlm_query_trend_hourly.sql - get WLM query count, wait time, execution time by hour
  • v_generate_cancel_query.sql - get cancel query
  • v_get_cluster_restart_ts.sql - get datetime when cluster was recently restarted
  • v_get_vacuum_details.sql - get vacuum details

Determine optimal compression for your Amazon Redshift data with this utility.  Download here »

Easily migrate data between two Amazon Redshift tables that may be in different databases, clusters, or regions. All exported data is encrypted with AWS Key Management Service (KMS). You can use this utility to supplement AWS Data Pipeline to routinely move data between systems, such as between production and test, or for maintaining read-only copies on other clusters.  Download here »

Common User-Defined Functions (UDF). A collection of helpful UDFs for everyone to use. Contributed by Periscope.  Learn More » | Download here »

Framework for developing UDFs. To make development easier, Periscope has built a framework for managing and testing UDFs. Contributed by Periscope.  Download here »

re:Dash. A free open source data collaboration and visualization platform built to allow fast and easy access to billions of records. Features include a schema browser, improved scheduling, autocomplete, and ability to write Python queries. Contributed by  Download here »

EverythingMe's Amazon Redshift Admin Dashboard. Monitor running queries, WLM queues, and your tables/schemas with this user-friendly tool. Contributed by EverythingMe.  Download here »

Yelp’s ETL tool for Amazon Redshift. Use Yelp's Mycroft to automatically load data into Amazon Redshift. Read this blog post for more information.  Download here »

Spark-Redshift Library. Use this library to load data from Amazon Redshift into an Apache Spark SQL DataFrames, and write data from Apache Spark back into Amazon Redshift. Good example of using AVRO ingestion with Amazon Redshift. Contributed by Swiftkey.  Download here »

We have a rich set of blog articles that provide use case and best practices guidance to help you get the most out of Amazon Redshift. Access our full list of blog articles here.