AWS Big Data Blog

Amazon Redshift UDF repository on AWSLabs

Christopher Crosbie is a Healthcare and Life Science Solutions Architect with Amazon Web Services

Zach Christopherson, an Amazon Redshift Database Engineer, contributed to this post

Did you ever have a need for complex string parsing in Amazon Redshift and wish you could simply add f_parse_url_query_string(url) to your SQL query? Have you ever tried to weigh which would be less effort: writing a complicated regex in SQL to parse a query string or pulling the data into Python simply to take advantage of packages like urlparse? When was the last time you were developing a report and wished there was an easy way to get the next business day from a query result or even get the next business day according to your own company’s calendar?

These scenarios represent just a few of the Python UDF functions that AWS has released as part of the initial AWS Labs Amazon Redshift UDF repository:parsing, date functions, and more! No longer are you constrained to the world of SQL within your Amazon Redshift data warehouse. Python UDFs allow you to extend Amazon Redshift SQL with Python’s rich library of packages.

We encourage all Python UDF and Amazon Redshift developers to take a peek at what’s available today. We also encourage you to submit your own pull requests to show off what you can do with Amazon Redshift and Python UDF capabilities.

If you have questions or suggestions, please leave a comment below.

If you have questions or suggestions, please leave a comment below.



Introduction to Python UDFs in Amazon Redshift