
Amazon SageMaker

Amazon SageMaker is a fully managed platform that enables developers and data scientists to quickly and easily build, train, and deploy machine learning models at any scale. Amazon SageMaker removes the barriers and complexity that typically slow down developers who want to use machine learning. The service includes modules that can be used together or independently to build, train, and deploy your machine learning models.


Bayesian filtering Factor Analysis VBfFA (Free trial)

Latest Version:
0.1.0
Variational Bayesian filtering Factor Analysis (VBfFA) to estimate time-varying statistical factors of a large set of related time-series

    Product Overview

    The variational Bayesian filtering factor analysis (VBfFA) algorithm/model is a filter with dimension reduction, or rank reduction, that extracts a number of ever-evolving unobserved common factors, or signals from common sources, underlying and influencing a large number of related time-series. Relevant examples of time-series include: economic indicators in a nation, region, or international economic sector; prices of assets in a national, regional, or global marketplace; performance scores of a business marketing campaign; and signals from an array of radar or sonar sensors tracking several moving targets. By applying (variational) Bayesian filtering, instead of the traditional moving/rolling data windows of frequentist time-dependent analysis, the VBfFA algorithm can update its predictions with only the newly arrived time-series data point, rather than all data points in a window, and can detect underlying changes in the time-series early.

    Key Data

    Type
    Algorithm
    Fulfillment Methods
    Amazon SageMaker

    Highlights

    • WHY a Bayesian filter? To estimate a “time-varying” or “time-dependent” statistic of a time-series, the traditional method uses a “moving/rolling data window” and/or “exponentially decayed time weights”. A straightforward and natural alternative is the Bayesian framework: at each moment in time, use the last estimate as the prior; the conditional distribution of the estimate (given a statistical model for it) as the likelihood; and the newly arrived time-series data point as the observation. The resulting posterior is the new estimate. In the time domain, such a Bayesian formulation is a filter.
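    The filtering recipe in this bullet can be sketched numerically. The code below is an illustrative scalar Gaussian filter, not the VBfFA implementation; the function name and all values are hypothetical.

```python
# Illustrative scalar Gaussian Bayesian filter (NOT the VBfFA code):
# prior = last estimate, likelihood = Gaussian observation model,
# posterior = new estimate, updated one data point at a time.

def bayes_filter_step(prior_mean, prior_var, obs, obs_var):
    gain = prior_var / (prior_var + obs_var)   # weight given to the new data point
    post_mean = prior_mean + gain * (obs - prior_mean)
    post_var = (1.0 - gain) * prior_var
    return post_mean, post_var

mean, var = 0.0, 1.0                  # initial prior
for y in [0.9, 1.1, 1.0, 2.9, 3.1]:  # the level shifts mid-stream
    mean, var = bayes_filter_step(mean, var, y, obs_var=0.5)
    var += 0.1                        # process noise keeps the estimate time-varying
```

    Because each update consumes only the newest data point, the filter tracks the level shift without reprocessing a rolling window, which is exactly the advantage described above.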

    • WHY variational Bayes? The Bayesian filtering framework for estimating time-dependent statistics of time-series is straightforward and simple. The need for joint and conditional probability distribution functions, however, makes the actual estimation complicated, difficult, or even intractable. Discretized approximations, e.g. numerical particle filters, are computation-intensive and prone to cumulative errors in the numerical distributions. Variational Bayes (VB) is an analytical approximation: although its derivation is tedious, a VB filter is fast, as long as the assumptions on the distributions are appropriate.

    • WHAT next? In addition to being a stand-alone filtering package for time-varying factor analysis on multiple time-series, the VBfFA algorithm will be employed as the underlying factor-analysis engine of other machine learning packages previously introduced here by i4cast LLC: LMDFM (long memory dynamic factor model); YWpcAR (Yule-Walker-PCA autoregressive model); LMVAR (long memory vector autoregressive model); and CTVARF (continuously trained vector autoregressive forecast model). We will also introduce a published general-purpose multivariate variational Bayesian filter (VBF) algorithm.


    Pricing Information

    Use this tool to estimate the software and infrastructure costs based on your configuration choices. Your actual usage and costs might differ from this estimate; they will be reflected on your monthly AWS billing reports.

    Contact us to request contract pricing for this product.


    Estimating your costs

    Choose your region and launch option to see the pricing details. Then, modify the estimated price by choosing different instance types.


    Software Pricing

    Algorithm Training: $0.10/hr

    running on ml.m5.xlarge

    Model Realtime Inference: $0.10/hr

    running on ml.m5.xlarge

    Model Batch Transform: $0.10/hr

    running on ml.m5.xlarge

    Infrastructure Pricing

    With Amazon SageMaker, you pay only for what you use. Training and inference are billed by the second, with no minimum fees and no upfront commitments. Pricing within Amazon SageMaker is broken down by on-demand ML instances, ML storage, and fees for data processing in notebooks and inference instances.
    Learn more about SageMaker pricing

    SageMaker Algorithm Training: $0.23/host/hr

    running on ml.m5.xlarge

    SageMaker Realtime Inference: $0.23/host/hr

    running on ml.m5.xlarge

    SageMaker Batch Transform: $0.23/host/hr

    running on ml.m5.xlarge

    About Free trial

    Try this product for 120 days. There will be no software charges, but AWS infrastructure charges still apply. Free trials automatically convert to a paid subscription upon expiration.

    Algorithm Training

    For algorithm training in Amazon SageMaker, the software is priced per hour, and the rate can vary by instance type. Additional infrastructure costs, taxes, or fees may apply.
    Instance Type        Algorithm/hr
    ml.m4.4xlarge        $0.10
    ml.c5n.18xlarge      $0.10
    ml.g4dn.4xlarge      $0.10
    ml.m5.4xlarge        $0.10
    ml.m4.16xlarge       $0.10
    ml.m5.2xlarge        $0.10
    ml.p3.16xlarge       $0.10
    ml.g5.xlarge         $0.10
    ml.g5.12xlarge       $0.10
    ml.g4dn.2xlarge      $0.10
    ml.g5.4xlarge        $0.10
    ml.m4.2xlarge        $0.10
    ml.c5.2xlarge        $0.10
    ml.c4.2xlarge        $0.10
    ml.g4dn.12xlarge     $0.10
    ml.p4d.24xlarge      $0.10
    ml.m4.10xlarge       $0.10
    ml.m5.24xlarge       $0.10
    ml.g4dn.xlarge       $0.10
    ml.g5.48xlarge       $0.10
    ml.g4dn.16xlarge     $0.10
    ml.m5.12xlarge       $0.10
    ml.p3dn.24xlarge     $0.10
    ml.p2.16xlarge       $0.10
    ml.c4.4xlarge        $0.10
    ml.g5.8xlarge        $0.10
    ml.m5.xlarge         $0.10  (Vendor Recommended)
    ml.c5.9xlarge        $0.10
    ml.g5.16xlarge       $0.10
    ml.m4.xlarge         $0.10
    ml.c5.4xlarge        $0.10
    ml.p3.8xlarge        $0.10
    ml.c4.8xlarge        $0.10
    ml.g4dn.8xlarge      $0.10
    ml.p2.8xlarge        $0.10
    ml.c5n.2xlarge       $0.10
    ml.c5n.9xlarge       $0.10
    ml.c5.18xlarge       $0.10
    ml.g5.2xlarge        $0.10
    ml.c5n.4xlarge       $0.10
    ml.g5.24xlarge       $0.10

    Usage Information

    Training

    The VBfFA algorithm is a factor analysis filter to extract a number of ever-evolving unobserved common factors, or signals from common sources, underlying and influencing a large number of related time-series data.

    The VBfFA algorithm takes as input multiple time-series contained in a CSV (comma-separated values) data table, supplied as a CSV text-string or a CSV text-file. Each row of the data table holds the values of an individual time-series (TS). Each column holds the values of all time-series at a specific moment in time.
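    The row/column layout just described can be sketched with the standard library; the series labels and dates below are made-up examples, not required values.

```python
# Build a toy version of the documented input table: one time-series
# per row (row header = series label), one time-stamp per column.
import csv
import io

rows = [
    ["ts_label", "2024-01-02", "2024-01-03", "2024-01-04"],  # column headers: time-stamps
    ["ts1id", "1.01", "1.03", "0.98"],                       # values of one time-series
    ["ts2id", "2.10", "2.07", "2.15"],
]
buf = io.StringIO()
csv.writer(buf).writerows(rows)
csv_text = buf.getvalue()  # usable as a CSV text-string, or written to a CSV file
```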

    Metrics

    Name                 Regex
    avg_fitvar           avg_fitvar=(.*?);
    avg_aggvar           avg_aggvar=(.*?);
    avg_zscore           avg_zscore=(.*?);
    avg_bias             avg_bias=(.*?);
    avg_loglik           avg_loglik=(.*?);
    avg_qstat            avg_qstat=(.*?);
    diff_avg_fitvar      diff_avg_fitvar=(.*?);
    diff_avg_aggvar      diff_avg_aggvar=(.*?);
    diff_avg_zscore      diff_avg_zscore=(.*?);
    diff_avg_bias        diff_avg_bias=(.*?);
    diff_avg_loglik      diff_avg_loglik=(.*?);
    diff_avg_qstat       diff_avg_qstat=(.*?);
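    Regex definitions like these are how SageMaker scrapes metric values from training logs. The sketch below applies two of the listed regexes to a log line; the log line itself is a made-up example.

```python
# Extract metric values from a training-log line using the regexes
# listed above (the log line is hypothetical, for illustration only).
import re

metric_defs = {
    "avg_fitvar": r"avg_fitvar=(.*?);",
    "avg_loglik": r"avg_loglik=(.*?);",
}
log_line = "avg_fitvar=0.8123; avg_loglik=-1.2045;"
metrics = {name: float(re.search(regex, log_line).group(1))
           for name, regex in metric_defs.items()}
```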

    Channel specification

    Fields marked with * are required

    train

    *
    Training dataset
    Input modes: File
    Content types: text/csv
    Compression types: None

    model

    *
    Trained model dataset
    Input modes: File
    Content types: application/gzip
    Compression types: None

    Hyperparameters

    Fields marked with * are required

    num_factors

    *
    Number of factors in variational Bayesian filtering factor analysis VBfFA modeling
    Type: Integer
    Tunable: Yes

    error_reduct_target

    *
    Target ratio of the factor-analysis output residual error variance to the specific error variance; equivalently, the filter's estimation-to-prediction error variance ratio target, i.e. its estimation-error variance reduction target. Targeting a lower ratio means larger error reduction and faster learning, but likely over-fitting; targeting a higher ratio means smaller error reduction and slower learning, but likely under-fitting.
    Type: Continuous
    Tunable: Yes

    num_data_points

    *
    Assumed number of data points (Tp) of Bayesian prior in variational Bayesian filtering factor analysis. Note on exponential weight defined by Tp: Decay factor of exponential weight = Tp / (Tp + 1).
    Type: Integer
    Tunable: Yes
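    The decay-factor relation stated in this description can be checked directly; the Tp values below are illustrative, not recommendations.

```python
# Decay factor of the exponential weight implied by the assumed
# prior size Tp, per the formula above: decay = Tp / (Tp + 1).
def decay_factor(tp):
    return tp / (tp + 1.0)

# Larger Tp -> decay closer to 1 -> longer memory; an exponential
# weight with decay d has effective length 1 / (1 - d) = Tp + 1.
decays = {tp: decay_factor(tp) for tp in (9, 99, 999)}
```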

    num_va_iteration

    *
    Number of iterations of variational approximation estimate in VBfFA modeling
    Type: Integer
    Tunable: Yes

    len_moving_window

    *
    Length of moving/rolling time windows for data in time-varying frequentist model learning
    Type: Integer
    Tunable: Yes

    ts_standardization

    *
    Data point weighting method for time-series standardization. If 'win' or '...win[dow]...': Standardizing time-series with equally weighted data points in trailing window of length len_moving_window. If 'exp' or '...exp[onential]...': Standardizing time-series with exponential weights defined by num_data_points Tp.
    Type: Categorical
    Tunable: No

    len_leaveout_window

    *
    Length, if any, of a leave-out time window, ending at the last time-stamp of the input vector time-series, containing data to be left out of VBfFA model fitting and inference and used later for model validation or testing
    Type: Integer
    Tunable: No

    max_len_output_ts

    *
    Maximum length of variational Bayesian filtering factor analysis output time-series
    Type: Integer
    Tunable: No

    score_target_type

    *
    Type of vector time-series serving as the VBfFA model's prediction target when estimating the score time-series for model evaluation. If 's', 'S' or 'standardized': use the input VTS rescaled to unit standard deviation. If 'o', 'O' or 'original': use the original input vector time-series (VTS).
    Type: Categorical
    Tunable: No

    max_predict_step

    *
    Maximum steps of prediction in estimation of score time-series for evaluation of VBfFA model
    Type: Integer
    Tunable: No

    weight_dict

    *
    Non-negative weight levels applied to the input or rescaled vector time-series to estimate the aggregate score time-series for VBfFA model evaluation. weight_dict = None: positive uniform (equal) weights. weight_dict['*']: non-negative default weight for every time-series. Keys ts1id, ts2id, ...: labels of time-series in VTS.index, serving as keys for alternative weight levels applied to the specified time-series.
    Type: FreeText
    Tunable: No
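    Since weight_dict is a FreeText hyperparameter, one plausible way to construct it is as a small JSON object; the series labels and weights below are hypothetical, and the JSON serialization is an assumption, not a documented requirement.

```python
# Illustrative weight_dict per the description above: '*' gives the
# default weight, per-label keys override it for specific series.
import json

weight_dict = {
    "*": 1.0,      # non-negative default weight for every time-series
    "ts1id": 2.0,  # emphasize this series in the aggregate score
    "ts2id": 0.0,  # give this series no weight in the aggregate score
}
weight_text = json.dumps(weight_dict)  # serialized for the free-text hyperparameter
```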

    max_num_ts_add_del

    *
    Maximum number of time-series that may be added or deleted while a previously trained VBfFA model can still be used/updated; beyond this number, the previously trained model is not used/updated
    Type: Integer
    Tunable: No

    Model input and output details

    Input

    Summary

    The VBfFA algorithm takes as input multiple time-series contained in a CSV (comma-separated values) data table, supplied as a CSV text-string or a CSV text-file.

    Each row of the data table holds the values of an individual time-series (TS); the row header is the label or symbol of that time-series. Each column holds the values of all time-series at a specific moment in time; the column header is the time-index or time-stamp of that moment.

    Input MIME type
    text/csv

    Output

    Summary

    Outputs from the VBfFA model are time-series of common-factor scores, variances of common factors, factor loadings, variances of factor loadings, and variances of residual errors. Other outputs include time-series of the variance-covariance matrix.

    Outputs in CSV format can be reviewed quickly in a spreadsheet application. Outputs formatted as JSON strings can be used as input data for further analysis.

    Output MIME type
    text/csv, application/json

    End User License Agreement

    By subscribing to this product you agree to the terms and conditions outlined in the product's End User License Agreement (EULA).

    Support Information

    Bayesian filtering Factor Analysis VBfFA

    For questions or call-back number, please send email to i4cast LLC at prod.i4cast@gmail.com.

    AWS Infrastructure

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Learn More

    Refund Policy

    We offer a full refund for academic work. Other refunds are offered according to common practice.

    Customer Reviews

    There are currently no reviews for this product.