SPSS predictive analytics forecasting using data preparation for time series data in notebooks
Last updated: Jan 12, 2024
SPSS predictive analytics forecasting using data preparation for time series data in notebooks
Data preparation for time series data (TSDP) provides the functionality to convert raw time data (in Flattened multi-dimensional format, which includes transactional (event) based and column-based data) into regular time series data (in compact
row-based format) which is required by the subsequent time series analysis methods.
The main job of TSDP is to generate time series in terms of the combination of each unique value in the dimension fields with metric fields. In addition, it sorts the data based on the timestamp, extracts metadata of time variables, transforms
time series with another time granularity (interval) by applying an aggregation or distribution function, checks the data quality, and handles missing values if needed.
This is the date/time convertor API that's used to provide some functionalities of the date/time convertor inside TSDP for applications to use. There are two use cases for this component:
Compute the time points between a specified start and end time. In this case, the start and end time both occur after the first observation in the previous TSDP's output.
Compute the time points between a start index and end index referring to the last observation in the previous TSDP's output.
Temporal causal modeling
Copy link to section
Temporal causal modeling (TCM) refers to a suite of methods that attempt to discover key temporal relationships in time series data by using a combination of Granger causality and regression algorithms for variable selection.
Autoregressive (AR) models are built to compute out-of-sample forecasts for predictor series that aren't target series. These predictor forecasts are then used to compute out-of-sample forecasts for the target series.
Model produced by TemporalCausal
TemporalCausal exports outputs:
a JSON file that contains TemporalCausal model information
One of the advantages of building TCM models is the ability to detect model-based outliers. Outlier detection refers to a capability to identify the time points in the target series with values that stray too far from their expected (fitted)
values based on the TCM models.
Temporal Causal Root Cause Analysis
Copy link to section
The root cause analysis refers to a capability to explore the Granger causal graph in order to analyze the key/root values that resulted in the outlier in question.
Temporal Causal Scenario Analysis
Copy link to section
Scenario analysis refers to a capability of the TCM models to "play-out" the repercussions of artificially setting the value of a time series. A scenario is the set of forecasts that are performed by substituting the values of a root
time series by a vector of substitute values.
Temporal Causal Summary
Copy link to section
TCM Summary selects Top N models based on one model quality measure. There are five model quality measures: Root Mean Squared Error (RMSE), Root Mean Squared Percentage Error (RMSPE), Bayesian Information Criterion (BIC), Akaike Information
Criterion (AIC), and R squared (RSQUARE). Both N and the model quality measure can be set by the user.
Time Series Exploration
Copy link to section
Time Series Exploration explores the characteristics of time series data based on some statistics and tests to generate preliminary insights about the time series before modeling. It covers not only analytic methods for expert users (including
time series clustering, unit root test, and correlations), but also provides an automatic exploration process based on a simple time series decomposition method for business users.
Reverse Data preparation for time series data (RTSDP) provides functionality that converts the compact row based (CRB) format that's generated by TimeSeriesDataPreperation (TSDP) or TemporalCausalModel (TCM Score) back to the flattened multidimensional
(FMD) format.
About cookies on this siteOur websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising.For more information, please review your cookie preferences options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.