🛡️ QuestDB 9.0 is here!Read the release blog

Partial Autocorrelation Function

SUMMARY

The Partial Autocorrelation Function (PACF) measures the direct correlation between observations separated by a given lag after removing the effects of intermediate lags. It's a crucial tool for identifying the order of autoregressive processes and understanding the pure relationship between time series observations.

Understanding partial autocorrelation

The PACF differs from the regular autocorrelation function by isolating the "pure" correlation between observations at different lags. For lag k, it measures the correlation between $y_t$ and $y_{t-k}$ while controlling for the effects of observations at intermediate lags $(y_{t-1}, y_{t-2}, ..., y_{t-k+1})$ .

Mathematically, the partial autocorrelation at lag k, denoted as $\phi_{kk}$ , can be expressed as:

$\phi_{kk} = Corr(y_t - \hat{y}_t^{(k-1)}, y_{t-k} - \hat{y}_{t-k}^{(k-1)})$

where $\hat{y}_t^{(k-1)}$ is the linear projection of $y_t$ on $(y_{t-1}, ..., y_{t-k+1})$ .

Next generation time-series database

QuestDB is an open-source time-series database optimized for market and heavy industry data. Built from scratch in Java and C++, it offers high-throughput ingestion and fast SQL queries with time-series extensions.

Try live demo Read documentation

Applications in time series analysis

Model identification

The PACF is particularly valuable for:

Determining the order (p) of autoregressive (AR) models
Identifying direct dependencies in time series data
Distinguishing between different types of time series processes

Interpreting PACF plots

Key characteristics to observe:

Sharp cutoff after lag p indicates an AR(p) process
Gradual decay suggests moving average components
Significance bounds help identify meaningful correlations

Next generation time-series database

Try live demo Read documentation

Statistical estimation

The PACF can be estimated using several methods:

Durbin-Levinson Algorithm: $\phi_{kk} = \frac{\rho_k - \sum_{j=1}^{k-1} \phi_{k-1,j}\rho_{k-j}}{1 - \sum_{j=1}^{k-1} \phi_{k-1,j}\rho_j}$
Yule-Walker Equations: Solving the system of equations: $\rho_k = \sum_{j=1}^p \phi_j \rho_{k-j}$
Regression Method: Fitting successive autoregressions and extracting the coefficient of the last lag

Relationship with other time series concepts

The PACF is closely related to:

Maximum likelihood estimation in model fitting
Stationarity test procedures
State-space model identification

Applications in financial time series

In financial markets, PACF helps in:

Identifying trading signal dependencies
Risk factor analysis
Market microstructure modeling
Price prediction model development

The function is particularly valuable when analyzing:

Market returns
Trading volumes
Volatility patterns
Order flow dynamics

Best practices

When using PACF:

Always check for stationarity first
Use appropriate confidence intervals
Consider multiple lag orders
Compare with ACF for complete analysis
Account for seasonal effects

Computational considerations

Efficient PACF calculation requires:

Optimal memory management for large datasets
Handling missing or irregular data
Appropriate numerical precision
Efficient algorithm implementation

The computational complexity typically scales with both the number of observations and the maximum lag considered.