Learning Alignments

Ieva Kazlauskaite
University of Bath

Ivan Ustyuzhaninov
University of Tübingen

Carl Henrik Ek
University of Bristol

Neill D.F. Campbell
University of Bath

Overview

This project encompasses a series of work on composite models with applications to the temporal alignment of sequences; our aim is to automatically learn alignments between high-dimensional data in an unsupervised manner. Our proposed methods cast alignment learning in a framework where both alignment and data are modelled simultaneously. Further, we automatically infer groupings of different types of sequences within the same dataset. We derive probabilistic models built on non-parametric priors that allow for flexible warps while at the same time providing means to specify interpretable constraints.

Given a set of time series sequences, the temporal alignment task consists of finding monotonic warps of the inputs (which typically correspond to time) that remove the differences in the timing of the observations. There are three intrinsic sources of ambiguity in this problem that motivate the use of probabilistic modelling. Firstly, the temporal alignment problem is ill-posed: there are infinitely many ways to align a finite set of sequences and we’d like to model this warping uncertainty. Secondly, the observed sequences might correspond to multiple different unknown underlying functions, hence the assignment of sequences to groups is ambiguous. Furthermore, the observed sequences are often noisy, requiring a principled way to model the observational noise. We introduce a non-parametric probabilistic model of monotonic warps and model each sequence as a composition of such a warp and a standard GP. To allow for alignment in multiple groups and to find these groups in an unsupervised manner, we use probabilistic alignment objectives (such as GP-LVM or DPMM).

Results

We demonstrate the efficacy of our approach with superior quantitative performance to the state-of-the-art approaches and provide examples to illustrate the versatility of our model in automatic inference of sequence groupings, absent from previous approaches, as well as easy specification of high level priors for different modalities of data.

**Comparison to state-of-the-art alignment methods.** Average error on 25 datasets proposed by Zhou and De la Torre. The proposed method is on the right with the preceeding three methods as comparative variants of our approach.

MSE (SD)	SRVF	GP-LVM+BASIS	OURS
Alignment	6.4 (±1.7)	8.4 (±2.7)	5.9 (±1.1)
Warping	30.0 (±10.4)	9.7 (±4.9)	9.7 (±5.7)

Quantitative comparison of alignments and warps. The table considers the best competing methods on the dataset of Zhou and De la Torre with multiple true sequences (alignment and grouping task).

The following diagram illustrates the results of our two methods for automatic alignment and clustering of signals, one using the GP-LVM and the other using an explicit Dirichlet Process clustering model. The later is more appropriate for the heart beats data since we are informed by clinicians that the data should be grouped into distinct patterns.

**Automatic alignment and classification of heart beats data.** The heart beat sounds display a "lub dub, lub dub” pattern that varies temporally depending on the age, health, and state of the subject. (a) The unaligned signals disguise the patterns between signals as displayed in the unstructured GP-LVM manifold. (b) The aligned signals are grouped into two-distinct patterns by their location on the resulting manifold. (c) In our Dirichlet Process approach, we explicitly cluster the signals into two patterns (automatically discovered) and are able to improve the alignment quality. Dataset from the PASCAL Classifying Heart Sounds Challenge 2011 (CHSC2011) of Bentley et al.

Publications

Aligned Multi-Task Gaussian Process,
Olga Mikheeva, Ieva Kazlauskaite, Adam Hartshorne, Hedvig Kjellström, Carl Henrik Ek and Neill D. F. Campbell,
Int. Conf. on Artificial Intelligence and Statistics (AISTATS), 2022
[pdf] [code]

Monotonic Gaussian Process Flow,
Ivan Ustyuzhaninov, Ieva Kazlauskaite, Carl Henrik Ek and Neill D. F. Campbell,
Int. Conf. on Artificial Intelligence and Statistics (AISTATS), 2020
[pdf] [code]

Compositional Uncertainty in Deep Gaussian Processes,
Ivan Ustyuzhaninov, Ieva Kazlauskaite, Markus Kaiser, Erik Bodin, Neill D. F. Campbell and Carl Henrik Ek,
Conf. on Uncertainty in Artificial Intelligence (UAI), 2020
[pdf] [supplemental] [code]

Gaussian Process Latent Variable Alignment Learning,
Ieva Kazlauskaite, Carl Henrik Ek and Neill D. F. Campbell,
Int. Conf. on Artificial Intelligence and Statistics (AISTATS), 2019
[pdf] [code]

Sequence Alignment with Dirichlet Process Mixtures,
Ieva Kazlauskaite, Ivan Ustyuzhaninov, Carl Henrik Ek and Neill D. F. Campbell,
NeurIPS Workshop on Bayesian Non-Parametrics, 2018
[pdf]

Learning Alignments from Latent Space Structures,
Ieva Kazlauskaite, Carl Henrik Ek and Neill D. F. Campbell,
NeurIPS Workshop on Learning in High-Dimensions with Structure, 2016
[pdf]

Acknowledgements

This work has been supported by EPSRC CDE (EP/L016540/1) and CAMERA (EP/M023281/1) grants as well as the Royal Society. IK would like to thank the Frostbite Physics team at EA.