摘要: We consider the multiple time-series alignment problem, typically focusing on task of synchronizing motion videos same kind human activity. Finding an optimal global sequences is infeasible, while there have been several approximate solutions, including iterative pairwise warping algorithms and variants hidden Markov models. In this paper, we propose a novel probabilistic model that represents conditional densities latent target which are aligned with given observed through variables. By imposing certain constraints at learning stage, sensible for alignments can be learned very efficiently by EM algorithm. Compared to existing methods, our approach yields more accurate being robust local optima initial configurations. demonstrate its efficacy both synthetic real-world facial emotions activities.