Authors: Stefano Ermon, Lantao Yu, Jiaming Song, Yang Song, Chenlin Meng
DOI:
Keywords:
Abstract: Autoregressive models use the chain rule to define a joint probability distribution as a product of conditionals. These conditionals need to be normalized, imposing constraints on the …