作者: Nikhil Sheoran , Nayan Raju Vysyaraju , Varun Srivastava , Nisheeth Golakiya , Dhruv Singal
DOI:
关键词:
摘要: In some embodiments, interaction data associated with user interactions with a user interface of an interactive computing environment is identified, and goal clusters of the interaction data are computed based on sequences of the user interactions and performing inverse reinforcement learning on the goal clusters to return rewards and policies. Further, likelihood values of additional sequences of user interactions falling within the goal clusters are computed based on the policies corresponding to each of the goal clusters and assigning the additional sequences to the goal clusters with greatest likelihood values. Computing interface experience metrics of the additional sequences are computed using the rewards and the policies corresponding to the goal clusters of the additional sequences and transmitting the interface experience metrics to the online platform. The interface experience metrics are usable for …