作者: Nikhil Sheoran , Nayan Raju Vysyaraju , Varun Srivastava , Nisheeth Golakiya , Dhruv Singal
DOI:
关键词:
摘要: A method includes identifying interaction data associated with user interactions with a user interface of an interactive computing environment. The method also includes computing goal clusters of the interaction data based on sequences of the user interactions and performing inverse reinforcement learning on the goal clusters to return rewards and policies. Further, the method includes computing likelihood values of additional sequences of user interactions falling within the goal clusters based on the policies corresponding to each of the goal clusters and assigning the additional sequences to the goal clusters with greatest likelihood values. Furthermore, the method includes computing interface experience metrics of the additional sequences using the rewards and the policies corresponding to the goal clusters of the additional sequences and transmitting the interface experience metrics to the online platform. The …