首页>
外国专利>
DEEP REINFORCEMENT LEARNING FOR PERSONALIZED SCREEN CONTENT OPTIMIZATION
DEEP REINFORCEMENT LEARNING FOR PERSONALIZED SCREEN CONTENT OPTIMIZATION
展开▼
机译:个性化屏幕内容优化的深度强化学习
展开▼
页面导航
摘要
著录项
相似文献
摘要
Systems and methods are described for selecting content item identifiers for display. The system may identify a set of content items that are likely to be requested in the future based on a history of content item requests. The system then selects a first plurality of content categories using a category selection neural net and selects a first set of recommended content items for the first plurality of content categories. The system increases a reward score for the first plurality of content categories based on receiving a request for a content item that is included in the first set of recommended content items. The system also decreases the reward score for the first plurality of content categories based on determining that the requested content item is included in the set of content items that are likely to be requested in the future. The neural net is trained based on the reward score of the first plurality of content categories to reinforce reward score maximization. The trained neural net is the used to select content items for display.
展开▼