AUTOMATED OPTIMIZATION OF A MASS POLICY COLLECTIVELY PERFORMED FOR OBJECTS IN TWO OR MORE STATES AND A DIRECT POLICY PERFORMED IN EACH STATE
Abstract
An information processing apparatus optimizes a policy in a transition model in which the number of targeted objects in each state changes according to the policy. The apparatus includes: a cost constraint acquisition unit configured to acquire a cost constraint that limits the total cost of the policy; a mass policy setting unit configured to set, for a mass policy executed collectively for objects in two or more states, the number of objects targeted by the mass policy in each state, based on a predefined number of objects belonging to each state and a reach rate at which the mass policy reaches an object; and a processing unit configured to treat the reach rate of the mass policy as an optimization variable and to maximize an objective function based on the total reward over the whole period while satisfying the cost constraint.
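The idea of treating the reach rate as an optimization variable can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the two states, the transition probabilities, the rewards, the unit cost, the budget, and the names `simulate` and `best_reach_rate`. The abstract does not say which solver is used, so a plain grid search over the reach rate stands in for the optimizer.

```python
# Toy sketch only: all numbers, names, and the grid search are illustrative
# assumptions, not the method described in the patent.

def simulate(reach_rate, counts, p_reached, p_unreached, reward, unit_cost, periods):
    """Return (total_reward, total_cost) when one mass policy runs every period."""
    counts = list(counts)
    n_states = len(counts)
    total_reward = 0.0
    total_cost = 0.0
    for _ in range(periods):
        # Number of objects the mass policy targets in each state:
        # (objects currently in the state) * (reach rate).
        reached = [reach_rate * c for c in counts]
        unreached = [c - r for c, r in zip(counts, reached)]
        total_cost += unit_cost * sum(reached)
        # Reached and unreached objects move between states with
        # different transition probabilities.
        nxt = [0.0] * n_states
        for s in range(n_states):
            for t in range(n_states):
                nxt[t] += reached[s] * p_reached[s][t] + unreached[s] * p_unreached[s][t]
        counts = nxt
        total_reward += sum(reward[s] * counts[s] for s in range(n_states))
    return total_reward, total_cost


def best_reach_rate(budget, steps=100, **model):
    """Grid-search the reach rate in [0, 1]; keep the best rate within budget."""
    best = None
    for i in range(steps + 1):
        rate = i / steps
        total_reward, total_cost = simulate(rate, **model)
        feasible = total_cost <= budget + 1e-9
        if feasible and (best is None or total_reward > best[1]):
            best = (rate, total_reward, total_cost)
    return best


# Hypothetical model: state 0 = inactive, state 1 = active.
model = dict(
    counts=[100.0, 50.0],                    # predefined number of objects per state
    p_reached=[[0.6, 0.4], [0.1, 0.9]],      # transitions for objects the policy reaches
    p_unreached=[[0.9, 0.1], [0.3, 0.7]],    # transitions for objects it does not reach
    reward=[0.0, 1.0],                       # reward per object per period, by state
    unit_cost=1.0,                           # cost per reached object
    periods=3,
)
result = best_reach_rate(budget=150.0, **model)
print("reach rate, total reward, total cost:", result)
```

In this toy model the total cost is 450 times the reach rate, so the budget of 150 caps the rate at about one third, and the search settles on the largest feasible grid point. A real formulation would likely add per-state direct policies and use a proper solver; both are omitted here to keep the sketch short.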