首页> 外文会议>International Conference on Architecture of Computing Systems >Estimation of Reward and Decision Making for Trust-Adaptive Agents in Normative Environments
【24h】

Estimation of Reward and Decision Making for Trust-Adaptive Agents in Normative Environments

机译:规范环境信任自适应代理的奖励与决策估算

获取原文

摘要

In an open trusted Desktop Grid system with a normative environment incentives and sanctions may change during runtime. Every agent in the system computes work for other agents and also submits jobs to other agents. It has to decide for which agents it wants to work and to which agent it wants to give its jobs. We introduced a trust metric to isolate misbehaving agents. After getting a job processed by another agent it will get a reward. When processing a job for another agent it will get a positive trust-rating, but no direct reward. To come to a decision when accepting or rejecting jobs we need to be able to estimate the reward. Since the environment may change at runtime and to overcome delayed reward issues we use a neural network to estimate the reward based on the environment and trust level.
机译:在开放可信赖的桌面网格系统中,具有规范性环境激励和制裁在运行时可能会发生变化。系统中的每个代理都计算其他代理的工作,并将作业提交给其他代理商。它必须决定它想要工作的代理商以及它想要赋予其工作的代理人。我们介绍了一个信任度量标准来隔离行为不端的代理。通过另一个代理处理的工作后,它将获得奖励。为其他代理人处理作业时,它将获得一个积极的信任评级,但没有直接奖励。在接受或拒绝工作时,我们需要能够估计奖励的作业。由于环境在运行时可能会发生变化并克服延迟奖励问题,我们使用神经网络基于环境和信任级别来估算奖励。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号