ASME International Design Engineering Technical Conferences and Computers and Information in Engineering Conference 2010

THE MULTI-RELATIONSHIP EVALUATION DESIGN FRAMEWORK: DESIGNING TESTING PLANS TO COMPREHENSIVELY ASSESS ADVANCED AND INTELLIGENT TECHNOLOGIES



Abstract

As new technologies develop and mature, it becomes critical to provide both formative and summative assessments of their performance. Performance assessment events range in form from a few simple tests of key elements of the technology to highly complex and extensive evaluation exercises targeting specific levels and capabilities of the system under scrutiny. Typically, the more advanced the system, the more often performance evaluations are warranted, and the more complex the evaluation planning becomes. Numerous evaluation frameworks have been developed to generate evaluation designs intent on characterizing the performance of intelligent systems. Many of these frameworks enable the design of extensive evaluations, but each has its own focused objectives within an inherent set of known boundaries.

This paper introduces the Multi-Relationship Evaluation Design (MRED) framework, whose ultimate goal is to automatically generate an evaluation design based upon multiple inputs. The MRED framework takes input goal data and outputs an evaluation blueprint complete with specific evaluation elements, including level of technology to be tested, metric type, user type, and evaluation environment. Some of MRED's unique features are that it characterizes these relationships and manages their uncertainties along with those associated with evaluation input. The authors will introduce MRED by first presenting relationships between four main evaluation design elements. These evaluation elements are defined and the relationships between them are established, including the connections between evaluation personnel (not just the users), their level of knowledge, and decision-making authority. This will be further supported through the definition of key terms. An example will be presented in which these terms and relationships are applied to the evaluation design of an automobile technology. An initial validation step follows, in which MRED is applied to a speech translation technology whose evaluation design was inspired by the successful use of a pre-existing evaluation framework. It is important to note that MRED is still in its early stages of development; this paper presents numerous MRED outputs. Future publications will present the remaining outputs, the uncertain inputs, and MRED's implementation steps that produce the detailed evaluation blueprints.
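To make the input-to-blueprint idea in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It models a blueprint as a record with the four elements the abstract names (level of technology, metric type, user type, evaluation environment) and enumerates the blueprints compatible with a set of input constraints. The option values in `OPTIONS`, the class `EvaluationBlueprint`, and the function `candidate_blueprints` are all hypothetical placeholders chosen for illustration; the abstract does not list the actual categories, and this sketch does not model the relationships or uncertainty handling that MRED is said to provide.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class EvaluationBlueprint:
    """One candidate evaluation design (field names follow the abstract)."""
    technology_level: str
    metric_type: str
    user_type: str
    environment: str


# Hypothetical option sets, for illustration only; the abstract does not
# enumerate the real categories used by MRED.
OPTIONS = {
    "technology_level": ["component", "system"],
    "metric_type": ["technical performance", "utility assessment"],
    "user_type": ["technology developer", "end user"],
    "environment": ["laboratory", "field"],
}


def candidate_blueprints(constraints):
    """Return every blueprint whose fields satisfy the given constraints.

    `constraints` maps a field name to the set of values allowed for it.
    Fields that are not mentioned are left unconstrained.
    """
    fields = list(OPTIONS)
    pools = []
    for name in fields:
        allowed = constraints.get(name, OPTIONS[name])
        pools.append([value for value in OPTIONS[name] if value in allowed])
    # Cartesian product of the per-field pools gives all compatible designs.
    return [EvaluationBlueprint(*combo) for combo in product(*pools)]


if __name__ == "__main__":
    goals = {"technology_level": {"system"}, "environment": {"field"}}
    for blueprint in candidate_blueprints(goals):
        print(blueprint)
```

Plain enumeration is used here only because it is the simplest way to show how input goals could narrow the space of evaluation designs; a real framework would also need to encode which combinations of elements are mutually compatible.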
