Psychometric properties of aviation safety performance evaluation instruments: Dependability of assessments.


Abstract

This dissertation examines the psychometric properties of the instruments that Federal Aviation Administration (FAA) safety inspectors use to assess the cockpit performance of airline crews. While psychometric studies have been conducted in training programs administered by the airlines themselves, little research has addressed the processes and tools used by the FAA. The limited research that has been conducted suggests that the interrater reliability (IRR) of current assessments could be questionable. Active FAA inspectors used a version of the instrument currently in use by the FAA, along with two other instruments, to rate eight videotaped scenarios of staged cockpit activities. Generalizability Theory (GT) analysis (Cronbach, Gleser, Nanda, & Rajaratnam, 1972) was chosen because of its ability to address multiple error sources in a single analytical framework. The main effect for raters was smaller than expected, normally a sign of good IRR, but a substantial interaction effect was found between raters and the rating situation. A substantial effect between types of instruments was found, as expected, but the individual performance of each instrument was nearly equivalent in terms of allocation of error sources and generalizability coefficients. Further substantial situation-dependent effects were found when analyses were conducted for individual scenarios. The finding that the assessments made by these raters were highly variable between situations and types of instruments suggests problems with construct validity and with the definition of the dimensions of performance being evaluated. While there appears to be some consistency in the patterns of overall scoring, there appears to be considerable ambiguity with regard to the measured constructs and the related operational definitions that underlie the instruments' scales.
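
To illustrate the kind of analysis the abstract describes, the following is a minimal sketch of a Generalizability Theory G study for a fully crossed scenario × rater design with one score per cell. It is not the dissertation's actual analysis, which also included instrument type as a facet. The function names `g_study` and `coefficients` and the example ratings are hypothetical, and negative variance estimates are clipped to zero, a common convention that is assumed here.

```python
def g_study(scores):
    """Estimate variance components for a crossed scenario x rater design.

    scores[s][r] is the rating given to scenario s by rater r
    (hypothetical layout: every rater scores every scenario once).
    """
    n_s, n_r = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n_s * n_r)
    s_means = [sum(row) / n_r for row in scores]
    r_means = [sum(scores[s][r] for s in range(n_s)) / n_s for r in range(n_r)]

    # Mean squares from the two-way ANOVA without replication.
    ms_s = n_r * sum((m - grand) ** 2 for m in s_means) / (n_s - 1)
    ms_r = n_s * sum((m - grand) ** 2 for m in r_means) / (n_r - 1)
    ss_res = sum(
        (scores[s][r] - s_means[s] - r_means[r] + grand) ** 2
        for s in range(n_s)
        for r in range(n_r)
    )
    ms_res = ss_res / ((n_s - 1) * (n_r - 1))

    # Expected-mean-square solutions; negative estimates are set to zero.
    return {
        "scenario": max((ms_s - ms_res) / n_r, 0.0),
        "rater": max((ms_r - ms_res) / n_s, 0.0),
        # The interaction is confounded with error when there is one score per cell.
        "residual": ms_res,
    }


def coefficients(var, n_raters):
    """Return (generalizability coefficient, dependability index phi)
    for a decision study that averages over n_raters raters."""
    rel_err = var["residual"] / n_raters
    abs_err = (var["rater"] + var["residual"]) / n_raters
    g = var["scenario"] / (var["scenario"] + rel_err)
    phi = var["scenario"] / (var["scenario"] + abs_err)
    return g, phi


if __name__ == "__main__":
    # Made-up ratings: 4 scenarios (rows) scored by 3 raters (columns).
    ratings = [[3, 4, 3], [2, 2, 3], [4, 5, 4], [1, 2, 2]]
    components = g_study(ratings)
    print(components)
    print(coefficients(components, n_raters=3))
```

In this sketch a small `rater` component corresponds to the small rater main effect mentioned in the abstract, while a large `residual` component would absorb any rater-by-situation interaction. The dependability index counts the rater component as error, so it is never larger than the generalizability coefficient.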

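Bibliographic record

  • Author: Arendt, Donald N.
  • Author affiliation: Northcentral University
  • Degree-granting institution: Northcentral University
  • Subject: Engineering, Industrial; Psychology, Industrial; Psychology, Psychometrics
  • Degree: Ph.D.
  • Year: 2006
  • Pages: 122
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification: General industrial technology; Industrial psychology; Research methods in psychology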