A comparative study of equating methods applied in standardized competence test for clinical medicine undergraduates

Chinese Journal of Medical Education, 2022, Vol. 42, Issue 7: 577-580. DOI: 10.3760/cma.j.cn115259-20210817-01034

Medical Education Assessment Column

Zhang Quanhui¹, He Ju², Ren Jie³, Zhang Ying⁴, Lu Yan⁵

¹Department of Information and Assessment, National Medicine Examination Center, Beijing 100097, China;
²National Medicine Examination Center, Beijing 100097, China;
³Institute of Language Testing and Talent Evaluation, Beijing Language and Culture University, Beijing 100083, China;
⁴Department of Examination Management, National Medicine Examination Center, Beijing 100097, China;
⁵Department of Development Research, National Medicine Examination Center, Beijing 100097, China

Received: 2021-08-17 Publication date: 2022-07-01 Release date: 2022-06-29
Corresponding author: Lu Yan, Email: luyan810206@163.com

Abstract

Abstract: Objective To compare equating methods based on classical test theory (CTT) and item response theory (IRT) using examinee responses from two annual administrations of the Standardized Competence Test for Clinical Medicine Undergraduates (hereafter, the competence test), and to identify the equating method best suited to the test. Methods Ten equating procedures were examined. Under CTT, four methods were used: Tucker observed-score linear equating, Levine observed-score linear equating, unsmoothed equipercentile equating, and smoothed equipercentile equating. Under IRT, three calibration methods (separate calibration, concurrent calibration, and fixed common-item parameter calibration) were applied to each of the one-parameter and two-parameter models, giving six procedures. The stability of the 10 sets of equating results was evaluated by the standard error of equating. Results The standard error of equating ranged from 0.7 to 1.6 for the CTT methods and from 0.2 to 0.6 for the IRT methods, so the IRT methods had smaller errors. Among the CTT methods, Tucker observed-score linear equating had the smallest error (0.7) and smoothed equipercentile equating the largest (1.6). Among the IRT methods, the one-parameter model gave better results than the two-parameter model, and within the one-parameter model, fixed common-item parameter calibration had the smallest error (0.2). Conclusions Fixed common-item parameter calibration under the IRT one-parameter model can be chosen for equating the competence test. After equating, year-2 scores were adjusted upward while the passing standard remained unchanged, which made scores comparable across years and supported the fairness of the examination.

Key words: Clinical medicine, Competence test, Classical test theory, Item response theory, Equating
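
For readers unfamiliar with the CTT method that performed best in the abstract, the following is a minimal sketch of Tucker observed-score linear equating under a common-item (anchor) design, using the formulas as they are commonly presented in equating textbooks. It is not the authors' implementation, and the abstract does not describe their software. The function name `tucker_linear_equating`, its parameters, the default weighting by sample size, and the toy scores are all illustrative assumptions.

```python
from statistics import fmean


def _var(a):
    """Population variance (divide by n)."""
    m = fmean(a)
    return sum((v - m) ** 2 for v in a) / len(a)


def _cov(a, b):
    """Population covariance (divide by n)."""
    ma, mb = fmean(a), fmean(b)
    return sum((p - ma) * (q - mb) for p, q in zip(a, b)) / len(a)


def tucker_linear_equating(x1, v1, y2, v2, w1=None):
    """Return a function that maps form-X scores onto the form-Y scale.

    x1, v1: total and common-item scores of group 1 (took form X)
    y2, v2: total and common-item scores of group 2 (took form Y)
    w1: synthetic-population weight for group 1; by assumption it
        defaults to the group's share of the combined sample.
    """
    n1, n2 = len(x1), len(y2)
    if w1 is None:
        w1 = n1 / (n1 + n2)
    w2 = 1.0 - w1

    # Regression slopes of total score on common-item score in each group.
    g1 = _cov(x1, v1) / _var(v1)
    g2 = _cov(y2, v2) / _var(v2)

    # Group differences on the common items.
    dmu = fmean(v1) - fmean(v2)
    dvar = _var(v1) - _var(v2)

    # Synthetic-population means and variances for X and Y.
    mu_x = fmean(x1) - w2 * g1 * dmu
    mu_y = fmean(y2) + w1 * g2 * dmu
    var_x = _var(x1) - w2 * g1 ** 2 * dvar + w1 * w2 * g1 ** 2 * dmu ** 2
    var_y = _var(y2) + w1 * g2 ** 2 * dvar + w1 * w2 * g2 ** 2 * dmu ** 2

    slope = (var_y / var_x) ** 0.5
    return lambda x: slope * (x - mu_x) + mu_y


# Toy data (invented): both groups score the same on the common items,
# but form Y totals are 2 points lower, so form Y is the harder form.
x1 = [10, 12, 15, 18, 20]
v1 = [3, 4, 5, 6, 7]
y2 = [8, 10, 13, 16, 18]
v2 = [3, 4, 5, 6, 7]

to_y_scale = tucker_linear_equating(x1, v1, y2, v2)
print(to_y_scale(14))  # a form-X score of 14 corresponds to 12 on form Y
```

In the toy data the two groups are equally able on the common items, so the 2-point gap in totals is attributed entirely to form difficulty and the equating function shifts scores by 2. When the groups differ on the common items, the `dmu` and `dvar` terms adjust the synthetic-population moments before the linear transformation is formed.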

CLC number: R-05

Zhang Quanhui, He Ju, Ren Jie, Zhang Ying, Lu Yan. A comparative study of equating methods applied in standardized competence test for clinical medicine undergraduates[J]. Chinese Journal of Medical Education, 2022, 42(7): 577-580, DOI: 10.3760/cma.j.cn115259-20210817-01034.

