Application of AI assistant based on medical knowledge graphs in pathology drawing scoring

Chinese Journal of Medical Education, 2026, Vol. 46, Issue 6: 433-437. DOI: 10.3760/cma.j.cn115259-20250805-00890

Mo Jing¹, Han Jiyuan¹, Zhao Xiulan¹, Yan Jingrui², Sun Baocun¹

¹Department of Pathology, School of Basic Medicine, Tianjin Medical University, Tianjin 300070, China;
²Teaching Office, School of Basic Medicine, Tianjin Medical University, Tianjin 300070, China

Received: 2025-08-05 Online: 2026-06-01 Published: 2026-05-28
Contact: Sun Baocun, Email: sunbaocun@tmu.edu.cn

Abstract

Objective: To assess the reliability of an AI assistant based on a medical knowledge graph in scoring pathology drawings.

Methods: In February 2025, 135 pathology drawings by students in the 2020 cohort of the "5+3" integrated clinical medicine program at Tianjin Medical University were collected as the data source. The AI assistant, Kimi, and 5 pathology teachers each scored the drawings independently according to the scoring criteria. The scoring dimensions were professionalism, accuracy, logic, content completeness, knowledge application ability, learning attitude and standardization, and innovation and critical thinking. The Wilcoxon rank-sum test was used to compare the AI assistant's and Kimi's scores with the teachers' scores. With the mean score of the 5 pathology teachers as the reference standard, the intraclass correlation coefficient (ICC) was used to compare the consistency of the AI assistant's and Kimi's scores with the teachers' scores.

Results: The total scores for the pathology drawings were 68.0 (10.0) for the AI assistant, 82.0 (9.0) for Kimi, and 81.2 (7.3) for the teachers. The AI assistant's total score was lower than the teachers', and the difference was statistically significant (P<0.001). Kimi's total score did not differ significantly from the teachers' (P=0.112). In every scoring dimension, the AI assistant's scores were more consistent with the teachers' scores than Kimi's were. The AI assistant showed moderate consistency with the teachers in professionalism (ICC=0.55) and accuracy (ICC=0.56), whereas Kimi showed poor consistency with the teachers in professionalism (ICC=0.24) and accuracy (ICC=0.20).

Conclusions: The AI assistant based on a medical knowledge graph scored pathology drawings more reliably than the general-purpose artificial intelligence model and can serve as auxiliary support for pathology drawing scoring. The AI assistant scores relatively strictly; its scoring settings can be adjusted to align with the teachers' scoring range.
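The abstract compares each model with the teachers in two ways: a test for a difference in scores and an ICC for consistency. The sketch below illustrates why these can diverge. It is a minimal, standard-library-only illustration and not the authors' analysis: the abstract does not state which ICC form was used, so the code computes two common single-rater forms from a two-way ANOVA decomposition, ICC(C,1) for consistency and ICC(A,1) for absolute agreement. The function name `icc_single` and the toy scores are hypothetical and are not data from the study.

```python
from statistics import mean


def icc_single(scores):
    """Return (ICC(C,1), ICC(A,1)) for a table of scores.

    scores: list of rows, one row per drawing, one column per rater
    (for example: [teacher_mean, model_score]). Hypothetical helper.
    """
    n = len(scores)      # number of drawings (subjects)
    k = len(scores[0])   # number of raters (columns)
    grand = mean(v for row in scores for v in row)
    row_means = [mean(row) for row in scores]
    col_means = [mean(row[j] for row in scores) for j in range(k)]

    # Two-way ANOVA sums of squares: subjects, raters, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in scores for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)               # between-subject mean square
    msc = ss_cols / (k - 1)               # between-rater mean square
    mse = ss_err / ((n - 1) * (k - 1))    # residual mean square

    consistency = (msr - mse) / (msr + (k - 1) * mse)
    agreement = (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)
    return consistency, agreement


# Toy data (invented): a model that ranks drawings exactly like the
# teachers but scores every drawing 10 points lower.
teacher_mean = [80, 75, 90, 85, 70]
model_score = [t - 10 for t in teacher_mean]
table = [[t, m] for t, m in zip(teacher_mean, model_score)]

icc_c, icc_a = icc_single(table)
print(round(icc_c, 3), round(icc_a, 3))  # prints: 1.0 0.556
```

In this toy case a constant 10-point offset leaves the consistency form at 1.0 but lowers the absolute-agreement form to about 0.556. A systematically stricter rater can therefore still track the teachers' ordering of drawings closely, which is one possible reading of how the AI assistant could score lower overall while showing higher ICC values than Kimi.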

Mo Jing, Han Jiyuan, Zhao Xiulan, Yan Jingrui, Sun Baocun. Application of AI assistant based on medical knowledge graphs in pathology drawing scoring[J]. Chinese Journal of Medical Education, 2026, 46(6): 433-437. DOI: 10.3760/cma.j.cn115259-20250805-00890.

