文献速递:深度学习肝脏肿瘤诊断---基于深度学习的肝细胞结节性病变在整片组织病理图像上的分类

本文主要是介绍文献速递:深度学习肝脏肿瘤诊断---基于深度学习的肝细胞结节性病变在整片组织病理图像上的分类,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

Title 

题目

Deep Learning-Based Classification of Hepatocellular Nodular Lesions on Whole-Slide Histopathologic Images

基于深度学习的肝细胞结节性病变在整片组织病理图像上的分类

Background 

背景

Hepatocellular nodular lesions (HNLs) constitute a heterogeneous group of disorders. Differential diagnosis among these lesions, especially high-grade dysplasticnodules (HGDNs) and well-differentiated hepatocellular carci noma (WD-HCC), can be challenging, let alone biopsy speci mens. We aimed to develop a deep learning system to solve these puzzles, improving the histopathologic diagnosis of HNLs (WD-HCC, HGDN, low-grade DN, focal nodular hyperplasia,hepatocellular adenoma), and background tissues (nodularcirrhosis, normal liver tissue).

肝细胞结节性病变(HNLs)构成了一个异质性疾病群。这些病变之间的鉴别诊断,特别是高级别发育不良结节(HGDNs)与良性分化的肝细胞癌(WD-HCC),可能具有挑战性,更不用说活检样本了。我们旨在开发一个深度学习系统来解决这些难题,以提高HNLs(WD-HCC、HGDN、低级别DN、局灶性结节性增生、肝细胞腺瘤)以及背景组织(结节性肝硬化、正常肝组织)的组织病理诊断。

Conclusions

结论

We first developed a deep learning diagnostic model for HNLs, which performed well and contributed to enhancing the diagnosis rate of early HCC and risk stratification of patients with HNLs. Furthermore, HnAIM had significant ad vantages in patch-level recognition, with important diagnostic implications for fragmentary or scarce biopsy specimens.

我们首次开发了一个用于HNLs的深度学习诊断模型,该模型表现良好,并有助于提高早期HCC的诊断率和HNLs患者的风险分级。此外,HnAIM在补丁层面识别方面具有显著优势,对于零碎或稀缺的活检样本具有重要的诊断意义。

Results

结果

We obtained 213,280 patches from 1115 whole-slide images of 738 patients. An optimal model was finally chosen based on F1 score and area under the curve value, named hepatocellular-nodular artificial intelligence model (HnAIM), with the overall 7-category area under the curve of 0.935 in the independent external validation cohort. For biopsy specimens, the agreement rate with sub specialists’ majority opinion was higher for HnAIM than 9 pa thologists on both patch level and whole-slide images level.

我们从738名患者的1115张整片幻灯片图像中获得了213,280个补丁。基于F1得分和曲线下面积值,最终选择了一个最优模型,命名为肝细胞结节性人工智能模型(HnAIM),在独立外部验证队列中,7类别的曲线下面积为0.935。对于活检样本,HnAIM与亚专家多数意见的一致率高于9名病理学家,无论是在补丁层面还是整片幻灯片图像层面。

Method

方法

The samples consisting of surgical and biopsy specimens were collected from 6 hospitals. Each specimen was reviewed by 2 to 3 subspecialists. Four deep neural networks (ResNet50, InceptionV3, Xception,and the Ensemble) were used. Their performances were eval uated by confusion matrix, receiver operating characteristic curve, classification map, and heat map. The predictive efficiency of the optimal model was further verified by comparing with that of 9 pathologists.

样本包括手术和活检标本,这些标本收集自6家医院。每个标本由2至3名亚专科医生审核。使用了四个深度神经网络(ResNet50、InceptionV3、Xception和集成网络)。它们的性能通过混淆矩阵、接收者操作特征曲线、分类图和热图进行评估。通过与9名病理医生的诊断结果进行比较,进一步验证了最优模型的预测效率。

Figure

图片

Figure 1. Data, study design, and HnAIM classification framework. Six independent data sets (Headquarters, Lingnan andYuedong Hospital of SYSUTH, SYSUFH, FSFPH, and GZFPH) were used in this study. (A) The Headquarters and YuedongHospital of SYSUTH data sets were used for developing a 7-category discriminative model, while the other 4 data sets wereused for the external testing. (B) The distribution of the samples for each type of liver nodule in model development (left) andindependent external validation (right). (C) Flow chart of the study. The data sets of the 7 categories were divided into thetraining (70%), validation (15%), and testing (15%) sets. Then, ROIs were labeled with green masks for each category. Patcheswere extracted from ROIs by OpenSlide library at  40 magnification with a size of 1024  1024. The training set was used totrain the ensemble model based on 3 basic models, while the validation set was used to fine-tune superparameters, such as learning rate, and the testing set used to evaluate models’ performances by confusion matrix, ROC curve, WSI-level classi-fication map, and patch-level heat map. Patches of liver biopsy specimens were predicted by the optimal model and areshown using a histogram, while the model’s referral decisions were compared with the ones made by different levels ofpathologists.

图1. 数据、研究设计和HnAIM分类框架。本研究使用了六个独立数据集(总部、岭南及SYSUTH的粤东医院、SYSUFH、FSFPH和GZFPH)。(A) 总部和SYSUTH的粤东医院数据集用于开发7类鉴别模型,而其他四个数据集用于外部测试。(B) 模型开发中(左)和独立外部验证中(右)各类型肝结节样本的分布。(C) 研究流程图。7类数据集被划分为训练集(70%)、验证集(15%)和测试集(15%)。然后,每个类别的感兴趣区域(ROIs)用绿色遮罩标记。通过OpenSlide库以40倍放大从ROIs提取1024×1024大小的补丁。训练集用于基于三个基础模型训练集成模型,验证集用于调整超参数,如学习率,测试集用于通过混淆矩阵、ROC曲线、WSI级分类图和补丁级热图评估模型性能。肝活检标本的补丁由最优模型预测,并通过直方图显示,而模型的转诊决定与不同级别的病理医生所做的决定进行比较。

图片

Figure 2. Performance of deep learning models. (A) Classification results are shown by confusion matrices on the internal testing set for Resnet50, Inception V3, Xception, and the Ensemble model. Numbers represent the number of patches classified correctly (diagonal) and incorrectly (off the diagonal). (B) The ROC curve and the AUC value on the internal testing set for models of Resnet50 (black line), Inception V3 (blue line), Xception (green line), and Ensemble (red line). The Xception and the Ensemble models both performed the best, with AUC values of 0.9991, indicating models were trained with high accuracy. (C) The ROC curve and AUC value on the independent external validation using the Ensemble model (HnAIM) in FSFPH, SYSUFH, GZFPH, and the entire external data set.

图2. 深度学习模型的性能。(A) 在内部测试集上,Resnet50、Inception V3、Xception和集成模型的分类结果通过混淆矩阵显示。数字代表正确分类(对角线上)和错误分类(对角线外)的补丁数量。(B) 在内部测试集上,Resnet50(黑线)、Inception V3(蓝线)、Xception(绿线)和集成模型(红线)的ROC曲线和AUC值。Xception和集成模型的表现最佳,AUC值为0.9991,表明模型具有高精度的训练。(C) 使用集成模型(HnAIM)在FSFPH、SYSUFH、GZFPH和整个外部数据集上的独立外部验证的ROC曲线和AUC值。

图片

Figure 3. WSI-level panoramicclassification map of surgicalsample: (A) WD-HCC, (B)HGDN, (C), LDN, (D), FNH, and(E) HCA. (Left) Original WSIs(original magnification  0.4).(Middle) Classification mapswere constructed frommodel’s predictions of corresponding patches. Colorsfrom blue to red meantdifferent liver lesions. For NC,LGDN, HGDN, and WDHCC,gradually deepening coloreven indicated increased degree of malignancy (labels: 2,5–7). The diagnostic labelswere as follows: 0 for background, 1 for NNL, 2 for NC, 3for HCA, 4 for FNH, 5 forLGDN, 6 for HGDN, and 7 forWDHCC. (Right) Pie charts

quantitatively show the percentage of different categoriesin each WSI.

图3. 外科样本的WSI级全景分类图:(A) WD-HCC,(B) HGDN,(C) LDN,(D) FNH,和 (E) HCA。(左) 原始WSIs(原始放大倍数0.4)。(中) 分类图根据模型对应补丁的预测构建。颜色从蓝色到红色表示不同的肝脏病变。对于NC、LGDN、HGDN和WDHCC,颜色的逐渐加深甚至表示恶性程度的增加(标签:2,5-7)。诊断标签如下:0代表背景,1代表NNL,2代表NC,3代表HCA,4代表FNH,5代表LGDN,6代表HGDN,7代表WDHCC。(右) 饼图定量显示每个WSI中不同类别的百分比。

图片

Figure 4. Performance of HnAIM in biopsy specimens and comparison with pathologists. (A) Patch-level histogram of biopsy specimens shows the model’s predictions for 7 categories, with a focus on cell morphologic features. The category with the largest proportion was regarded as the final classification. Agreement rates with the majority opinion of subspecialists for the HnAIM and pathologists (3 each for junior, intermediate, and senior pathologist) on 7 categories across (B) all 961 patches and (C) 30 WSIs of biopsy specimens. To represent the average level of each group, the agreement rate was shown as the mean value across 3 pathologists. The error bars represent the 95% CIs. Potential reasons for disagreements among pathologists with HnAIM may include inherent uncertainty in the 2-dimensional interpretation of a 3-dimensional specimen, ambiguity in diagnostic guidelines, the limited number of tissue samples, and cognitive factors such as anchoring.

图4. HnAIM在活检标本中的表现及与病理医生的比较。(A) 活检标本的补丁级直方图显示了模型对7个类别的预测,重点关注细胞形态特征。占比最大的类别被视为最终分类。HnAIM与亚专家多数意见的一致率以及(B)所有961个补丁和(C)30个活检样本WSI中7个类别的病理医生(初级、中级和高级各3名)的一致率。为代表每组的平均水平,一致率以3名病理医生的平均值显示。误差条表示95%置信区间。病理医生与HnAIM之间意见不一的潜在原因可能包括对三维标本二维解读的固有不确定性、诊断指南的模糊性、组织样本数量有限以及认知因素如锚定效应。

Table

图片

Table 1.Seven-Category Agreement With Subspecialists’ Majority Opinion of 9 Pathologists and Hepatocellular-NodularArtificial Intelligence Model Based on Patches and Whole-Slide Images of 30 Liver Biopsy Specimens

表1. 基于30个肝活检标本的补丁和整片图像的九名病理学家和肝细胞结节性人工智能模型与亚专家多数意见的七类别一致性

图片

Table 2.Lesion Characteristics of Patients With Indefinite Diagnoses after 3 Independent Reviews

表2. 经过三次独立审查后,诊断不确定的患者的病变特征

这篇关于文献速递:深度学习肝脏肿瘤诊断---基于深度学习的肝细胞结节性病变在整片组织病理图像上的分类的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/908812

相关文章

HarmonyOS学习(七)——UI(五)常用布局总结

自适应布局 1.1、线性布局(LinearLayout) 通过线性容器Row和Column实现线性布局。Column容器内的子组件按照垂直方向排列,Row组件中的子组件按照水平方向排列。 属性说明space通过space参数设置主轴上子组件的间距,达到各子组件在排列上的等间距效果alignItems设置子组件在交叉轴上的对齐方式,且在各类尺寸屏幕上表现一致,其中交叉轴为垂直时,取值为Vert

Ilya-AI分享的他在OpenAI学习到的15个提示工程技巧

Ilya(不是本人,claude AI)在社交媒体上分享了他在OpenAI学习到的15个Prompt撰写技巧。 以下是详细的内容: 提示精确化:在编写提示时,力求表达清晰准确。清楚地阐述任务需求和概念定义至关重要。例:不用"分析文本",而用"判断这段话的情感倾向:积极、消极还是中性"。 快速迭代:善于快速连续调整提示。熟练的提示工程师能够灵活地进行多轮优化。例:从"总结文章"到"用

基于人工智能的图像分类系统

目录 引言项目背景环境准备 硬件要求软件安装与配置系统设计 系统架构关键技术代码示例 数据预处理模型训练模型预测应用场景结论 1. 引言 图像分类是计算机视觉中的一个重要任务,目标是自动识别图像中的对象类别。通过卷积神经网络(CNN)等深度学习技术,我们可以构建高效的图像分类系统,广泛应用于自动驾驶、医疗影像诊断、监控分析等领域。本文将介绍如何构建一个基于人工智能的图像分类系统,包括环境

【前端学习】AntV G6-08 深入图形与图形分组、自定义节点、节点动画(下)

【课程链接】 AntV G6:深入图形与图形分组、自定义节点、节点动画(下)_哔哩哔哩_bilibili 本章十吾老师讲解了一个复杂的自定义节点中,应该怎样去计算和绘制图形,如何给一个图形制作不间断的动画,以及在鼠标事件之后产生动画。(有点难,需要好好理解) <!DOCTYPE html><html><head><meta charset="UTF-8"><title>06

学习hash总结

2014/1/29/   最近刚开始学hash,名字很陌生,但是hash的思想却很熟悉,以前早就做过此类的题,但是不知道这就是hash思想而已,说白了hash就是一个映射,往往灵活利用数组的下标来实现算法,hash的作用:1、判重;2、统计次数;

认识、理解、分类——acm之搜索

普通搜索方法有两种:1、广度优先搜索;2、深度优先搜索; 更多搜索方法: 3、双向广度优先搜索; 4、启发式搜索(包括A*算法等); 搜索通常会用到的知识点:状态压缩(位压缩,利用hash思想压缩)。

零基础学习Redis(10) -- zset类型命令使用

zset是有序集合,内部除了存储元素外,还会存储一个score,存储在zset中的元素会按照score的大小升序排列,不同元素的score可以重复,score相同的元素会按照元素的字典序排列。 1. zset常用命令 1.1 zadd  zadd key [NX | XX] [GT | LT]   [CH] [INCR] score member [score member ...]

【机器学习】高斯过程的基本概念和应用领域以及在python中的实例

引言 高斯过程(Gaussian Process,简称GP)是一种概率模型,用于描述一组随机变量的联合概率分布,其中任何一个有限维度的子集都具有高斯分布 文章目录 引言一、高斯过程1.1 基本定义1.1.1 随机过程1.1.2 高斯分布 1.2 高斯过程的特性1.2.1 联合高斯性1.2.2 均值函数1.2.3 协方差函数(或核函数) 1.3 核函数1.4 高斯过程回归(Gauss

【学习笔记】 陈强-机器学习-Python-Ch15 人工神经网络(1)sklearn

系列文章目录 监督学习:参数方法 【学习笔记】 陈强-机器学习-Python-Ch4 线性回归 【学习笔记】 陈强-机器学习-Python-Ch5 逻辑回归 【课后题练习】 陈强-机器学习-Python-Ch5 逻辑回归(SAheart.csv) 【学习笔记】 陈强-机器学习-Python-Ch6 多项逻辑回归 【学习笔记 及 课后题练习】 陈强-机器学习-Python-Ch7 判别分析 【学

系统架构师考试学习笔记第三篇——架构设计高级知识(20)通信系统架构设计理论与实践

本章知识考点:         第20课时主要学习通信系统架构设计的理论和工作中的实践。根据新版考试大纲,本课时知识点会涉及案例分析题(25分),而在历年考试中,案例题对该部分内容的考查并不多,虽在综合知识选择题目中经常考查,但分值也不高。本课时内容侧重于对知识点的记忆和理解,按照以往的出题规律,通信系统架构设计基础知识点多来源于教材内的基础网络设备、网络架构和教材外最新时事热点技术。本课时知识