发布日期: 2025-11-07

更新日期: 2025-11-27

文章字数: 2.3k

阅读时长: 9 分

阅读次数:

⚠️ 以下所有内容总结都来自于大语言模型的能力，如有错误，仅供参考，谨慎使用
🔴 请注意：千万不要用于严肃的学术场景，只能用于论文阅读前的初筛！
💗 如果您觉得我们的项目对您有帮助 ChatPaperFree ，还请您给我们一些鼓励！⭐️ HuggingFace免费体验

2025-11-07 更新

PLUTO-4: Frontier Pathology Foundation Models

Authors:Harshith Padigela, Shima Nofallah, Atchuth Naveen Chilaparasetti, Ryun Han, Andrew Walker, Judy Shen, Chintan Shah, Blake Martin, Aashish Sood, Elliot Miller, Ben Glass, Andy Beck, Harsha Pokkalla, Syed Ashar Javed

Foundation models trained on large-scale pathology image corpora have demonstrated strong transfer capabilities across diverse histopathology tasks. Building on this progress, we introduce PLUTO-4, our next generation of pathology foundation models that extend the Pathology-Universal Transformer (PLUTO) to frontier scale. We share two complementary Vision Transformer architectures in the PLUTO-4 family: a compact and efficient PLUTO-4S model optimized for multi-scale deployment using a FlexiViT setup with 2D-RoPE embeddings, and a frontier-scale PLUTO-4G model trained with a single patch size to maximize representation capacity and stability. Both models are pretrained using a self-supervised objective derived from DINOv2 on a large multi-institutional corpus containing 551,164 WSIs from 137,144 patients across over 50 institutions, spanning over 60 disease types and over 100 stains. Comprehensive evaluation across public and internal benchmarks demonstrates that PLUTO-4 achieves state-of-the-art performance on tasks requiring varying spatial and biological context, including patch-level classification, segmentation, and slide-level diagnosis. The compact PLUTO-4S provides high-throughput and robust performance for practical deployment, while PLUTO-4G establishes new performance frontiers across multiple pathology benchmarks, including an 11% improvement in dermatopathology diagnosis. These diverse improvements underscore PLUTO-4’s potential to transform real-world applications as a backbone for translational research and diagnostic use cases.

基于大规模病理学图像语料库训练的基石模型已在各种病理组织学任务中展现出强大的迁移能力。在此基础上，我们推出了PLUTO-4，这是我们下一代病理基石模型，它将病理通用转换器（PLUTO）扩展到前沿规模。我们分享了PLUTO-4系列中两种互补的愿景转换器架构：一个紧凑高效的PLUTO-4S模型，采用FlexiViT设置和2D-RoPE嵌入，优化多尺度部署；以及一个前沿规模的PLUTO-4G模型，使用单一补丁尺寸进行训练，以最大化表示能力和稳定性。这两种模型都使用来自DINOv2的基于自监督目标的预训练，在包含来自超过50个机构、涉及超过60种疾病类型和超过100种染色的大型多机构语料库上进行了训练，这些语料库包含来自超过5万名患者的超过5万张全景扫描图像（WSI）。在公共基准测试集和内部基准测试集上的全面评估表明，PLUTO-4在需要不同空间和生物学背景的任务上达到了最先进的性能，包括补丁级分类、分割和幻灯片级诊断。紧凑型的PLUTO-4S为实际部署提供了高通量和稳健的性能，而PLUTO-4G在多个病理学基准测试中建立了新的性能边界，包括皮肤病理学诊断提高了11%。这些多样化的改进突显了PLUTO-4作为翻译研究和诊断用例的骨干在现实世界应用中的潜力。

论文及项目相关链接

PDF

Summary

PLUTO-4是下一代病理学基础模型，扩展了病理学通用变压器（PLUTO），采用两种互补的Vision Transformer架构，适用于多种病理学任务。PLUTO-4在大型多机构数据集上进行预训练，并在公共和内部基准测试中表现出卓越性能，实现了包括斑块分类、分割和幻灯片级别诊断等任务的最新技术成果。其中，紧凑型的PLUTO-4S模型具有高吞吐量和稳健性能，适合实际部署，而前沿的PLUTO-4G模型则在多个病理学基准测试中建立了新的性能标准。

Key Takeaways

PLUTO-4是新一代病理学基础模型，在多种病理任务中表现出强大的迁移能力。
PLUTO-4家族包含两种互补的Vision Transformer架构：PLUTO-4S和PLUTO-4G。
PLUTO-4S是一个紧凑且高效的模型，适用于多尺度部署，具有高性能和鲁棒性。
PLUTO-4G是一个前沿规模的模型，通过单一补丁尺寸进行训练，以最大化表示能力和稳定性。
PLUTO-4在大规模多机构数据集上进行预训练，涵盖超过60种疾病类型和超过100种染色。
综合评估表明，PLUTO-4在包括斑块分类、分割和幻灯片级别诊断等任务上实现了最佳性能。

Cool Papers

点此查看论文截图

UniFault: A Fault Diagnosis Foundation Model from Bearing Data

Authors:Emadeldeen Eldele, Mohamed Ragab, Xu Qing, Edward, Zhenghua Chen, Min Wu, Xiaoli Li, Jay Lee

Machine fault diagnosis (FD) is a critical task for predictive maintenance, enabling early fault detection and preventing unexpected failures. Despite its importance, existing FD models are operation-specific with limited generalization across diverse datasets. Foundation models (FM) have demonstrated remarkable potential in both visual and language domains, achieving impressive generalization capabilities even with minimal data through few-shot or zero-shot learning. However, translating these advances to FD presents unique hurdles. Unlike the large-scale, cohesive datasets available for images and text, FD datasets are typically smaller and more heterogeneous, with significant variations in sampling frequencies and the number of channels across different systems and applications. This heterogeneity complicates the design of a universal architecture capable of effectively processing such diverse data while maintaining robust feature extraction and learning capabilities. In this paper, we introduce UniFault, a foundation model for fault diagnosis that systematically addresses these issues. Specifically, the model incorporates a comprehensive data harmonization pipeline featuring two key innovations. First, a unification scheme transforms multivariate inputs into standardized univariate sequences. Second, a novel cross-domain temporal fusion strategy mitigates distribution shifts and enriches sample diversity and count, improving the model generalization across varying conditions. UniFault is pretrained on over 6.9 million samples spanning diverse FD datasets, enabling superior few-shot performance. Extensive experiments on real-world FD datasets demonstrate that UniFault achieves state-of-the-art performance, setting a new benchmark for fault diagnosis models and paving the way for more scalable and robust predictive maintenance solutions.

机器故障诊断（FD）是预测性维护中的一项关键任务，能够实现早期故障检测并防止意外故障。尽管其重要性很高，但现有的FD模型都是针对特定操作的，在多种数据集上的泛化能力有限。基础模型（FM）在视觉和语言领域都表现出了显著的优势，即使在少量数据的情况下，也能通过小样本或零样本学习实现令人印象深刻的泛化能力。然而，将这些进展应用于FD却存在独特的障碍。与可用于图像和文本的大规模连贯数据集相比，FD数据集通常较小且更异质，不同系统和应用程序的采样频率和数据通道数量存在重大差异。这种异质性使得设计一个能够处理这种多样化数据并保持稳健的特征提取和学习能力的通用架构变得复杂。在本文中，我们引入了UniFault，这是一个用于故障诊断的基础模型，系统地解决了这些问题。具体而言，该模型结合了一个全面的数据协调管道，具有两个关键的创新点。首先，一个统一方案将多元输入转换为标准化的单变量序列。其次，一种新的跨域时间融合策略缓解了分布偏移问题并丰富了样本的多样性和数量，提高了模型在不同条件下的泛化能力。UniFault在涵盖多种FD数据集超过690万个样本上进行预训练，实现了优越的小样本性能。在真实世界的FD数据集上进行的广泛实验表明，UniFault达到了最新性能水平，为故障诊断模型设定了新的基准，并为更可扩展和稳健的预测性维护解决方案铺平了道路。

论文及项目相关链接

PDF

Summary：
机器故障检测是预测维护中的关键任务，现有的故障检测模型操作特定性较强，泛化能力有限。本文提出了UniFault这一通用故障检测模型，通过数据调和管道解决这些问题。模型将多元输入转化为标准化的一元序列，并采用跨域时间融合策略，提高模型在不同条件下的泛化能力。在真实世界的故障检测数据集上进行的大量实验表明，UniFault达到了最先进的性能，为故障检测模型树立了新标杆，为更可扩展和稳健的预测维护解决方案铺平了道路。

Key Takeaways：