PseudoCell: Hard Negative Mining as Pseudo Labeling for Deep Learning-Based Centroblast Cell Detection

IEEE Open J Eng Med Biol. 2024 May 30:5:514-523. doi: 10.1109/OJEMB.2024.3407351. eCollection 2024.

Abstract

Background: Deep learning models for patch classification in whole-slide images (WSIs) have shown promise in assisting follicular lymphoma grading. However, these models often require pathologists to identify centroblasts and manually provide refined labels for model optimization. Objective: To address this limitation, we propose PseudoCell, an object detection framework for automated centroblast detection in WSI, eliminating the need for extensive pathologist's refined labels. Methods: PseudoCell leverages a combination of pathologist-provided centroblast labels and pseudo-negative labels generated from undersampled false-positive predictions based on cell morphology features. This approach reduces the reliance on time-consuming manual annotations. Results: Our framework significantly reduces the workload for pathologists by accurately identifying and narrowing down areas of interest containing centroblasts. Depending on the confidence threshold, PseudoCell can eliminate 58.18-99.35% of irrelevant tissue areas on WSI, streamlining the diagnostic process. Conclusion: This study presents PseudoCell as a practical and efficient prescreening method for centroblast detection, eliminating the need for refined labels from pathologists. The discussion section provides detailed guidance for implementing PseudoCell in clinical practice.

Keywords: Centroblast cell detection; deep convolutional neural network; follicular lymphoma; hard negative mining; morphological features.

Grants and funding

This work was supported in part by the New Discovery and Frontier Research under Grant (R016420005, Fund 3), Mahidol University and in part by the NSRF via the Program Management Unit for Human Resources & Institutional Development, Research and Innovation under Grant B38G670007.