Context-Based Masking for Spontaneous Venous Pulsations Detection
- Publisher:
- SPRINGER-VERLAG SINGAPORE PTE LTD
- Publication Type:
- Chapter
- Citation:
- AI 2023: Advances in Artificial Intelligence, 2024, 14471 LNAI, pp. 520-532
- Issue Date:
- 2024-01-01
Closed Access
Filename | Description | Size | |||
---|---|---|---|---|---|
venous.pdf | Published version | 3.94 MB |
Copyright Clearance Process
- Recently Added
- In Progress
- Closed Access
This item is closed access and not available.
Spontaneous retinal venous pulsations (SVP) serve as vital dynamic biomarkers, representing rhythmic changes of the central retinal vein observed at the optic disc region (ODR) within an eye. SVPs serve as vital dynamic biomarkers, representing rhythmic changes of the central retinal vein observed at the optic disc region (ODR) within an eye. In light of their crucial clinical role, automatic detection of SVPs from fundus videos has become an area of burgeoning research. However, the inherent eye movements and the variability in retinal video quality present significant challenges to direct SVP detection via existing deep learning models. In response, we devise a spatio-temporal context-based masking approach (STC Masking), exploiting the spatiotemporal characteristics of SVPs to enhance their detection in retinal videos. We first apply a spatio-temporal mask to clip the video into an ODR-focused video tube. Diverging from conventional masking with gray or black blocks, we then employ a context masking method which using the original pixel values from video frames as the mask fill-in. The context mask map temporally transforms the dynamic video tubes into static tubes, thus changing the pulsation status of SVPs. Correspondingly, we adjust the SVP video labels based on the changing extent of masked regions to avoid ambiguity in data labelling. This innovative strategy provides more vivid videos which are similar to unmasked videos pixel-wise but having contrast semantics in SVP presenting regions. This enables network to capture the most discriminating regions through spatio-temporal variations, allowing explicit detection on SVP existence in the video. Our experiments illustrate the efficacy of our STC masking strategy, outperforming baseline methods. This work, thereby, underscores the potential of grid context-based masking for more accurate SVP detection in retinal video analysis.
Please use this identifier to cite or link to this item: