Poster + Paper
18 June 2024 Data augmentation via video frame interpolation: an application to cardiac ultrasound videos
Lucas Cedric Cervantes-Beutelspacher, Boris Escalante-Ramírez, Jimena Olveres
Author Affiliations +
Conference Poster
Abstract
In Machine Learning projects effective preparation of training datasets is essential. When dealing with image datasets, especially limited ones, data augmentation techniques play a crucial role in increasing the dataset size and diversity. These techniques, spanning from basic to deformable to deep learning augmentations, offer varying effects from simple noise addition to generating entirely new synthetic images.

In this study, we propose an alternative approach to augmenting a dataset utilizing a technique found in video processing called Video Frame Interpolation (VFI). Unlike traditional methods, with VFI we aim to produce images that are neither mere variations of the original images nor entirely synthetic ones, instead providing a middle ground where the images generated are synthetic temporal variations of the original ones. We propose to use pre-trained VFI networks in conjunction with Transfer Learning to develop specialized models capable of interpolating medical images with enough precision so that a medical specialist would deem them clinically plausible.

For this study, we worked with a model developed by Niklaus et al., on cardiac ultrasound videos and images alongside a seasoned cardiologist to provide an expert evaluation on the viability of this technique. Our findings indicate that the results produced by our fine-tuned model can indeed be considered realistic, and depending on the use case, the results of the pre-trained model can also be useful.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Lucas Cedric Cervantes-Beutelspacher, Boris Escalante-Ramírez, and Jimena Olveres "Data augmentation via video frame interpolation: an application to cardiac ultrasound videos", Proc. SPIE 12998, Optics, Photonics, and Digital Technologies for Imaging Applications VIII, 129981H (18 June 2024); https://doi.org/10.1117/12.3017618
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Interpolation

Machine learning

Echocardiography

Image quality

Data modeling

Convolution

Back to Top