Paper
19 July 2024 A multimodal feature fusion model for depression diagnosis based on BiLSTM+ViT
Muyao Li, Yan Xing, Yitan He, Yuhe Wang, Zhouyang Yu, Xiaoyu Wang, Pingchuan Linghu
Author Affiliations +
Proceedings Volume 13213, International Conference on Image Processing and Artificial Intelligence (ICIPAl 2024); 132133O (2024) https://doi.org/10.1117/12.3035205
Event: International Conference on Image Processing and Artificial Intelligence (ICIPAl2024), 2024, Suzhou, China
Abstract
A thorough diagnosis of depression, a mental illness that affects many people worldwide, must be made in light of the patient's medical background, present symptoms, and pertinent examination findings. In recent years, scholars have increasingly favored machine learning algorithms for depression diagnosis or prediction models. However, achieving higher prediction accuracy in prediction models remains challenging when relying solely on one modality. This research suggests a multi-modal feature fusion model for predicting depression tendency that is based on bi-directional LSTM and vision transformer (ViT). The model demonstrates an accuracy of 70.00%, surpassing that of single-modal depression tendency prediction models and presenting a novel approach to depression tendency prediction.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Muyao Li, Yan Xing, Yitan He, Yuhe Wang, Zhouyang Yu, Xiaoyu Wang, and Pingchuan Linghu "A multimodal feature fusion model for depression diagnosis based on BiLSTM+ViT", Proc. SPIE 13213, International Conference on Image Processing and Artificial Intelligence (ICIPAl 2024), 132133O (19 July 2024); https://doi.org/10.1117/12.3035205
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature fusion

Visual process modeling

Feature extraction

Data modeling

Transformers

Image processing

Visualization

Back to Top