Transformer-based multimodal feature enhancement networks for multimodal depression detection integrating video, audio and remote photoplethysmograph signals 范慧婷, ZhangXingnan, 徐盈盈, FangJiangxiong, 张石清, 赵小明, YuJun 4月 1, 2024 DOI