This list is not guaranteed to be up to date or comprehensive.
ICLR
MICCAI
- [link] [code] [M3AE] [22]
Multi-modal Masked Autoencoders for Medical Vision-and-Language Pre-training
- [link] [code] [LMI] [21]
Multimodal Representation Learning via Maximization of Local Mutual Information
ICCV
- [link] [code] [GLoRIA] [21]
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition
ECCV
- [link] [code] [BioViL] [22]
Making the Most of Text Semantics to Improve Biomedical Vision–Language Processing
NeurIPS
- [link] [code] [MGCA] [22]
Multi-Granularity Cross-modal Alignment for Generalized Medical Visual Representation Learning
MLHC
- [link] [code] [ConVIRT] [22]
Contrastive Learning of Medical Visual Representations from Paired Images and Text
Nature Machine Intelligence
- [link] [code] [REFERS] [22]
Generalized radiograph representation learning via cross-supervision between images and free-text radiology reports
medRxiv
- [link] [code] [MedKLIP] [23]
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training