[1] Shiping Ge, et al. Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning. AAAI 2025(CCF-A,Oral)
[2] Shiping Ge, et al. Short Video Ordering via Position Decoding and Successor Prediction. SIGIR 2024(CCF-A)
[3] Shiping Ge, et al. Learning Event-Specific Localization Preferences for Audio-Visual Event Localization. ACM MM 2023(CCF-A)
[4] Shiping Ge, et al. Learning Robust Multi-Modal Representation for Multi-Label Emotion Recognition via Adversarial Masking and Perturbation. WWW 2023(CCF-A)
[5] Shiping Ge, et al. Fine-Grained Alignment Network for Zero-Shot Cross-Modal Retrieval. TOMM 2025(CCF-B)
[6] Shiping Ge, et al. Dual-input attention network for automatic identification of detritus from river sands. Computers & Geosciences (SCI Q2)