publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
- IEEE AccessCap4Bridge: Caption-Guided Cross-Modal Contextualization with Stochastic Augmentation for Text-Video RetrievalIEEE Access, 2026
- CVPR
Follow the Saliency: Supervised Saliency for Retrieval-augmented Dense Video CaptioningarXiv preprint arXiv:2603.11460, 2026Accepted at CVPR 2026 - CVPR
SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video CaptioningarXiv preprint arXiv:2603.05437, 2026Accepted at CVPR 2026
2025
- EMNLP
Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video CaptioningIn Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025 - ACM MM
SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image CaptioningIn Proceedings of the 33rd ACM International Conference on Multimedia, 2025