MinJu Jeon

Hanyang University · Multimodal AI Lab · AI Researcher

At ICCV 2025, Honolulu, Hawaii

Hi! I'm MinJu Jeon, a Master's student in Data Science at Hanyang University, advised by Prof. Dong-Jin Kim, and currently a Research Intern on the Voice Tech Team at Naver Cloud.
My research centers on 🧠 Multimodal learning, spanning 🎬 Vision-language understanding (dense video captioning, text-video retrieval) and 🗣️ Multilingual speech (G2P, text-to-speech). I'm also drawn to ⚙️ Data-centric methods that improve model robustness across modalities and languages.

News

June 2026: Joining LG AI Research as a Research Intern at the EXAONE Lab (Incoming)
Mar 2026: Cap4Bridge accepted at IEEE Access 2026
Feb 2026: Two papers accepted at CVPR 2026
Dec 2025: Started research internship at Naver Cloud, Voice Tech Team
Aug 2025: Sali4Vid accepted at EMNLP 2025 (Long, Main)

Background

June 2026 – Incoming
Research Intern, LG AI Research · EXAONE Lab
Incoming
Dec 2025 – Present
Research Intern, Naver Cloud · Voice Tech Team
Multilingual G2P & robust TTS for non-canonical text
Sep 2024 – Present
M.S. in Data Science, Hanyang University
Mar 2020 – Aug 2024
B.S. in Industrial Engineering, Hanyang University

Selected Publications

  1. EMNLP
    Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
    MinJu Jeon, Si-Woo Kim, Ye-Chan Kim, HyunGee Kim, and Dong-Jin Kim
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
    Dense Video Captioning · Data-Centric
  2. IEEE Access
    Cap4Bridge: Caption-Guided Cross-Modal Contextualization with Stochastic Augmentation for Text-Video Retrieval
    MinJu Jeon, Hyungee Kim, Si-Woo Kim, Youngtaek Oh, Soeun Lee, and Dong-Jin Kim
    IEEE Access, 2026
    Text-Video Retrieval · Data-Centric
  3. ACM MM
    SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning
    Si-Woo Kim, MinJu Jeon, Ye-Chan Kim, Soeun Lee, Taewhan Kim, and Dong-Jin Kim
    In Proceedings of the 33rd ACM International Conference on Multimedia, 2025
    Zero-shot Captioning · Data-Centric
  4. CVPR
    Follow the Saliency: Supervised Saliency for Retrieval-augmented Dense Video Captioning
    Seunghee Choi, MinJu Jeon, Hyunwoo Oh, Jihwan Lee, and Dong-Jin Kim
    arXiv preprint arXiv:2603.11460, 2026
    Accepted at CVPR 2026
    Dense Video Captioning
  5. CVPR
    SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning
    Ye-Chan Kim, SeungJu Cha, Si-Woo Kim, MinJu Jeon, Hyungee Kim, and Dong-Jin Kim
    arXiv preprint arXiv:2603.05437, 2026
    Accepted at CVPR 2026
    Dense Video Captioning