Yawen Zeng


moc.liamg@11gnezneway :liamE

Biography

I am a Algorithm Engineer at ByteDance AI Lab. I obtained my M.S. from the College of Computer Science and Electronic Engineering at Hunan University in 2022. Prior to this, I gained valuable experience working with Tencent WeChat. My research interests are focused on Large Vision Language Models, General Agent, and Multi-Modal Applications. I have had the privilege of publishing my work in esteemed academic conferences such as CVPR, AAAI, SIGIR, ACM MM, EMNLP, TNNLS, and TMM.

News

more news

Selected Publications

(*: equal contribution, ☨: correspondence)
VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool
Yan Wang, Yawen Zeng☨, Jingsheng Zheng, Xiaofen Xing, Jin Xu, Xiangmin Xu
ACL, 2024 (workshop)
HindRec: Aligning User Preferences for Recommendation via Hindsight Fine-tuning
Yawen Zeng*, Huanwen Wang*, Lingyu Chen, Wenshu Chen, Ran Chen, Hao Chen
KDD, 2024 (workshop)
RetrievalMMT: Retrieval-Constrained Multi-Modal Prompt Learning for Multi-Modal Machine Translation
Yan Wang*, Yawen Zeng*, Junjie Liang, Xiaofen Xing, Jin Xu, Xiangmin Xu
ICMR, 2024
FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model
Xiangyu Li, Xinjie Shen, Yawen Zeng, Xiaofen Xing, Jin Xu
WWW, 2024
Energy-based Automated Model Evaluation
Ru Peng, Heming Zou, Haobo Wang, Yawen Zeng, Zenan Huang, Junbo Zhao
ICLR, 2024
Multi-Prompts Learning with Cross-Modal Alignment for Attribute-based Person Re-Identification
Yajing Zhai*, Yawen Zeng*, Zhiyong Huang, Zheng Qin, Xin Jin, Da Cao
AAAI, 2024
Temporally Language Grounding with Multi-modal Multi-Prompt Tuning
Yawen Zeng, Ning Han, Keyu Pan, Qin Jin
TMM, 2023
Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models
Keyu Pan, Yawen Zeng
arXiv preprint, 2023
RewardTLG: Learning to Temporally Language Grounding from Flexible Reward
Yawen Zeng, Keyu Pan, Ning Han
SIGIR, 2023
Multi-Modal Knowledge Hypergraph for Diverse Image Retrieval
Yawen Zeng, Qin Jin, Tengfei Bao, Wenfeng Li
AAAI, 2023
Contrastive Topic-enhanced Network for Video Captioning
Yawen Zeng, Yiru Wang, Dongliang Liao, Gongfu Li, Jin Xu, Bo Liu, Xiangmin Xu, Hong Man
ESWA, 2023
BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval
Ning Han, Yawen Zeng, Chuhao Shi, Guangyi Xiao, Hao Chen, Jingjing Chen
TOMM, 2023
Better Sign Language Translation with Monolingual Data
Ru Peng, Yawen Zeng, Junbo Zhao
arXiv preprint, 2023
Keyword-Based Diverse Image Retrieval with Variational Multiple Instance Graph
Yawen Zeng, Yiru Wang, Dongliang Liao, Gongfu Li, Weijie Huang, Jin Xu, Da Cao, Hong Man
TNNLS, 2022
Point Prompt Tuning for Temporally Language Grounding
Yawen Zeng
SIGIR, 2022
Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation
Ru Peng*, Yawen Zeng*, Junbo Zhao
EMNLP, 2022
HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment
Ru Peng*, Yawen Zeng*, Junbo Zhao
ICMR, 2022
TriReID: Towards Multi-Modal Person Re-Identification via Descriptive Fusion Model
Yajing Zhai*, Yawen Zeng*, Da Cao, Shaofei Lu
ICMR, 2022
Fine-grained cross-modal alignment network for text-video retrieval
Ning Han, Jingjing Chen, Guangyi Xiao, Hao Zhang, Yawen Zeng, Hao Chen
ACM MM, 2021
Moment is Important: Language-Based Video Moment Retrieval via Adversarial Learning
Yawen Zeng, Da Cao, Hanling Zhang, Jiao Xu, Zheng Qin
TOMM, 2021
Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval
Yawen Zeng, Da Cao, Xiaochi Wei, Meng Liu, Zhou Zhao, Zheng Qin
CVPR, 2021
Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization
Da Cao, Yawen Zeng, Xiaochi Wei, Liqiang Nie, Richang Hong, Zheng Qin
ACM MM, 2020
STRONG: Spatio-Temporal Reinforcement Learning for Cross-Modal Video Moment Localizationn
Da Cao, Yawen Zeng, Meng Liu, Xiangnan He, Meng Wang, Zheng Qin
ACM MM, 2020

Research Experience

2022.07 - till now AI Lab, ByteDance
Algorithm Engineer
Mentored by Hang Li, Tengfei Bao, Feng Zhang
  • Multi-Modal Knowledge Graph and Large Vision Language Models.
  • 2020.10 - 2022.06 WeChat, Tencent
    Research Intern
    Mentored by Jin Xu
  • Multi-Modal Retrieval and Visual Question Answering.
  • 2018.11 - 2022.06 College of Computer Science and Electronic Engineering, Hunan University
    Master Student
    Supervised by Da Cao
  • Temporally Language Grounding and Cross-Modal Retrieval.
  • Academic Service

    Program Committee Member for EMNLP.
    Conference Reviewer for CVPR, ICCV, ECCV, ACL, EMNLP, AAAI, ACM MM.

    Last updated on Jun 2024.