I am a final-year Ph.D. student at Renmin University of China, advised by Prof. Qin Jin. My research focuses on Vision and Language, involving cross-modal generation, video understanding, and automatic evaluation. Currently, I am exploring book-level story evaluation and generation.
News
- 2024.09: Our survey paper What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation was released. Visit our GitHub page for a quick review.
- 2024.09: Completed my research internship at Alibaba Group. It was a pleasure to work with my mentor, Tiezheng Ge.
- 2024.08: Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline was accepted by ACL 2024.
- 2023.10: Visual captioning at will: Describing images and videos guided by a few stylized sentences was accepted by ACM MM 2023.
- 2023.07: Attractive storyteller: Stylized visual storytelling with unpaired text was accepted by ACL 2023.
Publications

Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline
Dingyi Yang, Chunru Zhan, Ziheng Wang, Biao Wang, Tiezheng Ge, Bo Zheng, Qin Jin
- We explore the task of Synchronized Video Storytelling, generating text to narrate the ongoing video scenes and serve as useful voiceovers.
- We release a new benchmark in the advertising domain called E-SyncVidStory.
- We propose an effective framework named VideoNarrator, which simultaneously supports storyline generation and controllable video story generation.

What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation
Dingyi Yang, Qin Jin
- We provide an in-depth discussion of the challenges in evaluating various story generation tasks.
- We analyze standardized criteria, addressing issues of inconsistent definitions and vague expressions.
- We systematically review traditional, LLM-based, and collaborative evaluation methods, highlighting their strengths and limitations in the context of story evaluation.
- We suggest potential future research directions, extending from story evaluation to general evaluations.

Visual captioning at will: Describing images and videos guided by a few stylized sentences
Dingyi Yang, Hongyu Chen, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin
- We explore Few-Shot Stylized Visual Captioning, which aims to generate captions in any desired style, using only a few examples as guidance during inference, without requiring further training.
- We propose an effective framework that handles multiple styles with a single model. It extracts the style representation from stylized samples, and aligns visual information to generate stylized captions.

Attractive storyteller: Stylized visual storytelling with unpaired text
Dingyi Yang, Qin Jin
- We explore Stylized Visual Storytelling, generating stories with complicated styles for image sequences.
- We propose StyleVSG, which applies style-specific parameters to control text style and incorporates a memory module to maintain context coherence.
- To balance style accuracy and visual relevance, we employ multi-task training using our pseudo {images-stylized story} pairs and {images-factual story} pairs.

- An experimental study of text representation methods for cross-site purchase preference prediction using the social text data. Ting Bai, Hong-Jian Dou, Wayne Xin Zhao, Dingyi Yang, Ji-Rong Wen. Journal of Computer Science and Technology, 32(4): 828-842, 2017.
Projects
- Map Visualization Toolkit: A comprehensive toolkit for map data processing, visualization, and interaction. Role: Creator and sole developer.
- COVID-19 Visualization and Visual Analytics: A series of data visualization projects that help people understand the epidemic. Role: Main developer; designed and set up the map visualization websites.
Honors and Awards
- 2024.09 National Scholarship (Top 0.2% in China)
- 2022.09 and 2023.09 The First Prize Scholarship.
- 2019.09 Academic Excellence Scholarship.
- 2016.09 The Second Prize Sa-Shixuan Elite Fund Scholarship.
- 2016.09 The First Prize Scholarship.
Education
- 2021.09 - 2024.10 (now), Renmin University of China. Supervisor: Prof. Qin Jin.
- 2018.09 - 2021.06, Peking University. Supervisor: Prof. Xiaoru Yuan.
- 2014.09 - 2018.06, Renmin University of China. Advisors: Prof. Yongcai Wang and Prof. Xin Zhao.
Invited Talks
- 2024.09.28, "AI+X" National Forum for Outstanding Doctoral Students.
Experiences
- 2022 - 2024, Research Intern, Alibaba Group, Beijing, China.
- 2019 - 2021, Winter Olympic Visualization Project, Beijing Municipal Commission of Transport, Beijing, China.
- 2020, COVID-19 Visualization WeChat Mini Program, Tencent HealthCare, Beijing, China.