I am currently an Associate Professor at the School of Computer Science and Technology, Hangzhou Dianzi University, and a member of the Brain–Computer Collaborative Intelligence Laboratory.

I received my Ph.D. degree in Information and Communication Engineering from the College of Information Science and Electronic Engineering, Zhejiang University in 2025. During my doctoral studies, I worked in the Machine Vision and Navigation Laboratory (机器视觉与导航实验室) led by Prof. Zhiyu Xiang (项志宇), under the supervision of Assoc. Prof. Xiaojin Gong (龚小谨).

My current research interests include computer vision, anomaly detection, brain–computer interfaces, large language models, and the applications of artificial intelligence in healthcare.

*Opening!*I am currently recruiting Master’s students (2026Fall). Whether your strengths lie in software or hardware, if you are passionate about tackling challenging problems and aspire to make a difference in the world, feel free to email me at sunshy@hdu.edu.cn without hesitation!

🔥 News

  • 2025.02: 🎉 Our paper about Delving into Instance Modeling for Weakly Supervised Video Anomaly Detection has been accepted to IEEE Transactions on Circuits and Systems for Video Technology (IF=8.3).
  • 2024.07: 🎉 Our paper about Text-Driven Scene-Decoupled (TDSD) Weakly Supervised Video Anomaly Detection has been accepted to ACM Multimedia (MM) 2024.
  • 2024.07: 🎉 Our paper about Event-Driven Weakly Supervised Video Anomaly Detection has been accepted to Image and Vision Computing.
  • 2024.05: We release the code of Multi-scale Bottleneck Transformer (MSBT), we encourage you to integrate the MSBT module into your framework to enhance the performance of feature fusion.
  • 2024.03: 🎉 Our paper about MultiScale Bottleneck Transformer (MSBT) has been accepted to IEEE International Conference on Multimedia and Expo (ICME) 2024.
  • 2023.03: 🎉 Our paper about Long-Short Temporal Co-Teaching (LSTC) for Weakly Supervised Video Anomaly Detection has been accepted to IEEE International Conference on Multimedia and Expo (ICME) 2023.
  • 2023.02: 🎉 Our paper about Hierarchical Semantic Contrast (HSC) for Scene-Aware Video Anomaly Detection has been accepted to IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023.

📝 Publications

🏹 Selected Projects

TMM 2025
sym

Enhancing Weakly Supervised Multimodal Video Anomaly Detection through Text Guidance
Shengyang Sun, Jiashen Hua, Junyi Feng, Xiaojin Gong

Project

  • TGMVAD is the first work to employ in-context learning (ICL) to the task of weakly-supervised multimodal video anomaly detection for the purpose of augmenting text samples.
TCSVT 2025
sym

Delving Into Instance Modeling for Weakly Supervised Video Anomaly Detection
Shengyang Sun, Jiashen Hua, Junyi Feng, Dongxu Wei, Baisheng Lai, Xiaojin Gong

  • This is the first work to our knowledge that deliberately explores the issue of anomaly contamination and dilution along the temporal dimension, which is overlooked by prior MIL-based weakly-supervised video anomaly detection works.
CVPR 2023
sym

Hierarchical semantic contrast for scene-aware video anomaly detection
Shengyang Sun, Xiaojin Gong

Code

  • We build a scene-aware reconstruction framework composed of scene-aware feature encoders and objectcentric feature decoders for anomaly detection.
  • We propose hierarchical semantic contrastive learning to regularize the encoded features in the latent spaces, making normal features more compact within the same semantic classes and separable between different classes.
ACM MM 2024
sym

TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection
Shengyang Sun, Jiashen Hua, Junyi Feng, Dongxu Wei, Baisheng Lai, Xiaojin Gong

  • This is the first work to address scene-dependent video anomaly detection under a weakly supervised setting.
ICME 2024
sym

Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection
Shengyang Sun, Xiaojin Gong

Code

  • We propose a multi-scale bottleneck transformer (MSBT)-based fusion module. It leverages a reduced number of bottleneck tokens to transmit gradually condensed information from one modality to another and a bottleneck token-based weighting scheme to weight the fused features, effectively addressing the information redundancy and modality imbalance problems.
ICME 2023
sym

Long-Short Temporal Co-Teaching for Weakly Supervised Video Anomaly Detection
Shengyang Sun, Xiaojin Gong

Code

  • We employ a co-teaching strategy to train short- and long-term networks alternatively and iteratively. The two networks can explicitly learn from abnormal events with varying durations.

🌟 Latest Works

📖 Educations

  • 2021.09 - 2025.03, Zhejiang University, Ph.D., Information and Communication Engineering
  • 2017.09 - 2021.06, National University of Defense Technology, M.E., Electronic Engineering
  • 2013.09 - 2017.06, National University of Defense Technology, B.S., Information Engineering

💬 Invited Talks

  • 2025.03, Lectures of Academic Climbing Program,Hangzhou Dianzi University, Hangzhou, China
  • 2021.12, The 91st Academic Forum of the College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China