Xiaodan Zhang  张晓丹

Associate Professor, Master's Supervisor
College of Computer Science, Beijing University of Technology
Office: Rm M812, Science Building
Email: zhangxiaodan@bjut.edu.cn

Google Scholar | GitHub

Biography

I am an Associate Professor in the College of Computer Science and the Beijing Institute of Artificial Intelligence at Beijing University of Technology. I obtained my Ph.D. from the University of Chinese Academy of Sciences in 2018 under the supervision of Prof. Jianbin Jiao and Prof. Qixiang Ye. I also obtained a joint Ph.D. from the City University of Hong Kong in 2018 under the supervision of Prof. Qingxiong Yang and Prof. Rynson W.H. Lau. My research focuses on core theoretical and applied technologies in Artificial Intelligence, particularly Computer Vision and Natural Language Processing. I have published papers in international journals and conferences including IEEE TIP, Pattern Recognition, AAAI, ACM MM, EMNLP, and COLING. I have also led and contributed to several National Natural Science Foundation projects, and in 2022 I received Beijing's funding for high-level overseas returnees.

Research Interests: Multimodal Large Language Models, Medical Image Processing, and Natural Language Processing

News

  • [2025.01] One paper was accepted by IEEE TIP.
  • [2024.12] One paper was accepted by AAAI 2025 (CCF-A).
  • [2024.09] One paper was accepted by EMNLP 2024 Findings.
  • [2024.08] We proposed an LLM-based gaming AI (Llama_Dou) that finished as runner-up at the Chinese Collegiate Computer Gaming AI Contest.
  • [2024.05] One paper was accepted by IEEE Transactions on Emerging Topics in Computational Intelligence (JCR Q1).
  • [2024.02] One paper was accepted by Multimedia Systems (JCR Q1).
  • [2024.01] One paper was accepted by Multimedia Systems (JCR Q1).
  • [2023.11] One paper was accepted by Computers in Biology and Medicine (JCR Q1).
  • [2023.10] One paper was accepted to EMNLP 2023 (Oral Presentation).
  • [2023.06] One paper was nominated for Best Paper at ChinaMM 2023.
  • [2023.05] Two papers were accepted to ChinaMM 2023.
  • [2022.07] One paper was accepted to COLING 2022 (Oral Presentation).
  • [2021.10] One paper was accepted to BIBM 2021 (Oral Presentation).

Publications [Google Scholar]

☆ Medical Report Generation

MEPNet: Medical Entity-balanced Prompting Network for Brain CT Report Generation.

Xiaodan Zhang, Yanzhao Shi, Junzhong Ji, Chengxin Zheng, and Liangqiong Qu.
AAAI Conference on Artificial Intelligence
 AAAI 2025  | Paper  | Bibtex 

See Detail Say Clear: Towards Brain CT Report Generation via Pathological Clue-driven Representation Learning.

Chengxin Zheng, Junzhong Ji, Yanzhao Shi, Xiaodan Zhang*, Liangqiong Qu.
The 2024 Conference on Empirical Methods in Natural Language Processing (Findings)
 EMNLP 2024 Findings  | Paper  | Bibtex 

Co-occurrence Relationship Driven Hierarchical Attention Network for Brain CT Report Generation.

Xiaodan Zhang, Shixin Dou, Junzhong Ji, Ying Liu, Zheng Wang.
IEEE Transactions on Emerging Topics in Computational Intelligence
 TETCI 2024  | Paper  | Bibtex 

Weakly Guided Attention Model with Hierarchical Interaction for Brain CT Report Generation.

Xiaodan Zhang, Sisi Yang, Yanzhao Shi, Junzhong Ji*, Ying Liu, Zheng Wang and Huimin Xu.
Computers in Biology and Medicine
 Comput Biol Med 2023  | Paper  | Bibtex 

Granularity Matters: Pathological Graph-driven Cross-modal Alignment for Brain CT Report Generation.

Yanzhao Shi, Junzhong Ji, Xiaodan Zhang*, Liangqiong Qu* and Ying Liu.
The 2023 Conference on Empirical Methods in Natural Language Processing (Oral Long Paper)
 EMNLP 2023  | Paper  | Bibtex 

Prior Tissue Knowledge-Driven Contrastive Learning for Brain CT Report Generation.

Yanzhao Shi, Junzhong Ji, Xiaodan Zhang*, Ying Liu, Zheng Wang and Huimin Xu.
Multimedia Systems
 Multimedia Syst 2024  | Paper  | Bibtex 

GHCL: Gaussian Heuristic Curriculum Learning for Brain CT Report Generation.

Qingya Shen, Yanzhao Shi, Xiaodan Zhang*, Junzhong Ji, Ying Liu and Huimin Xu.
Multimedia Systems
 Multimedia Syst 2024  | Paper  | Bibtex 

Cross-modal Contrastive Attention Model for Medical Report Generation.

Xiao Song, Xiaodan Zhang*, Junzhong Ji, Ying Liu, Pengxu Wei.
In Proceedings of the 29th International Conference on Computational Linguistics (Oral Long Paper)
COLING 2022 | Paper  | Bibtex 

Weakly Guided Hierarchical Encoder-Decoder Network for Brain CT Report Generation.

Sisi Yang, Junzhong Ji, Xiaodan Zhang*, Ying Liu, and Zheng Wang.
IEEE International Conference on Bioinformatics and Biomedicine (Oral Long Paper)
BIBM 2021 | Paper  | Bibtex 

☆ Image Captioning

Intra- and Inter-Head Orthogonal Attention for Image Captioning.

Xiaodan Zhang, Aozhe Jia, Junzhong Ji, Liangqiong Qu and Qixiang Ye.
IEEE Transactions on Image Processing
IEEE Trans. Image Process. 2025 | Paper  | Bibtex 

Relation Constraint Self-Attention for Image Captioning.

Junzhong Ji, Mingzhan Wang, Xiaodan Zhang*, Minglong Lei, Liangqiong Qu
Neurocomputing
Neurocomput. 2022 | Paper  | Bibtex 

Divergent-convergent Attention for Image Captioning.

Junzhong Ji, Zhuoran Du, Xiaodan Zhang*
Pattern Recognition
PR 2021 | Paper  | Bibtex 

Spatio-temporal Memory Attention for Image Captioning.

Junzhong Ji, Cheng Xu, Xiaodan Zhang*, Boyue Wang, Xinhang Song.
IEEE Transactions on Image Processing
TIP 2020 | Paper  | Bibtex 

Image Captioning via Semantic Element Embedding.

Xiaodan Zhang, Shengfeng He, Xinhang Song, Rynson W. H. Lau, Jianbin Jiao, Qixiang Ye.
Neurocomputing
Neurocomput. 2020 | Paper  | Bibtex 

Keyword-driven Image Captioning via Context-dependent Bilateral LSTM.

Xiaodan Zhang, Shengfeng He, Xinhang Song, Pengxu Wei, Shuqiang Jiang, Qixiang Ye, Jianbin Jiao, Rynson W.H. Lau.
Proceedings of IEEE International Conference on Multimedia and Expo
ICME 2017 | Paper  | Bibtex 

Rich Image Description Based on Regions.

Xiaodan Zhang, Xinhang Song, Shuqiang Jiang, Qixiang Ye, Jianbin Jiao.
Proceedings of the 23rd ACM International Conference on Multimedia
ACM MM 2015 | Paper  | Bibtex 

☆ Other Research

Multi-scale Superpixel Based Hierarchical Attention Model for Brain CT Classification.

Xiao Song, Xiaodan Zhang, Junzhong Ji, Ying Liu
Journal of Visual Communication and Image Representation
JVCIR 2023 | Paper  | Bibtex 

Generic Attention-model Explainability by Weighted Relevance Accumulation.

Yiming Huang, Aozhe Jia, Xiaodan Zhang*, and Jiawei Zhang
In Proceedings of the 5th ACM International Conference on Multimedia in Asia
MM Asia 2023 | Paper  | Bibtex 

Awards and Honors

  • Beijing funding for high-level overseas returnees, 2022
  • International Postdoctoral Exchange Fellowship Program (Talent-Introduction Program), 2018

Laboratory

Our lab fosters a positive atmosphere where everyone collaborates harmoniously. We frequently engage in academic discussions, encouraging creativity and teamwork. Our lab currently has several NVIDIA deep learning servers. We welcome students from relevant fields such as computer science and mathematics who are eager to engage in research. Preference will be given to those with a solid foundation in deep learning and strong programming skills. Please indicate your CET-6, IELTS, or TOEFL scores in your application.

Student 1

Yanzhao Shi (时彦钊)

Nickname: 小时

Medical Report Generation

yanzhaoshi0927@outlook.com

Student 2

Qingya Shen (申青雅)

Nickname: 大师姐

Medical Report Generation

shenqingya10010@163.com

Student 3

Chengxin Zheng (郑诚信)

Nickname: 郑老师

Medical Multimodal LLM

Chaunxey_zeta@outlook.com

Student 4

Yuanzhen Guo (郭元祯)

Interpretability in MLLM

957004627@qq.com

Student 5

Wenhong Song (宋文红)

Nickname: 大红

Medical VQA

2969719964@qq.com

Student 6

Hua Wang (王华)

Nickname: 华哥

Natural Language Processing

liumengyan23@gmail.com

Student 7

Wanyu Zhang (张婉玉)

Natural Language Processing

wanyuxiaobai@163.com

Student 8

Xinyue Li (李欣月)

Natural Language Processing

lixinyuelxyyy@163.com

Our lab is in a phase of rapid development, and we hope everyone can strive for progress together!

Services

Reviewer: