Xiantao Hu (胡现韬)


Xiantao Hu

Ph.D. Student

Pattern Computing and Application Laboratory (PCA Lab), Nanjing University of Science and Technology (NJUST), China.

[Google Scholar]

E-mail: xiantaohu@njust.edu.cn, huxiantao481@gmail.com  


Research Interests

  • Visual object tracking

  • Visual multimodal learning


News

  • Dec. 2024: One paper was accepted by AAAI 2025.


Education & Experience

  • Ph.D. September 2024 -- Present
    PCA Lab, School of Computer Science and Technology, Nanjing University of Science and Technology (NJUST), China.
    Advised by Prof. Jian Yang and Associate Prof. Ying Tai.

  • M.E. September 2021 -- June 2024
    School of Computer Science and Technology, Guangxi Normal University (GXNU), China.
    Advised by Prof. Bineng Zhong.

  • B.E. September 2017 -- June 2021
    School of Computer Science and Technology, Guangxi Normal University (GXNU), China.


Publications

Conference Papers (* indicates equal contribution, # indicates corresponding author)

  1. Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking. [Paper] [Code]
    Xiantao Hu, Ying Tai#, Xu Zhao, Chen Zhao, Zhenyu Zhang, Jun Li, Bineng Zhong, Jian Yang#.
    AAAI Conference on Artificial Intelligence (AAAI), Oral, 2025.

Journal Papers (* indicates equal contribution, # indicates corresponding author)

  1. Adaptive Perception for Unified Visual Multi-modal Object Tracking.
    Xiantao Hu, Bineng Zhong#, Qihua Liang, Liangtao Shi, Zhiyi Mo, Ying Tai, Jian Yang.
    IEEE Transactions on Artificial Intelligence (TAI), 2025.

  2. Mamba Adapter: Efficient Multi-Modal Fusion for Vision-Language Tracking.
    Liangtao Shi, Bineng Zhong, Qihua Liang, Xiantao Hu, Zhiyi Mo, Shuxiang Song.
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025.

  3. SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding.
    Liangtao Shi, Ting Liu, Xiantao Hu, Yue Hu, Quanjun Yin, Richang Hong.
    IEEE Transactions on Multimedia (TMM), 2025.

  4. Towards Modalities Correlation for RGB-T Tracking.
    Xiantao Hu, Bineng Zhong#, Qihua Liang, Shengping Zhang, Ning Li, Xianxian Li.
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024.

  5. Transformer Tracking via Frequency Fusion.
    Xiantao Hu, Bineng Zhong#, Qihua Liang, Shengping Zhang, Ning Li, Xianxian Li, Rongrong Ji.
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023.


Academic Services

Conference Reviewer
  • Winter Conference on Applications of Computer Vision (WACV)

  • International Joint Conference on Artificial Intelligence (IJCAI)

Journal Reviewer
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  • IEEE Transactions on Multimedia (TMM)


Honors and Awards

  • 11/2024 – Third in ICPR 2024 Multi-Modal Visual Pattern Recognition Challenge – Action Recognition Track, and received the "Best Research Paper" award
  • 12/2023 – Third in 2023 Yangtze River Delta (Wuhu) Artificial Intelligence Competition – Non-motorized vehicle recognition without helmet based on object detection
  • 10/2023 – National Graduate Scholarship
  • 08/2023 – Third in IFlytek Developer Competition – Stable Diffusion Identification Challenge
  • 07/2023 – Second in ICCV2023 Workshop challenge – VisDrone2023
  • 06/2023 – Second in CVPR2023 AVA Accessibility Vision and Autonomy Challenge – Segmentation Track
  • 06/2023 – Third in CVPR2023 AVA Accessibility Vision and Autonomy Challenge – Keypoint Track
  • 05/2023 – Third in FGVC10 Workshop at CVPR – PlantTraits2023
  • 07/2022 – First in ECV2022 – Outdoor Billboard Recognition