Research
I'm supervised by Prof. Thomas H. Lee in Stanford. During my undergrad, I was supervised by Prof. Wankou Yang in Southeast Univ. Also, I'm glad to had opportunities to work with Prof. Mingming Gong in UniMelb, Prof. Pengtao Xie in UC San Diego, Prof. Bo Zhao in Shanghai Jiao Tong Univ., Prof. Hao Dong in Peking Univ., Prof. Yangang Wang and Prof. Songlin Du in SEU.
OscNet: Machine Learning on CMOS Oscillator Networks
Wenxiao Cai, Thomas. H. Lee
[arXiv full length paper]
SpatialBot: Precise Depth Understanding With Vision Language Models
Wenxiao Cai, ..., Hao Dong*, Bo Zhao*
ICRA 2025 [arXiv full length paper], [GitHub].
Object-level Geometric Structure Preserving for Natural Image Stitching
Wenxiao Cai, Wankou Yang*
AAAI 2025 [arXiv full length paper], [GitHub]
UAV Image Stitching by Estimating Orthographic Projection with RGB Cameras
Wenxiao Cai, Songlin Du*, Wankou Yang*
Journal of Visual Communication and Image Representation, [JVCI]
VDD: Varied Drone Dataset for Semantic Segmentation
Wenxiao Cai, ..., Wankou Yang*
Journal of Visual Communication and Image Representation,
[arXiv], [GitHub], [JVCI Available Soon]
Probabilistic Modeling of Disparity Uncertainty for Robust and Efficient Stereo Matching
Wenxiao Cai, Dongting Hu, Ruoyan Yin, Jiankang Deng, Huan Fu, Wankou Yang*, Mingming Gong*
[arXiv]
Knowledge NeRF: Few-shot Novel View Synthesis for Dynamic Articulated Objects
Wenxiao Cai, ..., Junming Leo Chen, Yangang Wang*
[arXiv], [GitHub]