Hao Feng - Homepage

Hao Feng 冯浩

Ph.D.

University of Science and Technology of China (USTC)

Email: haof@mail.ustc.edu.cn

About Me

Ph.D. from University of Science and Technology of China. My Ph.D. advisors are Prof. Houqiang Li and Prof. Wengang Zhou. I obtained my Bachelor degree (2018) in Xidian University. I’m currently working on Document AI and Multimodal LLM at ByteDance.

Publications

	Hao Feng, Shu Wei, Xiang Fei, Wei Shi, Yingdong Han, Lei Liao, Jinghui Lu, Binghong Wu, Qi Liu, Chunhui Lin, Jingqun Tang, Hao Liu, Can Huang Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting Annual Meeting of the Association for Computational Linguistics (ACL), 2025 [pdf] [code] [bibtex]
	Hao Feng, Wengang Zhou, Jiajun Deng, Qi Tian, Houqiang Li DocScanner: Robust Document Image Rectification with Progressive Learning International Journal of Computer Vision (IJCV), 2025 [pdf] [code] [bibtex]
	Hao Feng, Qi Liu, Hao Liu, Jingqun Tang, Wengang Zhou, Houqiang Li, Can Huang DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding SCIENCE CHINA Information Sciences (SCIS), 2024 [pdf] [bibtex]
	Hao Feng, Wendi Wang, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li DeepEraser: Deep Iterative Context Mining for Generic Text Eraser IEEE Transactions on Multimedia (TMM), 2024 [pdf] [code] [bibtex]
	Hao Feng, Keyi Zhou, Wengang Zhou, Yufei Yin, Jiajun Deng, Qi Sun, Houqiang Li Recurrent Generic Contour-based Instance Segmentation with Progressive Learning IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024 [pdf] [code] [bibtex]
	Hao Feng, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li Deep Unrestricted Document Image Rectification IEEE Transactions on Multimedia (TMM), 2023 [pdf] [code] [bibtex]
	Hao Feng, Wendi Wang, Jiajun Deng, Wengang Zhou, Li Li, and Houqiang Li SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning International Conference on Computer Vision (ICCV), 2023 [pdf] [code] [bibtex]
	Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, and Houqiang Li Geometric Representation Learning for Document Image Rectification European Conference on Computer Vision (ECCV), 2022 [pdf] [code] [bibtex]
	Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, and Houqiang Li DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction ACM International Conference on Multimedia (ACM MM), Oral, 2021 [pdf] [code] [bibtex]
	Wendi Wang, Hao Feng, Wengang Zhou, Zhaokang Liao, Houqiang Li Model-Aware Pre-Training for Radial Distortion Rectification IEEE Transactions on Image Processing (TIP), 2023 [pdf] [code] [bibtex]
	Shaokai Liu, Hao Feng, Wengang Zhou Rethinking Supervision in Document Unwarping: A Self-consistent Flow-free Approach IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023 [pdf] [bibtex]
	Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, Houqiang Li Sign Language Translation with Iterative Prototype International Conference on Computer Vision (ICCV), 2023 [pdf] [bibtex]
	Yonghui Wang, Wengang Zhou, Hao Feng, Li Li, Houqiang Li Progressive Recurrent Network for Shadow Removal Computer Vision and Image Understanding (CVIU), 2023 [pdf] [bibtex]

Manuscripts

	Hao Feng, Zijian Wang, Jingqun Tang, Jinghui Lu, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding [pdf] [bibtex]
	Bozhi Luan, Hao Feng, Hong Chen, Wengang Zhou, Houqiang Li TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding [pdf] [code] [bibtex]

Academic Services

Invited Reviewer for journals and conferences, including TMM, TCSVT, CVPR, ICCV, ECCV, AAAI, ACM MM, ACL, SIGGRAPH, etc.