Hao Feng   冯浩
Ph.D. University of Science and Technology of China (USTC) Email: haof@mail.ustc.edu.cn |
I am a Ph.D. at University of Science and Technology of China. My Ph.D. advisors are Prof. Houqiang Li and Prof. Wengang Zhou. I obtained my Bachelor degree (2018) in Xidian University. I’m currently working on Document AI at ByteDance.
My research interests include computer vision and document AI, and I am currently working on multi-modality understanding.
|
Hao Feng, Qi Liu, Hao Liu, Jingqun Tang, Wengang Zhou, Houqiang Li, Can Huang
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding Science China Information Sciences (SCIS), 2024 [pdf] [bibtex] |
|
Hao Feng, Wendi Wang, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser IEEE Transactions on Multimedia (TMM), 2024 [pdf] [code] [bibtex] |
|
Hao Feng, Keyi Zhou, Wengang Zhou, Yufei Yin, Jiajun Deng, Qi Sun, Houqiang Li
Recurrent Generic Contour-based Instance Segmentation with Progressive Learning IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024 [pdf] [code] [bibtex] |
|
Hao Feng, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li
Deep Unrestricted Document Image Rectification IEEE Transactions on Multimedia (TMM), 2023 [pdf] [code] [bibtex] |
|
Hao Feng, Wendi Wang, Jiajun Deng, Wengang Zhou, Li Li, and Houqiang Li
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning International Conference on Computer Vision (ICCV), 2023 [pdf] [code] [bibtex] |
|
Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, and Houqiang Li
Geometric Representation Learning for Document Image Rectification European Conference on Computer Vision (ECCV), 2022 [pdf] [code] [bibtex] |
|
Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, and Houqiang Li
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction ACM International Conference on Multimedia (ACM MM), Oral, 2021 [pdf] [code] [bibtex] |
|
Wendi Wang, Hao Feng, Wengang Zhou, Zhaokang Liao, Houqiang Li
Model-Aware Pre-Training for Radial Distortion Rectification IEEE Transactions on Image Processing (TIP), 2023 [pdf] [code] [bibtex] |
|
Shaokai Liu, Hao Feng, Wengang Zhou
Rethinking Supervision in Document Unwarping: A Self-consistent Flow-free Approach IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023 [pdf] [bibtex] |
|
Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, Houqiang Li
Sign Language Translation with Iterative Prototype International Conference on Computer Vision (ICCV), 2023 [pdf] [bibtex] |
|
Yonghui Wang, Wengang Zhou, Hao Feng, Li Li, Houqiang Li
Progressive Recurrent Network for Shadow Removal Computer Vision and Image Understanding (CVIU), 2023 [pdf] [bibtex] |
|
Hao Feng, Zijian Wang, Jingqun Tang, Jinghui Lu, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang
UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding [pdf] [bibtex] |
|
Hao Feng, Wengang Zhou, Jiajun Deng, Qi Tian, Houqiang Li
DocScanner: Robust Document Image Rectification with Progressive Learning [pdf] [code] [bibtex] |
|
Bozhi Luan, Hao Feng, Hong Chen, Wengang Zhou, Houqiang Li
TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding [pdf] [code] [bibtex] |
Invited Reviewer for journals and conferences, including TMM, TCSVT, CVPR, ICCV, ECCV, AAAI, ACM MM, ACL, etc. |