The phote was token at Las Vegas in 2022.

Chen Zhenfang (陈振方)

Email: chenzhenfang2013 [at]
Address: 314 Main Street, Cambridge, MA, USA, 02142.

 Github   Linkedin  DBLP

About Me

I am a researcher at MIT-IBM Watson AI Lab in Cambridge, MA, USA. I received my Ph.D. degree from the Department of Computer Science at The University of Hong Kong, where I was a member of the Computer Vision Lab, advised by Prof. Kenneth K.Y. Wong. Prior to that, I got my B.Sc. from Sun Yat-sen University in 2016.

My interests are centered around machine learning and its applications to computer vision and natural language processing. My ultimate research goal is to develop an AI system that can perceive and reason about the physical world, and communicate with humans in natural language.


Recent Research highlights

Interactive Prompting for Reasoning

ICLR 2022: ComPhy

ICLR 2023: PG-TD

ICLR 2021: Dynamic Concept Learner

CoRL22: Embodied Concept Learner

NeurIPS22: S3-NeRF

Publications [Bibtex]

Planning with Large Language Models for Code Generation

Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, and Chuang Gan


Paper Project

Deep Face Video Inpainting via UV Mapping

Wenqi Yang, Zhenfang Chen, Chaofeng Chen, Guanying Chen, Kwan-Yee K. Wong

IEEE Transactions on Image Processing (TIP), 2023

Paper Project

S3-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint

Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, Kwan-Yee K. Wong


Paper Project Code

Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following

Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan


Paper Project Code

PS-NeRF: Neural Inverse Rendering for Multi-view Photometric Stereo

Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, Kwan-Yee K. Wong


Paper Project Code

A Unified Framework for Masked and Mask-Free Face Recognition via Feature Rectification

Shaozhe Hao, Chaofeng Chen, Zhenfang Chen, Kwan-Yee K. Wong


Paper Github

ComPhy: Compositional Physical Reasoning of Objects and Events from Videos

Zhenfang Chen, Kexin Yi, Yunzhu Li, Mingyu Ding, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan


Paper Project Code

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language

Mingyu Ding, Zhenfang Chen, Tao Du, Ping Luo, Joshua B. Tenenbaum, Chuang Gan


Paper Project Code

STAR: A Benchmark for Situated Reasoning in Real-World Videos

Bo Wu, Shoubin Yu, Zhenfang Chen , Joshua B. Tenenbaum, Chuang Gan

NeurIPS2021 (Datasets and Benchmarks Track)

Paper Project

The Blessings of Unlabeled Background in Untrimmed Videos

Yuan Liu, Jingyuan Chen, Zhenfang Chen, Bing Deng, Jianqiang Huang, Hanwang Zhang


Paper Code

Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning

Zhenfang Chen, Jiayuan Mao, Jiajun Wu, Kwan-Yee K. Wong, Joshua B. Tenenbaum, Chuang Gan


Paper Project Code

Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension

Zhenfang Chen, Peng Wang, Lin Ma, Kwan-Yee K. Wong, Qi Wu


Paper Code & Data

Look Closer to Ground Better: Weakly-Supervised Temporal Grounding of Sentence in Video

Zhenfang Chen, Lin Ma, Wenhan Luo, Peng Tang, Kwan-Yee K. Wong



Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video

Zhenfang Chen, Lin Ma, Wenhan Luo, Kwan-Yee K. Wong

ACL2019 (Oral Presentation)

Paper Code

Learning Local Similarity with Spatial Relations for Object Retrieval

Zhenfang Chen, Zhanghui Kuang, Wayne Zhang, Kwan-Yee K. Wong



Boosting up scene text detectors with guided CNN.

Xiaoyu Yue, Zhanghui Kuang, Zhaoyang Zhang, Zhenfang Chen, Pan He, Yu Qiao and Wayne Zhang

BMVC2018 (Oral Presentation)


Aggregated deep feature from activation clusters for particular object retrieval.

Zhenfang Chen, Zhanghui Kuang, Kwan-Yee K. Wong, Wayne Zhang

MM17 (Thematic Workshop)


PhD Dissertation

Deep learning for visual retrieval, visual grounding and visual reasoning

Zhenfang Chen

Dept. of Computer Science, The University of Hong Kong, 2021

HKU Thesis Online

Previous Experience

Professional Services

  • Senior Program Committee (SPC) Member:
      AAAI 2023
  • Conference Reviewer:
  • Journal Reviewer:
      Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
      International Journal of Computer Vision (IJCV)
      Transactions on Image Processing (TIP)
      Transactions on Multimedia Computing Communications and Applications (TOMM)
      Pattern Recognition (PR)
      Transactions on Neural Networks and Learning Systems (TNNLS)


  • M. Braun Postgraduate Prizes, HKU 2019-2020
  • Postgraduate Scholarships (PGS), HKU 2016-2020

Teaching Assistant at HKU

  • [Spring, 2020]: COMP3270 Artificial Intelligence
  • [Spring, 2019]: COMP7404 Computational Intelligence and Machine Learning
  • [Spring, 2018]: COMP7404 Computational Intelligence and Machine Learning
  • [Summer, 2017]: COMP7502 Image Processing and Computer Vision