About Me

CV      Contact: menglong AT cis.upenn.edu

I'm a PhD student in Computer and Information Science at the GRASP Lab, University of Pennsylvania, working with Kostas Daniilidis. My research interests are computer vision, robotics, and machine learning, with a focus on object recognition, 3D pose estimation, human action recognition, visual SLAM, and text recognition. I obtained a Bachelor's degree in Computer Science from Fudan University in 2010 and a Master's degree in Robotics from the University of Pennsylvania in 2012.

Code

Fast object detector - Active Deformable Part Models.
Video annotation tool for human joints, deployable on Amazon Mechanical Turk.
Camera pose estimation, involved in Google Project Tango.
Text detection and recognition, ROS package, PR2 deployable.
Fast A* planning algorithm with quadtree decomposition.

Publications

1. Pose and Shape Estimation with Discriminatively Learned Parts
M. Zhu, X. Zhou and K. Daniilidis,
[Technical Report] arXiv:1502.00192 [cs.CV], 2015.
[PDF / bibtex]
@article{ZhuPoseShape15arxiv,
  author    = {Menglong Zhu and
               Xiaowei Zhou and
               Kostas Daniilidis},
  title     = {Pose and Shape Estimation with Discriminatively Learned Parts},
  journal   = {CoRR},
  volume    = {abs/1502.00192},
  year      = {2015},
  ee        = {http://arxiv.org/abs/1502.00192},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
2. Active Deformable Part Models Inference
M. Zhu, N. Atanasov, G. J. Pappas, and K. Daniilidis,
European Conference on Computer Vision (ECCV), 2014.
[PDF / bibtex / video / project page (code)]
@InProceedings{ZhuADPM2014,
  author    = {M. Zhu and N. Atanasov and G. Pappas and K. Daniilidis},
  title     = {{Active Deformable Part Models Inference}},
  year      = {2014},
  booktitle = {European Conference on Computer Vision (ECCV)}
}         
3. Active Deformable Part Models
M. Zhu, N. Atanasov, G. J. Pappas, and K. Daniilidis,
[Technical Report] arXiv:1404.0334 [cs.CV], 2014.
[PDF / bibtex]
@article{ZhuADPM14arxiv,
  author    = {Menglong Zhu and
               Nikolay Atanasov and
               George J. Pappas and
               Kostas Daniilidis},
  title     = {Active Deformable Part Models},
  journal   = {CoRR},
  volume    = {abs/1404.0334},
  year      = {2014},
  ee        = {http://arxiv.org/abs/1404.0334},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}         
4. Semantic Localization Via the Matrix Permanent
N. Atanasov, M. Zhu, G. J. Pappas, and K. Daniilidis,
Robotics: Science and Systems (RSS), 2014.
[PDF / bibtex]
@InProceedings{Atanasov_SemanticLocalization_RSS14,
  author = {N. Atanasov and M. Zhu and K. Daniilidis and G. Pappas},
  title = {{Semantic Localization Via the Matrix Permanent}},
  year = {2014},
  booktitle={Robotics: Science and Systems (RSS)}
}         
5. Single Image 3D Object Detection and Pose Estimation for Grasping
M. Zhu, K. Derpanis, Y. Yang, S. Brahmbhatt, M. Zhang,
C. Phillips, M. Lecce and K. Daniilidis,
International Conference on Robotics and Automation (ICRA), 2014.
[PDF / bibtex / video / project page]
@inproceedings{zhu2014grasping,
  title     = {Single Image 3D Object Detection 
               and Pose Estimation for Grasping},
  author    = {Zhu, Menglong and Derpanis, Konstantinos G 
               and Yang, Yinfei and Brahmbhatt, Samarth 
               and Zhang, Mabel and Phillips, Cody 
               and Lecce, Matthieu and Daniilidis, Kostas},
  booktitle = {International Conference on Robotics and Automation},
  year      = {2014}
}
6. From Actemes to Action: A Strongly-supervised Representation for Detailed Action Understanding
W. Zhang, M. Zhu and K. Derpanis,
International Conference on Computer Vision (ICCV), 2013.
[PDF / bibtex / video / project page / action dataset]
@inproceedings{zhang2013actemes,
  title     = {From Actemes to Action: A Strongly-supervised 
               Representation for Detailed Action Understanding},
  author    = {Zhang, Weiyu and Zhu, Menglong 
               and Derpanis, Konstantinos G},
  booktitle = {International Conference on Computer Vision},
  pages     = {2248--2255},
  year      = {2013},
}         
7. Monocular Visual Odometry and Dense 3D Reconstruction for On-Road Vehicles
M. Zhu, S. Ramalingam, Y. Taguchi and T. Garaas,
European Conference on Computer Vision (ECCV), Workshop on CVVT, 2012.
[PDF / bibtex / video]
@inproceedings{zhu2012monocular,
  title     = {Monocular visual odometry and dense 3d 
               reconstruction for on-road vehicles},
  author    = {Zhu, Menglong and Ramalingam, Srikumar 
               and Taguchi, Yuichi and Garaas, Tyler},
  booktitle = {European Conference on Computer Vision, 
               Workshops and Demonstrations},
  pages     = {596--606},
  year      = {2012},
}         
8. Literate PR2: Text detection and recognition for indoor environment
M. Zhu, K. Derpanis and K. Daniilidis,
Robot Operating System (ROS), 2011.
[ROS wiki / video]

Patents

Method and System for Determining Poses of Vehicle-Mounted Cameras
for In-Road Obstacle Detection,
US 20140037136 A1.
M. Zhu, S. Ramalingam and Y. Taguchi, [Link]

Datasets

15-class action dataset, frame-by-frame annotated human body joints.

Invited Talks

April 09, 2014, VASC Seminar at CMU

Teaching

Advanced Robotics, 2015 Spring, Learn to program a quadrotor
Machine Perception, 2013 Spring
Machine Learning, 2012 Fall, Remembering Ben Taskar
Introduction to Computer Programming, 2012 Spring
Programming Languages and Techniques III, 2011 Spring
Programming Languages and Techniques III, 2011 Fall