Take a look at this video, notice that everyone looks directly at the camera:
It is not clear how high the cameras are mounted, they could very well be 4' high as they are at Megvii's offices and/or it could be that subjects are told to look directly at the cameras.
The challenge is most people do not look directly at surveillance cameras due to a combination of (1) cameras not being normally mounted 4' high (10' or higher is more common) and (2) people having a tendency to look down when they walk and (3) people not walking directly in line to the camera (either the camera is tilted left or right or the person is not walking perpendicular to the camera, etc.).
The result is a good marketing video but an unrealistic portrayal of real use. If every camera can pick up every person, you can graph the route and timing of every person. The problem is that, regardless of the vendor, it will be logistically quite hard to accomplish this due to camera positioning, actions of people and the layout of buildings and streets. Once users realize that it generally does not work, it gets ignored. See poorly working video analytics.