For Part 2, I calibrate x distances.
I setup a box and take four points of the corners and get color+depth data. I then move the camera and repeat.
The final equation is dx' = Z * dx / 444. In decimal form, it is dx' = .002 * Z * dx.
I have included the derivation.