I want to convert the video like this and save it as Npy:
The file should contain an array with shape 15 x 2 x length.
The first dimension (15) corresponds the 15 body joint defined here.
The second dimension (2) corresponds to x and y coordinates.
The third dimension (length) is the temporal dimension.