Thank you for the recently released evaluation code. When evaluating on the COCO dataset, it seems that the CLIP scores and other metrics are obtained by comparing images generated from the quantized model with those from the full-precision model. In that case, how can we compute the CLIP scores of the original FP16 model as reported in the paper? Thank you!