This work is a collaboration between the JHU Smile Lab and the CMU MSP Lab. Please cite https://arxiv.org/abs/2511.00256.