Training Details

#13

by RedFairy - opened Jan 14

Thanks for the great work!
I have a question about one training detail: the conditioning mechanism. Specifically, does the model take a point cloud rendered at the novel view as an image condition? Or does it work simply by adding camera tokens (as text tokens) to the prompt?
Thank you!
