Brian9999 committed on
Commit fcdfebe · 0 Parent(s)

Super-squash branch 'main' using huggingface_hub

Files changed (3)
  1. .gitattributes +41 -0
  2. README.md +54 -0
  3. model.pt +3 -0
.gitattributes ADDED
@@ -0,0 +1,41 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tar filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
+ autoencoder.jit filter=lfs diff=lfs merge=lfs -text
+ decoder.jit filter=lfs diff=lfs merge=lfs -text
+ encoder.jit filter=lfs diff=lfs merge=lfs -text
+ tokenizer/autoencoder.jit filter=lfs diff=lfs merge=lfs -text
+ tokenizer/decoder.jit filter=lfs diff=lfs merge=lfs -text
+ tokenizer/encoder.jit filter=lfs diff=lfs merge=lfs -text
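
Each rule above routes matching paths through Git LFS, which is why `model.pt` in this commit is stored as a pointer rather than as raw bytes. The following is a rough sketch of how such patterns select files. It is not Git's own matcher: the helper name `is_lfs_tracked` is hypothetical, only a handful of the patterns are copied in, and Python's `fnmatch` lets `*` cross `/`, which gitattributes patterns do not.

```python
from fnmatch import fnmatchcase

# A small subset of the patterns from the .gitattributes diff above.
LFS_PATTERNS = [
    "*.bin",
    "*.pt",
    "*.safetensors",
    "*.tar.*",
    "*tfevents*",
    "tokenizer/encoder.jit",
]


def is_lfs_tracked(path: str, patterns=LFS_PATTERNS) -> bool:
    """Approximate check of whether `path` matches any LFS pattern.

    Assumption: patterns without a slash are compared against the file
    name only (so they apply at any directory depth), while patterns
    containing a slash are compared against the full relative path.
    """
    name = path.rsplit("/", 1)[-1]
    for pattern in patterns:
        target = path if "/" in pattern else name
        if fnmatchcase(target, pattern):
            return True
    return False


if __name__ == "__main__":
    for candidate in ["model.pt", "README.md", "tokenizer/encoder.jit"]:
        print(candidate, is_lfs_tracked(candidate))
```

For an authoritative answer inside a real checkout, `git check-attr filter -- <path>` reports the attribute Git itself resolves.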
README.md ADDED
@@ -0,0 +1,54 @@
+ ---
+ license: apache-2.0
+ tags:
+ - video
+ - relighting
+ - inverse-rendering
+ - diffusion
+ - cosmos
+ pipeline_tag: image-to-image
+ ---
+
+ # World Inverse Renderer
+
+ Video inverse rendering model based on the NVIDIA Cosmos 7B video diffusion transformer, fine-tuned on a [custom dataset](https://github.com/ShandaAI/AlayaRenderer?tab=readme-ov-file).
+
+ ## Model Description
+
+ This model performs **inverse rendering** on images and videos: given an input RGB frame, it estimates physically-based G-buffer maps:
+
+ - **Basecolor** (albedo)
+ - **Normal** (surface normals)
+ - **Depth**
+ - **Roughness**
+ - **Metallic**
+
+ These G-buffers can then be used with a forward renderer to relight the scene under arbitrary environment lighting (HDRI maps).
+
+ ## Architecture
+
+ - Based on the NVIDIA Cosmos 7B video diffusion transformer
+ - Fine-tuned on a [custom dataset](https://github.com/ShandaAI/AlayaRenderer?tab=readme-ov-file)
+ - Supports both single-image and multi-frame video inverse rendering
+
+ ## Usage
+
+ ```bash
+ # Inverse rendering on images
+ CUDA_HOME=$CONDA_PREFIX PYTHONPATH=$(pwd) python cosmos_predict1/diffusion/inference/inference_inverse_renderer.py \
+ --checkpoint_dir checkpoints --diffusion_transformer_dir Diffusion_Renderer_Inverse_Cosmos_7B \
+ --dataset_path=your_input_images/ --num_video_frames 1 --group_mode webdataset \
+ --video_save_folder=output/ --save_video=False
+
+ # Inverse rendering on video frames
+ CUDA_HOME=$CONDA_PREFIX PYTHONPATH=$(pwd) python cosmos_predict1/diffusion/inference/inference_inverse_renderer.py \
+ --checkpoint_dir checkpoints --diffusion_transformer_dir Diffusion_Renderer_Inverse_Cosmos_7B \
+ --dataset_path=your_video_frames/ --num_video_frames 57 \
+ --video_save_folder=output/
+ ```
+
+ ## Requirements
+
+ - Python 3.10
+ - NVIDIA GPU with >= 16 GB VRAM (48 GB+ recommended)
+ - CUDA 12.0+
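
The README's model description says the estimated G-buffers can feed a forward renderer for relighting. As a toy illustration of that idea, the sketch below shades a single pixel from its basecolor and normal using plain Lambertian diffuse lighting with one directional light. This is a deliberately simplified stand-in, not the renderer the README refers to: it ignores depth, roughness, metallic, and HDRI environment maps, and the function names and the assumption that normals arrive as ordinary 3-vectors are illustrative only.

```python
import math


def normalize(v):
    """Return v scaled to unit length."""
    length = math.sqrt(sum(c * c for c in v))
    return tuple(c / length for c in v)


def shade_lambert(basecolor, normal, light_dir, light_rgb=(1.0, 1.0, 1.0)):
    """Diffuse-only shading of one pixel from its G-buffer values.

    basecolor: linear RGB albedo, each channel in [0, 1]
    normal:    surface normal (any length; normalized here)
    light_dir: direction from the surface toward the light
    light_rgb: light colour/intensity (hypothetical parameter)
    """
    n = normalize(normal)
    l = normalize(light_dir)
    # Clamp so that surfaces facing away from the light receive nothing.
    n_dot_l = max(0.0, sum(a * b for a, b in zip(n, l)))
    return tuple(b * c * n_dot_l for b, c in zip(basecolor, light_rgb))


if __name__ == "__main__":
    albedo = (0.8, 0.4, 0.2)
    facing = shade_lambert(albedo, (0, 0, 1), (0, 0, 1))   # light head-on
    behind = shade_lambert(albedo, (0, 0, 1), (0, 0, -1))  # light from behind
    print(facing, behind)
```

Changing `light_dir` or `light_rgb` while keeping the G-buffer values fixed is the essence of relighting: the material and geometry estimates stay the same and only the illumination changes.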
model.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d29e1ebc916fd704e81b2f32eb9b3098568647ea0220143367a2e715e235835e
+ size 28940339610
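
The three lines added for `model.pt` are a Git LFS pointer, not the checkpoint itself: they record the spec version, the SHA-256 of the real file, and its size in bytes. A minimal sketch of reading such a pointer follows; the function name `parse_lfs_pointer` and the returned dictionary keys are hypothetical, and the parser assumes the simple `key value` line layout shown in the diff.

```python
# Pointer contents copied from the model.pt diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d29e1ebc916fd704e81b2f32eb9b3098568647ea0220143367a2e715e235835e
size 28940339610
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its fields."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
size_gb = pointer["size_bytes"] / 1e9      # decimal gigabytes
size_gib = pointer["size_bytes"] / 2**30   # binary gibibytes

if __name__ == "__main__":
    print(pointer["hash_algo"], f"{size_gb:.2f} GB", f"{size_gib:.2f} GiB")
```

The size field shows the checkpoint is a little under 29 GB, which is worth checking against free disk space before downloading.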