File size: 2,111 Bytes
1180ddd
 
 
 
 
 
 
 
 
e102d86
 
356c907
e102d86
356c907
1180ddd
e102d86
 
 
 
 
 
 
 
1180ddd
 
 
 
3fcbf51
1180ddd
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6294a17
89dc03f
6294a17
89dc03f
6294a17
89dc03f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
---
license: mit
tags:
- computer-vision
- pytorch
- volumetric-video
- multimedia-systems 
---

<div align="center">

# ReVo: A Cross-Layer Reliable Volumetric Videoconferencing System

[![Project Website](https://img.shields.io/badge/Project-Website-blue.svg)](https://umassos.github.io/revo-website/)

**Ankur Aditya**<sup>1*</sup>, **Diptyaroop Maji**<sup>1*</sup>, **Lingdong Wang**<sup>1</sup>, **Bhavya Ramakrishna**<sup>2</sup>, **Ramesh Sitaraman**<sup>1,3</sup>, **Prashant Shenoy**<sup>1</sup>

<sup>1</sup>University of Massachusetts Amherst &nbsp;&nbsp;&nbsp; <sup>2</sup>Dolby Labs &nbsp;&nbsp;&nbsp; <sup>3</sup>Akamai Tech
<br>
<sup>*</sup> *Student authors with equal contribution*

</div>

This repository contains the pre-trained neural loss recovery module checkpoints for the **ReVo** system. These models are designed to perform neural loss recovery for volumetric videoconferencing systems, recovering lost packets across different video codecs.

## Model Details

The repository includes 6 distinct models, organized by the compression codec and the RGB and Depth modality.

**Supported Codecs:**
* `h264` (Advanced Video Coding)
* `h265` (High Efficiency Video Coding / HEVC)
* `dcvcrt` (Deep Contextual Video Compression)



## Repository Structure
```text
umass-lass/ReVo
β”œβ”€β”€ h264/
β”‚   β”œβ”€β”€ h264_rgb.pth
β”‚   β”œβ”€β”€ h264_depth.pth
β”œβ”€β”€ h265/
β”‚   β”œβ”€β”€ h265_rgb.pth
β”‚   β”œβ”€β”€ h265_depth.pth
└── dcvcrt/
    β”œβ”€β”€ dcvcrt_rgb.pth
    β”œβ”€β”€ dcvcrt_depth.pth
```

# How to Use?

```python
from huggingface_hub import hf_hub_download
import torch

# Define the specific codec and modality you want to load
codec = "h265"      # Options: 'h264', 'h265', 'dcvcrt'
modality = "depth"  # Options: 'rgb', 'depth'

# Download the specific checkpoint
checkpoint_path = hf_hub_download(
    repo_id="umass-lass/ReVo", 
    filename=f"{codec}/{codec}_{modality}.pth"
)

# Load the weights into your PyTorch model
# model.load_state_dict(torch.load(checkpoint_path))
print(f"Successfully downloaded to: {checkpoint_path}")
```

# Citation
(Coming Soon)