ReVo: A Cross-Layer Reliable Volumetric Videoconferencing System

Project Website

Ankur Aditya1*, Diptyaroop Maji1*, Lingdong Wang1, Bhavya Ramakrishna2, Ramesh Sitaraman1,3, Prashant Shenoy1

1University of Massachusetts Amherst     2Dolby Labs     3Akamai Tech
* Student authors with equal contribution

This repository contains the pre-trained neural loss recovery module checkpoints for the ReVo system. These models are designed to perform neural loss recovery for volumetric videoconferencing systems, recovering lost packets across different video codecs.

Model Details

The repository includes 6 distinct models, organized by the compression codec and the RGB and Depth modality.

Supported Codecs:

  • h264 (Advanced Video Coding)
  • h265 (High Efficiency Video Coding / HEVC)
  • dcvcrt (Deep Contextual Video Compression)

Repository Structure

umass-lass/ReVo
β”œβ”€β”€ h264/
β”‚   β”œβ”€β”€ h264_rgb.pth
β”‚   β”œβ”€β”€ h264_depth.pth
β”œβ”€β”€ h265/
β”‚   β”œβ”€β”€ h265_rgb.pth
β”‚   β”œβ”€β”€ h265_depth.pth
└── dcvcrt/
    β”œβ”€β”€ dcvcrt_rgb.pth
    β”œβ”€β”€ dcvcrt_depth.pth

How to Use?

from huggingface_hub import hf_hub_download
import torch

# Define the specific codec and modality you want to load
codec = "h265"      # Options: 'h264', 'h265', 'dcvcrt'
modality = "depth"  # Options: 'rgb', 'depth'

# Download the specific checkpoint
checkpoint_path = hf_hub_download(
    repo_id="umass-lass/ReVo", 
    filename=f"{codec}/{codec}_{modality}.pth"
)

# Load the weights into your PyTorch model
# model.load_state_dict(torch.load(checkpoint_path))
print(f"Successfully downloaded to: {checkpoint_path}")

Citation

(Coming Soon)

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support