Text-to-Image
Transformers
Safetensors
English
multi_modality
reinforcement-learning
grpo
gcpo
multimodal
How to use KonstantinosKK/Janus-Pro-7B-GCPO with Transformers:
    # Load model directly
    from transformers import MultiModalityCausalLM
    model = MultiModalityCausalLM.from_pretrained("KonstantinosKK/Janus-Pro-7B-GCPO", dtype="auto")
GCPO Janus-Pro-7B (Transformers-Compatible)
GitHub · Paper · arXiv · Checkpoints · Website
Janus-Pro-7B fine-tuned with Guidance Contrastive Policy Optimization (GCPO), a per-token credit assignment method for GRPO-style RL. Each token's advantage is weighted by the KL divergence between the policy's predictions under a positive prompt and under a negative prompt, so the classifier-free guidance signal serves as a token saliency map.
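To make the weighting idea concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, and the KL direction (positive relative to negative) and the mean-normalization of the weights are assumptions that the description above does not specify.

```python
import math

def softmax(logits):
    """Numerically stable softmax over one token's logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def token_weighted_advantages(pos_logits, neg_logits, advantage):
    """Scale one sequence-level advantage into per-token advantages.

    pos_logits / neg_logits: one list of logits per generated token,
    computed under the positive and the negative prompt respectively.
    advantage: the scalar GRPO-style advantage for the whole sample.
    """
    # Per-token saliency: how much the prompt changes the prediction.
    kls = [
        kl_divergence(softmax(p), softmax(n))
        for p, n in zip(pos_logits, neg_logits)
    ]
    mean_kl = sum(kls) / len(kls)
    if mean_kl <= 1e-12:
        # Assumption: fall back to uniform weights when no token is salient.
        weights = [1.0] * len(kls)
    else:
        # Assumption: normalize so that the weights average to 1.
        weights = [k / mean_kl for k in kls]
    return [advantage * w for w in weights]

# Token 0 depends on the prompt, token 1 does not.
pos = [[2.0, 0.0, 0.0], [1.0, 1.0, 1.0]]
neg = [[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]]
per_token = token_weighted_advantages(pos, neg, advantage=0.5)
```

In this toy case the prompt-sensitive token receives the full credit and the prompt-insensitive token receives none, which is the saliency-map behavior described above.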
Usage
For usage instructions, please refer to the Janus GitHub repository.
License
The use of Janus-Pro models is subject to the DeepSeek Model License.
Model tree for KonstantinosKK/Janus-Pro-7B-GCPO
Base model: deepseek-ai/Janus-Pro-7B