view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 โข 104
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper โข 2511.09057 โข Published Nov 12, 2025 โข 79
Cross-lingual Transfer Learning for Javanese Dependency Parsing Paper โข 2401.12072 โข Published Jan 22, 2024
Sleeping Ai Mindfulness Apps ๐ Generate recommendations and risk assessments aligned with Kalbe Group values
fadliaulawi/distilbert-base-uncased-finetuned-squad-d5716d28 Question Answering โข Updated Jul 27, 2023 โข 1