Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Luckeciano Carvalho Melo's picture
2 1

Luckeciano Carvalho Melo

luckeciano
·
https://luckeciano.github.io
  • LuckecianoMelo
  • luckeciano

AI & ML interests

Reinforcement Learning

Recent Activity

published a model about 1 month ago
luckeciano/Llama-3.1-8B-Instruct-CAPO-Base-v2-FisherMaskToken-1e-10-HessianMaskToken-0.0-LR-7.5e-7_2916
published a model about 1 month ago
luckeciano/Llama-3.1-8B-Instruct-CAPO-Base-v2-FisherMaskToken-1e-9-HessianMaskToken-0.0-LR-7.5e-7_9573
published a model about 1 month ago
luckeciano/Llama-3.1-8B-Instruct-CAPO-Base-v2-FisherMaskToken-1e-8-HessianMaskToken-0.0-LR-7.5e-7_8245
View all activity

Organizations

CEIA Reinforcement Learning's profile picture LLMsMaxEntRL's profile picture

upvoted a paper over 1 year ago

Deep Bayesian Active Learning for Preference Modeling in Large Language Models

Paper • 2406.10023 • Published Jun 14, 2024 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs