Open to Work

JeonJinhyeok

jinn33
AI & ML interests

None yet

Organizations

None yet

jinn33's collections (1)

article collection
  • EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

    Paper • 2509.22576 • Published Sep 26, 2025 • 137
  • AgentBench: Evaluating LLMs as Agents

    Paper • 2308.03688 • Published Aug 7, 2023 • 26
  • DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

    Paper • 1910.01108 • Published Oct 2, 2019 • 23
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 66