Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
fire0517's picture

fire0517

fire0517

AI & ML interests

None yet

Organizations

None yet

Collections 1

llm
  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Paper • 2501.12948 • Published Jan 22, 2025 • 441
llm
  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Paper • 2501.12948 • Published Jan 22, 2025 • 441

models 8

fire0517/Pyramids

Reinforcement Learning • Updated Mar 25, 2025

fire0517/ppo-SnowballTarget

Reinforcement Learning • Updated Mar 22, 2025 • 1

fire0517/SnowballTarget1

Updated Mar 22, 2025

fire0517/hf_deeprl_unit4_reinforce

Reinforcement Learning • Updated Mar 21, 2025

fire0517/q-taxi3-learning

Reinforcement Learning • Updated Mar 18, 2025

fire0517/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Mar 18, 2025

fire0517/ppo-Huggy-test1

Reinforcement Learning • Updated Mar 13, 2025

fire0517/model_test1

Reinforcement Learning • Updated Mar 10, 2025

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs