alanayu lee

alanayu

AI & ML interests

None yet

Recent Activity

new activity 17 days ago
Qwen/Qwen3-Next-80B-A3B-Instruct: When fine-tuning Qwen3-Next with Megatron, does setting --target_modules to "all-linear" also train the Qwen3NextGatedDeltaNet layers?
new activity 3 months ago
meituan-longcat/LongCat-Flash-Chat: Is this model still not usable for inference with vLLM?
new activity 7 months ago
Qwen/Qwen3-30B-A3B: How to train the Qwen3-30B-A3B using Reinforcement Learning?

Organizations

None yet

New activity in Qwen/Qwen3-Next-80B-A3B-Instruct 17 days ago

When fine-tuning Qwen3-Next with Megatron, does setting --target_modules to "all-linear" also train the Qwen3NextGatedDeltaNet layers?

👀 2
#41 opened 17 days ago by
alanayu
New activity in meituan-longcat/LongCat-Flash-Chat 3 months ago

Is this model still not usable for inference with vLLM?

🚀 1
#9 opened 3 months ago by
alanayu
New activity in Qwen/Qwen3-30B-A3B 7 months ago

How to train the Qwen3-30B-A3B using Reinforcement Learning?

#34 opened 7 months ago by
alanayu
New activity in unsloth/Qwen3-30B-A3B-GGUF 7 months ago

Not compatible with transformers library

4
#8 opened 8 months ago by
Xeenxavier007