Model checkpoints generated during an ongoing research effort into the acceleration potential and tuning quality of LLMs with RL fine tuning.
Scott Biggs
ScottBiggs2
AI & ML interests
I'm an AI researcher working on scalable generative modeling and reinforcement learning, with recent work in sparse RL acceleration and preference-based optimization. I release models and artifacts related to research, industry collaboration, and experimental exploration.
Recent Activity
liked
a model
about 16 hours ago
dllm-collection/Qwen3-0.6B-diffusion-mdlm-v0.1
liked
a dataset
about 18 hours ago
allenai/tulu-3-sft-mixture
authored
a paper
1 day ago
DeepWeightFlow: Re-Basined Flow Matching for Generating Neural Network Weights