Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Ziheng Zhou
josephziheng
3
4
5
Follow
0 followers
·
1 following
josephziheng
JosephZZ
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Less is More: Early Stopping Rollout for On-Policy Distillation
submitted
a paper
about 1 month ago
Less is More: Early Stopping Rollout for On-Policy Distillation
reacted
to
salma-remyx
's
post
with 🔥
about 2 months ago
SciCrafter measured something AI practitioners have intuited: frontier agents are improving at executing inside well-framed problems, but lag at framing the problem in the first place. GPT-5.2, Gemini-3-Pro, and Claude Opus 4.5 all plateaued near 26% on a new Minecraft benchmark for probing AI capabilities in the discovery-to-application loop. So the authors ran targeted interventions: * Hints about what to investigate doubled performance. * A structured experimentation template added 7-14 more points. * Structured consolidation beat free-form summaries by 6 points. * Curriculum context beat independent task-solving. These interventions helped the agent frame what’s worth investigating, and structure what gets learned so it compounds. The bottleneck for AI in scientific workflows is upstream of execution. Their findings are congruent with the design patterns we've adopted at Remyx AI to help AI teams close the development loop scientifically. Agents work well inside structured loops, but they perform poorly when tasked with creating the structure. Instrumenting your scientific workflows offers greater leverage than scaling compute with a less informed search. In the work of building production AI systems, teams are flying through execution. The bigger challenge is identifying which experiments moved which production outcome, or what to try next. One of the more interesting results I found this week by tracking work in AI for scientific workflows using Remyx: https://engine.remyx.ai/papers/d8f23b9b-b14b-4ada-b44e-ccfc221c06b4
View all activity
Organizations
None yet
josephziheng
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
2 models
almost 3 years ago
baichuan-inc/Baichuan2-13B-Base
Text Generation
•
Updated
Dec 24, 2023
•
2.3k
•
82
internlm/internlm-20b
Text Generation
•
Updated
Jan 24, 2024
•
714
•
77
liked
3 datasets
almost 3 years ago
zhiqings/dromedary-65b-verbose-clone-v0
Viewer
•
Updated
Jun 23, 2023
•
361k
•
44
•
11
iamketan25/roleplay-instructions-dataset
Viewer
•
Updated
Apr 24, 2023
•
3.15k
•
159
•
33
PKU-Alignment/BeaverTails
Viewer
•
Updated
Oct 17, 2023
•
364k
•
18.7k
•
109