Jin Chen
VanZieks
ยท
AI & ML interests
Large language model
Recent Activity
upvoted
a
paper
about 12 hours ago
BABE: Biology Arena BEnchmark
upvoted
a
paper
about 13 hours ago
Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities
upvoted
a
paper
5 months ago
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
Organizations
None yet