Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
ALIENS's picture
8 8

ALIENS

ALIENS232
6b4b86ec-928a-4b7e-9c1e-8d5f009e3272's profile picture
·
  • ALIENS

AI & ML interests

None yet

Organizations

jilin university's profile picture

upvoted a paper 3 months ago

Can Vision-Language Models Solve the Shell Game?

Paper • 2603.08436 • Published Mar 9 • 39
upvoted 2 papers 5 months ago

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

Paper • 2505.19095 • Published May 25, 2025 • 2

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 207
upvoted a paper 7 months ago

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21, 2025 • 36
upvoted a paper 11 months ago

Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability

Paper • 2508.04017 • Published Aug 6, 2025 • 11
upvoted a paper about 1 year ago

Don't Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models

Paper • 2505.23715 • Published May 29, 2025 • 2
upvoted 2 papers over 1 year ago

StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following

Paper • 2502.14494 • Published Feb 20, 2025 • 15

Large Language Model Evaluation via Matrix Nuclear-Norm

Paper • 2410.10672 • Published Oct 14, 2024 • 19
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs