Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
tuandunghcmut 's Collections
Gemma 4 Text-Only
Qwen3.5 Text-Only
MT-LLM
Agentic Benchmarks
Safety SFT
Tool Calling dataset for search domain
Document Layout Analysis Dataset
Post-training Dataset
RL-Papers
Visual Chain-of-Thought Reasoning Benchmarks
LLM for Security Benchmarks/Datasets
Visual-CoT/GCoT related
Text Embedding Papers
EMPTY A
Quantized versions of LLMs/MLLMs
Multilingual Sentiment Analysis Dataset
LLM Series
LLM/MLLM (20B - 80B, fit on 1-2 A100/H100)
SLM
MLLM (100B - 300B)
Benchmarks for evaluating LLMs/MLLMs
Conversation Dataset
Multilingual Parallel Text Corpus
Multilingual Pretraining Corpus for Southeast Asian Language

Document Layout Analysis Dataset

updated Mar 26
Upvote
1

  • jordanparker6/publaynet

    Viewer • Updated Jul 19, 2022 • 27.4k • 1.86k • 37

    Note Recommended by https://github.com/ibm-aur-nlp/PubLayNet?tab=readme-ov-file


  • docling-project/DocLayNet-v1.2

    Viewer • Updated Feb 10, 2025 • 80.9k • 1.64k • 18

  • juliozhao/DocSynth300K

    Viewer • Updated Oct 24, 2024 • 229k • 501 • 55

    Note https://huggingface.co/papers/2410.12628


  • creative-graphic-design/PubLayNet

    Viewer • Updated 6 days ago • 358k • 1.08k • 10

  • Layout Generation Dataset

    Collection
    2 items • Updated Jul 27, 2024 • 1

  • tuandunghcmut/D4LA

    Viewer • Updated Mar 1 • 11.1k • 93 • 1
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs