Miscellaneous Text Datasets for Language Models izumi-lab/oscar2301-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 31.4M • 203 • 6 izumi-lab/mc4-ja Viewer • Updated Jul 29, 2023 • 87.4M • 4.24k • 6 izumi-lab/mc4-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 52.6M • 856 • 5 izumi-lab/wikinews-ja-20230728 Viewer • Updated Jul 29, 2023 • 4.28k • 82 • 5
Japanese General Pre-trained Language Models izumi-lab/deberta-v2-base-japanese Fill-Mask • 0.1B • Updated 18 days ago • 4.72k • • 5 izumi-lab/deberta-v2-small-japanese Fill-Mask • 26.2M • Updated 18 days ago • 51 izumi-lab/bert-small-japanese Fill-Mask • Updated Dec 9, 2022 • 118 • 5 izumi-lab/electra-base-japanese-discriminator 0.1B • Updated 18 days ago • 47 • 2
llm-japanese-dataset izumi-lab/llm-japanese-dataset Viewer • Updated Jan 18, 2024 • 9.07M • 421 • 142 izumi-lab/llm-japanese-dataset-vanilla Viewer • Updated Feb 17, 2024 • 2.49M • 302 • 33
Japanese LoRA-tuned LLMs izumi-lab/stormy-7b-10ep Updated Jun 26, 2023 • 5 izumi-lab/llama-13b-japanese-lora-v0-1ep Updated May 23, 2023 • 11 izumi-lab/llama-7b-japanese-lora-v0-5ep Updated Jun 23, 2023 • 3 Paused 4 LLaMA 13B Japanese LoRA v0 1 epoch 🐨 4
Japanese Financial Pre-trained Language Models izumi-lab/bert-base-japanese-fin-additional 0.1B • Updated Jun 16, 2025 • 493 • 3 izumi-lab/bert-small-japanese-fin Fill-Mask • 18.1M • Updated 18 days ago • 59 • 2 izumi-lab/electra-small-japanese-fin-discriminator 13.8M • Updated 18 days ago • 32 izumi-lab/electra-small-japanese-fin-generator Fill-Mask • 13.8M • Updated Oct 21, 2023 • 14
Miscellaneous Text Datasets for Language Models izumi-lab/oscar2301-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 31.4M • 203 • 6 izumi-lab/mc4-ja Viewer • Updated Jul 29, 2023 • 87.4M • 4.24k • 6 izumi-lab/mc4-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 52.6M • 856 • 5 izumi-lab/wikinews-ja-20230728 Viewer • Updated Jul 29, 2023 • 4.28k • 82 • 5
Japanese LoRA-tuned LLMs izumi-lab/stormy-7b-10ep Updated Jun 26, 2023 • 5 izumi-lab/llama-13b-japanese-lora-v0-1ep Updated May 23, 2023 • 11 izumi-lab/llama-7b-japanese-lora-v0-5ep Updated Jun 23, 2023 • 3 Paused 4 LLaMA 13B Japanese LoRA v0 1 epoch 🐨 4
Japanese General Pre-trained Language Models izumi-lab/deberta-v2-base-japanese Fill-Mask • 0.1B • Updated 18 days ago • 4.72k • • 5 izumi-lab/deberta-v2-small-japanese Fill-Mask • 26.2M • Updated 18 days ago • 51 izumi-lab/bert-small-japanese Fill-Mask • Updated Dec 9, 2022 • 118 • 5 izumi-lab/electra-base-japanese-discriminator 0.1B • Updated 18 days ago • 47 • 2
Japanese Financial Pre-trained Language Models izumi-lab/bert-base-japanese-fin-additional 0.1B • Updated Jun 16, 2025 • 493 • 3 izumi-lab/bert-small-japanese-fin Fill-Mask • 18.1M • Updated 18 days ago • 59 • 2 izumi-lab/electra-small-japanese-fin-discriminator 13.8M • Updated 18 days ago • 32 izumi-lab/electra-small-japanese-fin-generator Fill-Mask • 13.8M • Updated Oct 21, 2023 • 14
llm-japanese-dataset izumi-lab/llm-japanese-dataset Viewer • Updated Jan 18, 2024 • 9.07M • 421 • 142 izumi-lab/llm-japanese-dataset-vanilla Viewer • Updated Feb 17, 2024 • 2.49M • 302 • 33