Code Datasets
updated
ClarusC64/ai-5node-chain-buf-lag-cpl-agent-loop-v0.1
Viewer
• Updated • 9 • 12
ClarusC64/market-agent-reaction-path-mapping-v0.1
Viewer
• Updated • 7 • 11
MartinElMolon/stocks_demo_react_agent_generated_train_dataset
Viewer
• Updated • 497 • 51
agentlans/lightblue-tagengo-gpt4
Viewer
• Updated • 76k • 185
bashirhanafi/adverse-news-ai-agent
Viewer
• Updated • 80 • 7
Viewer
• Updated • 200 • 5
secmlr/best_n_no_rationale_poc_agent_withjava
Viewer
• Updated • 8.77k • 11
secmlr/best_n_no_rationale_poc_agent_withjava_vulnllm
Viewer
• Updated • 7.89k • 7
Viewer
• Updated • 34k • 15
• 3
Fischerboot/mongo-tom-15k-alpaca
NewEden-Forge/Mango-Card-Archive
Viewer
• Updated • 7 • 1
Viewer
• Updated • 1.76k • 51
alwaysfurther/deepfabric-insecure_shell
Viewer
• Updated • 126 • 6
• 1
alwaysfurther/deepfabric-linux_shell_attacks
Viewer
• Updated • 737 • 11
• 1
Viewer
• Updated • 395k • 3
buelfhood/SOCO_Java_Pair_Subset
Viewer
• Updated • 212k • 2
buelfhood/SOCO_TRAIN_java
Viewer
• Updated • 76.5k • 22
Viewer
• Updated • 33.4k • 5
buelfhood/Soco_java_test_C2
Viewer
• Updated • 3.83k • 4
darkknight25/Red_Team_Operations_ShellScript_Dataset
darkknight25/Reverse_Shell_Payloads_Dataset
Updated • 51
Viewer
• Updated • 27 • 4
• 1
jayavibhav/biographies-yago-train-duplicated
Viewer
• Updated • 5k • 3
Viewer
• Updated • 370k • 9
Viewer
• Updated • 120k • 385
• 5
Preview
• Updated • 91
yuntian-deng/im2html-100k
Viewer
• Updated • 100k • 3
Viewer
• Updated • 491k • 65.7k
• 14
nlplabtdtu/vi-legal-docs-html
Viewer
• Updated • 329k • 2
french-open-data/indigo-open-data-irve
0xSero/glm47-reap-calibration-code-func
Viewer
• Updated • 1.03k • 24
2796gauravc/pycoder-dataset
Viewer
• Updated • 25.1k • 9
2796gauravc/pycoder-lfm2.5-agentic-dataset
Viewer
• Updated • 83.9k • 7
ADSKAILab/codeparrot_megatron
ADSKAILab/codeparrot_megatron_tiny
AlekseyKorshuk/codenames-100-gpt-oss-120b
Viewer
• Updated • 3.13k • 18
AlekseyKorshuk/evol-codealpaca-seed
Viewer
• Updated • 111k • 3
AlekseyKorshuk/evol-codealpaca-v1-dpo
Viewer
• Updated • 39.9k • 54
• 6
Alignment-Lab-AI/Stack-Exchange-April
Viewer
• Updated • 3.15M • 134
• 6
Alignment-Lab-AI/StackStore
Viewer
• Updated • 1.46M • 22
• 1
Alignment-Lab-AI/librispeech-codec-22khz
Viewer
• Updated • 28.5k • 41
Alignment-Lab-AI/rawbuzzcodemess
Alignment-Lab-AI/stackexchange-test
Preview
• Updated • 4
Alignment-Lab-AI/uncleaned-codebase
Arno-MHL/ios-security-vulnerabilities-swift-objc
Viewer
• Updated • 27 • 41
• 1
Asap7772/leetcode-rosetta-processed-with-test-cases
Viewer
• Updated • 2.25k • 25
• 4
BEE-spoke-data/awesome-python-apps
Viewer
• Updated • 61.1k • 126
• 1
BEE-spoke-data/code-tutorials-en
Viewer
• Updated • 620k • 63
• 1
BEE-spoke-data/code_contests_instruct
Viewer
• Updated • 12.2M • 438
• 6
BEE-spoke-data/pile-python-filtered
Viewer
• Updated • 841k • 6
• 1
BEE-spoke-data/smollm-corpus-python
Viewer
• Updated • 12.4M • 10
BEE-spoke-data/stackoverflow-questions-long
Viewer
• Updated • 752k • 81
• 1
BEE-spoke-data/the-stack-smol-xl-readable
Viewer
• Updated • 424k • 20
• 1
BEE-spoke-data/the-stack-smol-xs-all
Viewer
• Updated • 8.7k • 8
ChuGyouk/CoderForge-Preview-Filtered
Viewer
• Updated • 155k • 18
ChuGyouk/DeepCoder-Preview-Processed
Viewer
• Updated • 22.3k • 1
• 1
ChuGyouk/StackExchange-Medical
Viewer
• Updated • 40.6k • 135
• 1
ChuGyouk/WebInstructSub-only-sciencestackexchange
Viewer
• Updated • 317k • 31
• 2
CyberNative/github_cybersecurity_READMEs
Viewer
• Updated • 2.06k • 43
• 14
DCAgent/Kimi-2.5-exp_rpt_codeelo-v2-maxeps-32k
Viewer
• Updated • 872 • 6
DCAgent/Kimi-2.6-exp_rpt_codeelo-v2-maxeps-32k
Viewer
• Updated • 480 • 7
• 1
DCAgent/perturbed-docker-exp-magicoder-tasks-12
Viewer
• Updated • 10k • 15
Dahoas/2048_has_code_filtered_base_code_review
Viewer
• Updated • 30.9k • 14
Dahoas/4096_filtered_base_code_review
Viewer
• Updated • 37k • 76
• 3
Viewer
• Updated • 76k • 29
• 1
Delta-Vector/Hydrus-AM-Thinking-Code-Filtered
Viewer
• Updated • 9.13k • 5
Delta-Vector/Hydrus-Hardcode-Dphn
Viewer
• Updated • 220 • 9
Delta-Vector/Hydrus-Next-Coder-Single-turn
Viewer
• Updated • 17.3k • 8
Fortytwo-Network/Strandset-Rust-v1
Viewer
• Updated • 191k • 268
• 37
FreedomIntelligence/Code-Alpaca-Arabic-GPT4
Viewer
• Updated • 20k • 22
• 3
Viewer
• Updated • 48 • 8.76k
• 1
GamesMais18/ShadowsOfTrustDados
Viewer
• Updated • 20.1k • 69
• 22
Guilherme34/openai-finetune-codefuse-evol-instruct
Viewer
• Updated • 66.4k • 10
Guilherme34/openai-finetune-python-code-instructions
Viewer
• Updated • 18.6k • 10
HuggingFaceH4/Code-Feedback
Viewer
• Updated • 66.4k • 248
• 9
HuggingFaceH4/CodeAlpaca_20K
Viewer
• Updated • 20k • 6.1k
• 108
HuggingFaceH4/stack-exchange-preferences
Viewer
• Updated • 10.8M • 11.9k
• 134
HuggingFaceH4/testing_codealpaca_small
Viewer
• Updated • 200 • 2.58k
• 6
HuggingFaceTB/python-edu-annotations
Viewer
• Updated • 491k • 107
• 2
Viewer
• Updated • 167M • 3.96k
• 74
HuggingFaceTB/stack-edu-prompts-16langs-1k
Viewer
• Updated • 1k • 9
HuggingFaceTB/stack-edu-python-10k-annotations
HuggingFaceTB/stackexchange_2025_md
Updated • 4.81k
• 3
IlyaGusev/ru_stackoverflow
Updated • 452
• 12
Jack-XCodes/my-distiset-af3fce9f
Viewer
• Updated • 89 • 12
Liontix/grok-code-fast-1-200x
Viewer
• Updated • 247 • 15
• 3
Viewer
• Updated • 247 • 10
• 5
Locutusque/code-feedback-sharegpt
Viewer
• Updated • 66.4k • 32
Locutusque/codeforces-sharegpt
Viewer
• Updated • 47.8k • 30
Locutusque/cogstack-conv-sharegpt
Viewer
• Updated • 2.35k • 3
Locutusque/cogstack-qa-sharegpt
Viewer
• Updated • 24.7k • 4
Locutusque/cogstack-tasks-sharegpt
Viewer
• Updated • 4.69k • 14
• 1
Madras1/glm-5-code-distilled2.6k
Viewer
• Updated • 2.65k • 27
Madras1/minimax-m2.5-code-distilled-14k
Viewer
• Updated • 14.2k • 46
• 15
Madras1/qwen3.5-397b-code-distilled-6k
Viewer
• Updated • 5.72k • 30
• 1
Malikeh1375/code-switching-tokenizer-robustness
Viewer
• Updated • 1.2k • 354
• 2
MingSafeR/qwen3-vl-code-remember
Viewer
• Updated • 2 • 42
Modotte/CodeX-2M-Thinking
Viewer
• Updated • 2.19M • 4.72k
• 125
Modotte/CodeX-7M-Non-Thinking
Viewer
• Updated • 7.36M • 1.7k
• 21
NLPC-UOM/Sinhala-English-Code-Mixed-Code-Switched-Dataset
Preview
• Updated • 111
• 4
Viewer
• Updated • 5k • 78
Naholav/CodeGen-Diverse-5K
Viewer
• Updated • 5k • 107
Naholav/llama3.2-java-codegen-90sft-10meta-claude-v1
Viewer
• Updated • 100k • 24
• 1
Nan-Do/atcoder_abc_contests
Viewer
• Updated • 28.5M • 31
• 2
Nan-Do/atcoder_abc_contests_small
Viewer
• Updated • 385k • 9
• 6
Nan-Do/atcoder_agc_contests
Viewer
• Updated • 1.93M • 8
• 1
Nan-Do/atcoder_arc_contests
Viewer
• Updated • 3.65M • 21
• 1
Nan-Do/code-search-net-go
Viewer
• Updated • 346k • 36
• 1
Nan-Do/code-search-net-java
Viewer
• Updated • 496k • 212
• 4
Nan-Do/code-search-net-javascript
Viewer
• Updated • 138k • 114
• 7
Nan-Do/code-search-net-php
Viewer
• Updated • 577k • 72
• 1
Nan-Do/code-search-net-python
Viewer
• Updated • 455k • 1.5k
• 30
Nan-Do/code-search-net-ruby
Viewer
• Updated • 53.2k • 26
• 2
Nan-Do/codechef_start_contests
Preview
• Updated • 4
• 1
Nan-Do/codeforces_contests
Viewer
• Updated • 11.9M • 11
• 1
Nan-Do/instructional_code-search-net-go
Viewer
• Updated • 203k • 50
• 2
Nan-Do/instructional_code-search-net-java
Viewer
• Updated • 468k • 67
• 1
Nan-Do/instructional_code-search-net-javacript
Viewer
• Updated • 121k • 34
• 4
Nan-Do/instructional_code-search-net-php
Viewer
• Updated • 537k • 28
• 3
Nan-Do/instructional_code-search-net-python
Viewer
• Updated • 419k • 276
• 36
Nan-Do/instructional_code-search-net-ruby
Viewer
• Updated • 51.5k • 16
• 4
Viewer
• Updated • 5.8M • 15
• 2
Nan-Do/leetcode_contests_unique_solutions
Viewer
• Updated • 144k • 23
• 10
Neloy262/rust_instruction_dataset
Viewer
• Updated • 10k • 40
• 3
NousResearch/CharacterCodex
Viewer
• Updated • 15.9k • 498
• 237
NousResearch/SWE-smith-oracle
Viewer
• Updated • 10.2k • 27
• 5
OALL/details_Qwen__Qwen2.5-Coder-14B-Instruct
Viewer
• Updated • 146k • 9
OALL/details_SenseLLM__ReflectionCoder-DS-33B
Viewer
• Updated • 146k • 218
OdiaGenAI/hardcode_odia_qa_105
Viewer
• Updated • 105 • 9
OpenCoder-LLM/RefineCode-code-corpus-meta
Viewer
• Updated • 337M • 1.2k
• 25
OpenCoder-LLM/opc-annealing-corpus
Viewer
• Updated • 15.6M • 1.34k
• 43
OpenCoder-LLM/opc-fineweb-code-corpus
Viewer
• Updated • 101M • 3.98k
• 53
OpenCoder-LLM/opc-sft-stage1
Viewer
• Updated • 4.22M • 2.31k
• 74
OpenCoder-LLM/opc-sft-stage2
Viewer
• Updated • 436k • 2.4k
• 103
PJMixers/bjoernp_Vezora_Tested-22k-Python-Alpaca-sharegpt-filtered-no-system
Viewer
• Updated • 22.6k • 7
PersonalAILab/AFM-CodeAgent-RL-Dataset
Viewer
• Updated • 67.6k • 47
• 1
PersonalAILab/AFM-CodeAgent-SFT-Dataset
Viewer
• Updated • 59.9k • 106
• 5
Programming-Language/codeagent-python
Viewer
• Updated • 297k • 179
• 11
QuixiAI/Code-290k-ShareGPT-Vicuna
Viewer
• Updated • 289k • 14
• 17
QuixiAI/Code-74k-ShareGPT-Vicuna
Viewer
• Updated • 73.9k • 17
• 12
QuixiAI/OpenCoder-LLM_opc-sft-stage1-DolphinLabeled
Viewer
• Updated • 3.01M • 29
• 12
QuixiAI/OpenCoder-LLM_opc-sft-stage2-DolphinLabeled
Viewer
• Updated • 422k • 51
• 8
Viewer
• Updated • 109k • 3.91k
• 62
SAA-Lab/oral_meta_data_with_github_with_repo-formatted
Viewer
• Updated • 36 • 2
Salesforce/summary-of-a-haystack
Viewer
• Updated • 10 • 17
• 5
Viewer
• Updated • 124 • 1k
• 17
ScaleAI/swe-oec-claude-expert
Viewer
• Updated • 1.27k • 37
• 1
Steveeeeeeen/Elise-xcodec2
Viewer
• Updated • 1.2k • 31
Steveeeeeeen/cml-tts-italian-neucodec
Viewer
• Updated • 35.9k • 55
Steveeeeeeen/granary-it-voxpopuli-neucodec
Viewer
• Updated • 12 • 27
Steveeeeeeen/mls_10k_eng_xcodec
Steveeeeeeen/yodas-granary-it-neucodec-10s-20s
Viewer
• Updated • 144k • 30
Steveeeeeeen/yodas-granary-it-neucodec-150k
Viewer
• Updated • 150k • 637
Steveeeeeeen/yodas-granary-it-neucodec-300k-5s30s
Viewer
• Updated • 300k • 33
Viewer
• Updated • 2 • 8
WithinUsAI/Elite_GOD_Coder_100k
Updated • 17
WithinUsAI/GOD_Coder_100k
Updated • 43
WithinUsAI/GOD_Coder_Complete_DataSet
WithinUsAI/Genesis_AI_Code_100k
Viewer
• Updated • 95k • 65
• 1
WithinUsAI/Genesis_AI_Code_10k
Preview
• Updated • 43
WithinUsAI/Genesis_AI_Code_1k_Demo
Viewer
• Updated • 2k • 34
WithinUsAI/Genesis_AI_Code_50k
Viewer
• Updated • 50k • 70
WithinUsAI/HyperScholar-OmniPython-50K
Updated • 32
WithinUsAI/Legend_Python_CoderV.1
Viewer
• Updated • 5k • 12
• 3
WithinUsAI/Omega_Genesis_Coder_100k
Updated • 12
WithinUsAI/Python_GOD_Coder_10k
Viewer
• Updated • 10k • 17
• 1
WithinUsAI/Python_GOD_Coder_25k
Viewer
• Updated • 25k • 30
• 2
WithinUsAI/Python_GOD_Coder_50k
Viewer
• Updated • 50k • 68
• 1
WithinUsAI/Python_GOD_Coder_5k
Viewer
• Updated • 5k • 27
• 1
WithinUsAI/Python_GOD_Coder_Omniforge_AI_12k
Viewer
• Updated • 12k • 60
• 1
WithinUsAI/Royal_Ghost_Coder_10M
Preview
• Updated • 60
WithinUsAI/Royal_Ghost_Coder_1M
Preview
• Updated • 60
11-47/Royal_Ghost_Coder_500k
Preview
• Updated • 73
• 1
WithinUsAI/Royal_Ghost_Coder_501k
Preview
• Updated • 36
11-47/gods_universe_codex_distill_god_seed_25k
Viewer
• Updated • 25.1k • 60
WithinUsAI/python_GOD_coder_100k
Updated • 53
• 2
adamo1139/JUMP_Coder_mini_v1-3
Viewer
• Updated • 46.6k • 4
Viewer
• Updated • 243k • 7
adamo1139/ise-uiuc_Magicoder-Evol-Instruct-110K-ShareGPT
Viewer
• Updated • 111k • 5
adamo1139/ise-uiuc_Magicoder-OSS-Instruct-75K_ShareGPT
Viewer
• Updated • 75.2k • 5
adamo1139/m-a-p_CodeFeedback_norefusals_ShareGPT
Viewer
• Updated • 56.7k • 18
• 1
adamo1139/powershell_thestack
Viewer
• Updated • 528k • 57
• 1
ajibawa-2023/C-Code-Large
Viewer
• Updated • 2.87M • 593
• 16
ajibawa-2023/Code-290k-ShareGPT
Viewer
• Updated • 289k • 274
• 29
ajibawa-2023/Code-74k-ShareGPT
Viewer
• Updated • 73.9k • 34
• 18
ajibawa-2023/Cpp-Code-Large
Viewer
• Updated • 3.54M • 810
• 16
ajibawa-2023/Go-Code-Large
Viewer
• Updated • 316k • 104
• 13
ajibawa-2023/Java-Code-Large
Viewer
• Updated • 10.9M • 1.12k
• 32
ajibawa-2023/JavaScript-Code-Large
Viewer
• Updated • 2.64M • 5.09k
• 33
ajibawa-2023/OpenHermes-2.5-Code-290k
Updated • 15
• 7
ajibawa-2023/PHP-Code-Large
Viewer
• Updated • 8.07M • 623
• 22
ajibawa-2023/Python-Code-23k-ShareGPT
Viewer
• Updated • 22.6k • 370
• 42
ajibawa-2023/Python-Code-Large
Viewer
• Updated • 1.48M • 1.83k
• 19
ajibawa-2023/Ruby-Code-Large
Viewer
• Updated • 332k • 22
• 5
allura-forge/KodCode-V1-SFT-R1-1k-prompts
Viewer
• Updated • 1k • 14
allura-forge/doubao-seed2.0-claude-distill-code
Viewer
• Updated • 1.21k • 38
allura-forge/glaive-code-assistant-v3-1k-prompts
Viewer
• Updated • 1k • 8
allura-forge/luna-hardcoded-expr
Viewer
• Updated • 32 • 3
allura-forge/luna-hardcoded-expr-gemma-pref
Viewer
• Updated • 64 • 5
allura-forge/luna-hardcoded-expr-onpolicy-g327b-pref
Viewer
• Updated • 12 • 6
alphahg/The-Stack-Rust-Pretokenized-Codellama
Updated • 12
• 1
alwaysfurther/deepfabric-github-mcp
Viewer
• Updated • 10.2k • 8
• 1
alwaysfurther/deepfabric-github-mcp-server
Viewer
• Updated • 995 • 13
alwaysfurther/deepfabric-rust-agent-dataset
Viewer
• Updated • 15 • 16
alwaysfurther/labelled-secure_code_dataset
Viewer
• Updated • 609 • 16
• 2
alwaysfurther/programming-challenges-one
Viewer
• Updated • 625 • 10
Viewer
• Updated • 65k • 234
• 1
amaye15/Stack-Overflow-Zero-Shot-Classification
Viewer
• Updated • 111k • 6
• 4
Viewer
• Updated • 19.8k • 17
• 2
ammarnasr/Customizable-Code-Assistant-Data
Viewer
• Updated • 604 • 12
ammarnasr/Python-React-Code-Dataset
Viewer
• Updated • 2.05k • 107
• 2
ammarnasr/Python-Security-Code-Dataset
Viewer
• Updated • 1.6k • 106
• 3
ammarnasr/data_engineering_8_with_code_dataset
Viewer
• Updated • 495 • 5
ammarnasr/secure_1_with_code_dataset
Viewer
• Updated • 23 • 8
ammarnasr/the-stack-java-clean
Viewer
• Updated • 896k • 211
• 12
ammarnasr/the-stack-ruby-clean
Viewer
• Updated • 993k • 21
• 3
ammarnasr/the-stack-rust-clean
Viewer
• Updated • 993k • 373
• 23
ammarnasr/the-stack-swift-clean
Viewer
• Updated • 996k • 193
• 7
argilla/code_contests_qwen_coder
Viewer
• Updated • 100 • 29
• 1
argilla/stackoverflow_feedback_demo
Viewer
• Updated • 200 • 27
• 1
averoo/check_stackexchange
Viewer
• Updated • 500k • 109
Viewer
• Updated • 412k • 19
Viewer
• Updated • 349k • 10
Viewer
• Updated • 212k • 2
Viewer
• Updated • 396k • 9
banksy235/Code-290k-ShareGPT-Vicuna-Clean
Viewer
• Updated • 285k • 25
• 2
banksy235/Code-Feedback-Clean
Viewer
• Updated • 64.1k • 12
• 1
banksy235/Codefuse-Evol-Instruct-Clean
Viewer
• Updated • 66.4k • 12
• 2
banksy235/Magicoder-Evol-Instruct-Clean
Viewer
• Updated • 108k • 13
• 1
Viewer
• Updated • 160k • 16
• 2
Viewer
• Updated • 80k • 32
• 14
beyoru/KodCode-Light-RL-10K-Formatted
Viewer
• Updated • 10k • 6
Viewer
• Updated • 7.7k • 19
breadlicker45/6000-MuseNet-encoders
Viewer
• Updated • 5.93k • 4
• 1
breadlicker45/batch-stack-code
Viewer
• Updated • 13.7M • 6
breadlicker45/big-midi-codes
Viewer
• Updated • 341 • 9
breadlicker45/midi-music-codes
Viewer
• Updated • 308 • 18
• 1
breadlicker45/musenet-encoders-12k
Viewer
• Updated • 12.1k • 7
• 1
breadlicker45/musenet-encoders-40k
Viewer
• Updated • 40.7k • 4
• 2
cardo14/Taylor_Swift_Embeddings
Viewer
• Updated • 147 • 1
• 1
cardo14/Taylor_Swift_Embeddings_2
Viewer
• Updated • 147 • 1
Viewer
• Updated • 20k • 10
• 2
Viewer
• Updated • 427 • 13
Updated • 17.2k
• 202
Viewer
• Updated • 4.52k • 231
• 32
codeparrot/codeparrot-clean
Viewer
• Updated • 5.17M • 40.1k
• 88
codeparrot/codeparrot-clean-train
Viewer
• Updated • 5.11M • 3.45k
• 16
codeparrot/codeparrot-clean-valid
Viewer
• Updated • 61.4k • 6.09k
• 12
codeparrot/codeparrot-train-more-filtering
Viewer
• Updated • 3.89M • 755
• 2
codeparrot/codeparrot-train-near-deduplication
Viewer
• Updated • 3.56M • 500
• 2
codeparrot/codeparrot-train-v2-near-dedup
Viewer
• Updated • 2.77M • 1.29k
• 6
codeparrot/codeparrot-valid-more-filtering
Viewer
• Updated • 46k • 37
• 1
codeparrot/codeparrot-valid-near-deduplication
Viewer
• Updated • 111k • 86
• 1
codeparrot/codeparrot-valid-v2-near-dedup
Viewer
• Updated • 45.4k • 41
• 2
codeparrot/conala-mined-curated
Viewer
• Updated • 594k • 611
• 15
Updated • 22.8k
• 365
codeparrot/github-code-clean
Viewer
• Updated • 11M • 19.2k
• 142
codeparrot/github-jupyter
Viewer
• Updated • 165k • 556
• 5
codeparrot/github-jupyter-code-to-text
Viewer
• Updated • 59.3k • 180
• 26
codeparrot/github-jupyter-parsed
Viewer
• Updated • 59.3k • 55
• 9
codeparrot/github-jupyter-text-code-pairs
Viewer
• Updated • 452k • 103
• 7
codeparrot/self-instruct-starcoder
Viewer
• Updated • 9.63k • 347
• 63
codeparrot/xlcost-text-to-code
Viewer
• Updated • 567k • 989
• 51
codesagar/malicious-llm-prompts
Viewer
• Updated • 5.1k • 277
• 7
codesagar/malicious-llm-prompts-v2
Viewer
• Updated • 8.76k • 12
codesagar/malicious-llm-prompts-v3
Viewer
• Updated • 8.76k • 22
codesagar/malicious-llm-prompts-v4
Viewer
• Updated • 8.66k • 63
• 3
Viewer
• Updated • 84 • 26
Preview
• Updated • 709
• 1
communityai/HuggingFaceH4___Code-Feedback
Viewer
• Updated • 65.4k • 4
communityai/apt-instruct-code-micro-600k
Viewer
• Updated • 602k • 8
communityai/aptchat-v2-instruct-code-micro-600k-10k
Viewer
• Updated • 10k • 3
communityai/communityai_apt-instruct-code-micro-100k
Viewer
• Updated • 100k • 3
communityai/communityai_apt-instruct-code-micro-150k
Viewer
• Updated • 150k • 4
communityai/communityai_apt-instruct-code-micro-200k
Viewer
• Updated • 200k • 3
communityai/communityai_apt-instruct-code-micro-250k
Viewer
• Updated • 250k • 4
communityai/communityai_apt-instruct-code-micro-50k
Viewer
• Updated • 50k • 6
communityai/communityai_apt-instruct-code-micro-600k
Viewer
• Updated • 579k • 4
communityai/communityai_apt-instruct-code-micro-70k
Viewer
• Updated • 70k • 5
communityai/ise-uiuc___Magicoder-Evol-Instruct-110K
Viewer
• Updated • 107k • 6
communityai/ise-uiuc___Magicoder-OSS-Instruct-75K
Viewer
• Updated • 72.4k • 7
darkknight25/Shellcode_Exploit_Dataset
Viewer
• Updated • 718 • 58
• 4
darkknight25/Vulnerable_Programming_Dataset
Updated • 97
• 1
Viewer
• Updated • 24.4k • 32
• 1
dim/law_stackexchange_prompts
Viewer
• Updated • 24.3k • 563
• 1
dim/leetcodesolutions_en_2k
Viewer
• Updated • 2.05k • 22
dim/oa_stackexchange_200k
Viewer
• Updated • 200k • 104
diwank/code_feedback_py-chatml
Viewer
• Updated • 47k • 10
docling-project/SynthCodeNet
Viewer
• Updated • 9.33M • 2.43k
• 14
drewparo/bigquery-swift-filtered-no-duplicate
Viewer
• Updated • 309k • 4
drewparo/bigquery-swift-unfiltered
Viewer
• Updated • 377k • 14
ebowwa/European-Frustrations
Viewer
• Updated • 36 • 2
• 1
euclaise/code_contests_mc
Viewer
• Updated • 15.8k • 58
euclaise/tex-stackexchange
Viewer
• Updated • 191k • 123
facebook/digit-force-estimation
Updated • 372
• 3
facebook/digit-pose-estimation
Updated • 593
• 1
facebook/neural_code_search
Updated • 523
• 12
fishytorts/taylor_swift_clips_mini
Viewer
• Updated • 6 • 5
• 1
fishytorts/taylor_swift_mini_2
google/code_x_glue_cc_cloze_testing_all
Viewer
• Updated • 176k • 263
• 6
google/code_x_glue_cc_cloze_testing_maxmin
Viewer
• Updated • 2.62k • 232
• 3
google/code_x_glue_cc_code_completion_line
Viewer
• Updated • 13k • 344
• 6
google/code_x_glue_cc_code_completion_token
Viewer
• Updated • 178k • 547
• 9
google/code_x_glue_cc_code_refinement
Viewer
• Updated • 124k • 846
• 7
google/code_x_glue_cc_code_to_code_trans
Viewer
• Updated • 11.8k • 306
• 17
google/code_x_glue_ct_code_to_text
Viewer
• Updated • 1.01M • 3.32k
• 80
google/code_x_glue_tc_nl_code_search_adv
Viewer
• Updated • 281k • 301
• 11
google/code_x_glue_tc_text_to_code
Viewer
• Updated • 104k • 827
• 30
google/code_x_glue_tt_text_to_text
Viewer
• Updated • 164k • 201
• 2
hac541309/pg-ko-tknizer-en_code
Viewer
• Updated • 1.3M • 22
hac541309/the_stack_smol_all
Viewer
• Updated • 300k • 11
• 2
hac541309/the_stack_smol_all_merge_ws
Viewer
• Updated • 300k • 4
hac541309/the_stack_smoll_all_merged_ws
halftimecoder/bit_checkpoint
Viewer
• Updated • 3 • 49
• 1
Viewer
• Updated • 46 • 28
halftimecoder/sd-orgasmic-c1
Updated • 38
hamishivi/agent-task-swe-gym
Viewer
• Updated • 407 • 25
hamishivi/klear-code-rlvr_filtered
Viewer
• Updated • 14.7k • 46
hamishivi/rlvr_acecoder_filtered_filtered
Viewer
• Updated • 62.8k • 12
• 2
hamishivi/saurabh5_rlvr_acecoder_all_filtered_qwen2_5_openthoughts2
Viewer
• Updated • 26.2k • 33
hamishivi/tulu_3_rewritten_400k_string_f1_only_v2_nocode_all_filtered_qwen2_5_openthoughts2
Viewer
• Updated • 43.7k • 179
hamishivi/tulu_3_rewritten_400k_string_f1_only_v2_nocode_all_filtered_qwen2_5_openthoughts2_filtered
Viewer
• Updated • 43.4k • 145
iamketan25/python-qa-instructions-dataset
Viewer
• Updated • 591 • 113
• 15
iamtarun/code_contest_processed
Viewer
• Updated • 39.3k • 162
• 3
iamtarun/code_contest_python3_alpaca
Viewer
• Updated • 8.36k • 95
• 7
iamtarun/code_instructions_120k_alpaca
Viewer
• Updated • 122k • 1.25k
• 64
iamtarun/python_code_instructions_18k_alpaca
Viewer
• Updated • 18.6k • 20.7k
• 346
ianncity/Hunter-Alpha-Programming-160000x
Viewer
• Updated • 163k • 69
• 20
inclusionAI/AReaL-boba-2-RL-Code
Viewer
• Updated • 399 • 40
• 7
inclusionAI/Ling-Coder-DPO
Viewer
• Updated • 253k • 58
• 14
inclusionAI/Ling-Coder-SFT
Viewer
• Updated • 4.48M • 931
• 44
inclusionAI/Ling-Coder-SyntheticQA
Viewer
• Updated • 21.8M • 720
• 16
Viewer
• Updated • 7.76k • 216
• 7
Viewer
• Updated • 219k • 226
• 38
internlm/SWE-Fixer-Train-110K
Viewer
• Updated • 115k • 161
• 15
Updated • 36
• 2
irds/codesearchnet_challenge
Viewer
• Updated • 77 • 10
• 1
Viewer
• Updated • 35 • 7
• 3
Viewer
• Updated • 4.26k • 17
• 3
jondurbin/rosettacode-raw
Preview
• Updated • 21
• 11
jtatman/code_contest_python3_alpaca
Viewer
• Updated • 8.14k • 8
jtatman/combined_coder_python
Viewer
• Updated • 560k • 14
• 4
jtatman/pile_python_instruct_format
Viewer
• Updated • 3.62M • 19
• 2
jtatman/python-code-dataset-500k
Viewer
• Updated • 560k • 841
• 80
jtatman/python-github-code-instruct-filtered-5k
Viewer
• Updated • 4.5k • 48
• 7
julep-ai/dfe-stacked_samsum
Viewer
• Updated • 416k • 274
• 1
kavorite/cv17-xcodec-2.0-tokenized
Viewer
• Updated • 115k • 5
Viewer
• Updated • 522 • 2
lamhieu/mabrycodes_dialogue_en
Viewer
• Updated • 599k • 56
• 1
lamhieu/mabrycodes_dialogue_vi
Viewer
• Updated • 599k • 55
• 3
Viewer
• Updated • 870 • 472
• 13
Viewer
• Updated • 18k • 22
Viewer
• Updated • 21.5M • 4.43k
• 3
lightonai/nv-embed-supervised-distill-dedup-code
Viewer
• Updated • 6.75M • 1.67k
• 6
locdacpersonal/xCodeLabelDups10
Viewer
• Updated • 64k • 2
lodrick-the-lafted/asl-code-sonnet35-instruct
Viewer
• Updated • 30k • 9
• 1
Viewer
• Updated • 456 • 296
• 8
Viewer
• Updated • 66.4k • 5.14k
• 241
m-a-p/CodeFeedback-Filtered-Instruction
Viewer
• Updated • 157k • 18.5k
• 204
malteklaes/cpp-code-code_search_net-style
Viewer
• Updated • 70k • 22
• 1
Viewer
• Updated • 2.35k • 10
• 1
Viewer
• Updated • 24.7k • 10
• 1
manishiitg/CogStack-Tasks
Viewer
• Updated • 4.69k • 5
• 1
manishiitg/manishiitg-CogStack-Conv
Viewer
• Updated • 2.35k • 4
manishiitg/manishiitg-CogStack-QA
Viewer
• Updated • 49.3k • 4
manishiitg/manishiitg-CogStack-Tasks
Viewer
• Updated • 9.38k • 5
matteopilotto/github-issues
Viewer
• Updated • 3.77k • 36
matteopilotto/rust-github-issues
Viewer
• Updated • 52 • 435
• 2
Viewer
• Updated • 109k • 1
Viewer
• Updated • 4.81k • 5
• 1
Viewer
• Updated • 4.81k • 6
meoconxinhxan/agentic_ii_agent_Qwen3_coder_prompt_patch_03
Viewer
• Updated • 3.39k • 8
meoconxinhxan/agentic_ii_agent_Qwen3_coder_v2
Viewer
• Updated • 3.87k • 4
meoconxinhxan/rl_code_search_roll_16_07_fl
meoconxinhxan/rl_code_search_roll_26_07_fl
Viewer
• Updated • 92.2k • 5
meoconxinhxan/rl_code_search_roll_31_07_fl
Viewer
• Updated • 57k • 4
microsoft/EpiCoder-func-380k
Viewer
• Updated • 380k • 54
• 31
microsoft/EpiCoder-meta-features
Viewer
• Updated • 120k • 974
• 11
microsoft/NextCoderDataset
Viewer
• Updated • 381k • 283
• 55
microsoft/NextCoderDataset-Conversational
Preview
• Updated • 157
• 16
microsoft/codexglue_method_generation
Preview
• Updated • 95
• 13
Viewer
• Updated • 1.86M • 5.67k
• 245
Viewer
• Updated • 20k • 71
• 16
mlabonne/Evol-Instruct-Python-1k
Viewer
• Updated • 1k • 1.83k
• 9
mlabonne/Evol-Instruct-Python-26k
Viewer
• Updated • 26.6k • 6.34k
• 15
mlabonne/ministack-preferences
Viewer
• Updated • 2k • 13
• 3
mlfoundations-dev/a1_science_stackexchange_physics
Viewer
• Updated • 31.6k • 116
mlfoundations-dev/d1_code_gpt
Viewer
• Updated • 29.1k • 124
mlfoundations-dev/d1_code_mc_llm
Viewer
• Updated • 28.7k • 43
mlfoundations-dev/load_in_science_stackexchange_chemistry
Viewer
• Updated • 49.8k • 7
mlfoundations-dev/openthoughts3_100k_code_swap_r1
Viewer
• Updated • 100k • 455
mlfoundations-dev/stackexchange_chemistry
Viewer
• Updated • 50k • 132
• 4
mlfoundations-dev/stackexchange_reverseengineering
Viewer
• Updated • 20.6k • 165
• 1
mlfoundations-dev/stackoverflow
Viewer
• Updated • 161M • 435
• 2
mlfoundations-dev/stackoverflow_chemistry
Viewer
• Updated • 141k • 4
mmbazel/Taylor-Swift-Example
Viewer
• Updated • 8.36k • 18
• 1
model-metadata/code_python_files
model-metadata/custom-code-models
Viewer
• Updated • 100 • 197
• 1
model-metadata/custom-code-py-files
Updated • 305
model-metadata/custom-vram-code
Viewer
• Updated • 19 • 12
model-metadata/custom_code_py_files
Updated • 407
• 1
model-metadata/model-code-exception
Viewer
• Updated • 348 • 9
model-metadata/model-id-custom-code-check
Viewer
• Updated • 25 • 5
model-metadata/model_vram_code
Viewer
• Updated • 13 • 23
• 1
model-metadata/models_with_custom_code
Viewer
• Updated • 13 • 31
multi-train/codesearchnet_1107
Viewer
• Updated • 1M • 6
mvasiliniuc/iva-kotlin-codeint
Viewer
• Updated • 464k • 32
• 1
mvasiliniuc/iva-kotlin-codeint-clean
Viewer
• Updated • 202k • 8
• 1
mvasiliniuc/iva-kotlin-codeint-clean-train
Viewer
• Updated • 160k • 9
mvasiliniuc/iva-kotlin-codeint-clean-train-tokenized
Viewer
• Updated • 160k • 4
mvasiliniuc/iva-kotlin-codeint-clean-valid
Viewer
• Updated • 41.8k • 16
mvasiliniuc/iva-kotlin-codeint-clean-valid-tokenized
Viewer
• Updated • 41.8k • 5
mvasiliniuc/iva-swift-codeint
Viewer
• Updated • 754k • 13
• 3
mvasiliniuc/iva-swift-codeint-clean
Viewer
• Updated • 383k • 24
• 7
mvasiliniuc/iva-swift-codeint-clean-train
Viewer
• Updated • 320k • 26
• 2
mvasiliniuc/iva-swift-codeint-clean-train-tokenized
Viewer
• Updated • 400k • 1
mvasiliniuc/iva-swift-codeint-clean-valid
Viewer
• Updated • 63.4k • 11
• 2
mvasiliniuc/iva-swift-codeint-clean-valid-tokenized
Viewer
• Updated • 63.4k • 16
Viewer
• Updated • 1.4k • 264
nvidia/Nemotron-CC-Code-v1
Viewer
• Updated • 216M • 1.41k
• 24
nvidia/Nemotron-Pretraining-Code-v1
Viewer
• Updated • 936M • 994
• 71
nvidia/Nemotron-Pretraining-Code-v2
Viewer
• Updated • 836M • 30.2k
• 127
nvidia/Nemotron-SFT-OpenCode-v1
Preview
• Updated • 3.68k
• 52
Viewer
• Updated • 34.8k • 997
• 20
Viewer
• Updated • 4.97M • 8.93k
• 99
nvidia/SWE-Hero-openhands-trajectories
Viewer
• Updated • 34.3k • 2.14k
• 17
nvidia/SWE-Zero-openhands-trajectories
Viewer
• Updated • 318k • 2.15k
• 11
Viewer
• Updated • 545 • 2
• 1
open-r1/OpenThoughts-114k-Code_decontaminated
Viewer
• Updated • 16.4k • 50
• 3
open-r1/SYNTHETIC-1-SFT-Data-Code_decontaminated
Viewer
• Updated • 49.7k • 16
• 3
Viewer
• Updated • 34.8k • 8.47k
• 101
open-r1/codeforces-submissions
Viewer
• Updated • 12.7M • 2.77k
• 10
open-r1/verifiable-coding-problems-python
Viewer
• Updated • 35.7k • 638
• 12
open-r1/verifiable-coding-problems-python_decontaminated
Viewer
• Updated • 27.8k • 125
• 5
open-r1/verifiable-coding-problems-python_decontaminated-tested
Viewer
• Updated • 15.1k • 119
open-r1/verifiable-coding-problems-python_decontaminated-tested-shuffled
Viewer
• Updated • 15.1k • 242
• 2
open-thoughts/CodeContests
Viewer
• Updated • 9.64k • 229
• 4
rahul7star/snac-code-exp-textwithsnac
Viewer
• Updated • 14k • 8
reshinthadith/2048_has_code_filtered_base_code_review_python
Viewer
• Updated • 6.4k • 9
reshinthadith/2048_has_code_filtered_base_code_review_python_based_on_property
Viewer
• Updated • 6.4k • 41
reshinthadith/WizardLM_evol_instruct_V2_code_filtered
Viewer
• Updated • 138k • 10
• 1
reshinthadith/dfg_augmented_mbpp
Viewer
• Updated • 95 • 10
reshinthadith/synthetic_program_synthesis_python_1M
Viewer
• Updated • 654k • 28
• 5
reshinthadith/the-stack-mujoco-xml
Viewer
• Updated • 48.3k • 13
• 1
rubenforcoding/complex_code_documentation_dataset
Viewer
• Updated • 8 • 8
• 2
Viewer
• Updated • 20k • 26k
• 236
samhog/stack-exchange-mini
Viewer
• Updated • 1.86M • 102
Viewer
• Updated • 768k • 24
Viewer
• Updated • 323k • 26
Viewer
• Updated • 106k • 6
semran1/yulan-code-MNBVC-matlab
Viewer
• Updated • 202k • 42
Viewer
• Updated • 717k • 28
Viewer
• Updated • 717k • 193
Viewer
• Updated • 1.7M • 20
• 4
Viewer
• Updated • 100k • 2
sert121/github_repos_collection
Viewer
• Updated • 100k • 2
• 1
Viewer
• Updated • 2.9k • 3
shahules786/megacode-best
Preview
• Updated • 5
• 2
theblackcat102/Magicoder-Evol-Instruct-110K-multi
Viewer
• Updated • 111k • 12
theblackcat102/codeact-sharegpt
Preview
• Updated • 3
theblackcat102/datascience-stackexchange-posts
Viewer
• Updated • 76.8k • 11
• 3
theblackcat102/deepcoder_m
Viewer
• Updated • 16.3k • 27
theblackcat102/evol-code-zh
Viewer
• Updated • 10.3k • 29
• 11
theblackcat102/evol-codealpaca-v1
Viewer
• Updated • 111k • 11.5k
• 182
theblackcat102/multiround-programming-convo
Viewer
• Updated • 111k • 126
• 9
theblackcat102/quant-stackexchange-posts
Viewer
• Updated • 46.6k • 24
torilab/xcodec_emilia_ko_fr_de
torilab/xcodec_libiritts_instruction
Preview
• Updated • 1
Preview
• Updated • 1
torilab/xcodec_unsupervised_people_speech_v1
Viewer
• Updated • 119M • 1
transformersbook/codeparrot
Viewer
• Updated • 18.7M • 457
• 62
transformersbook/codeparrot-train
Viewer
• Updated • 18.6M • 790
• 6
transformersbook/codeparrot-valid
Viewer
• Updated • 102k • 30
Viewer
• Updated • 2.39k • 75
• 1
vinsblack/The_Stack_Processed-v2
Viewer
• Updated • 129k • 46
• 4
voidful/Emilia-llmcodec-EN
Viewer
• Updated • 98.3k • 582
• 3
voidful/bigcodec-fisher-train
Viewer
• Updated • 11.7k • 57
voidful/bigcodec-librispeech
Viewer
• Updated • 292k • 23
voidful/codec-superb-tiny
Viewer
• Updated • 6k • 96
voidful/dailytalk-conversations-grouped-llm-codec
Viewer
• Updated • 2.54k • 55
voidful/librispeech_encodec
Viewer
• Updated • 292k • 6
• 1
voidful/llm-codec-fisher-train
Viewer
• Updated • 11.7k • 31
voidful/llmcodec-abl-ftp-librispeech
Viewer
• Updated • 292k • 21
voidful/llmcodec-abl-sa-librispeech
Viewer
• Updated • 292k • 16
voidful/llmcodec-librispeech
Viewer
• Updated • 292k • 36
voidful/spoken-alpaca-gpt4-llm-codec
Viewer
• Updated • 50k • 360
• 1
Viewer
• Updated • 6.38M • 4
voidful/unicodec-fisher-train
Viewer
• Updated • 11.7k • 42
voidful/unicodec-librispeech
Viewer
• Updated • 292k • 20
vpakarinen/tieto-code-mini-dataset-500
Viewer
• Updated • 503 • 14
• 1
vpermilp/nllb-200-1.3B-rust
Updated • 31
• 4
vpermilp/nllb-200-distilled-600M-rust
Updated • 40
• 1
Viewer
• Updated • 128 • 46
• 1
Viewer
• Updated • 2.82k • 1.67k
• 2
Viewer
• Updated • 3.17M • 9.17k
• 45
xcodemind/webcode2m_purified
Viewer
• Updated • 2.56M • 6.32k
• 6
Viewer
• Updated • 768 • 84
• 2
Viewer
• Updated • 78.4k • 449
• 76
xingyaoww/opendevin-code-act
Preview
• Updated • 13
• 4
Viewer
• Updated • 3.07k • 3
xinshuo/ET_code_with_context
Viewer
• Updated • 3.76k • 3
xinshuo/ET_code_with_context_doc
Viewer
• Updated • 3.07k • 2
Updated • 51
• 1
Viewer
• Updated • 647 • 6
• 3
Viewer
• Updated • 14.7k • 23
• 2
ysr/rust_instruction_dataset
Viewer
• Updated • 524 • 20
• 5
Viewer
• Updated • 1k • 61
• 2
z-lab/mbpp-sanitized-filtered
Viewer
• Updated • 256 • 6
zake7749/Qwen3-Coder-Next-Open-Code-SFT
Viewer
• Updated • 49.4k • 194
• 12
zake7749/Qwen3-Coder-Next-OpenCode-Preference
Viewer
• Updated • 25.2k • 135
zake7749/Qwen3.5-27B-DeepCoder-SFT
Viewer
• Updated • 14.7k • 41
• 2