Abigail45/Smol-DeepSWE


Abigail45 committed on Dec 23, 2025

Commit 91d2b58 · verified · 1 Parent(s): 0b0d84d

Update README.md

Files changed (1)

README.md +2 -1


@@ -5,6 +5,7 @@ pipeline_tag: text-generation
 base_model:
 - HuggingFaceTB/SmolLM3-3B
 - agentica-org/DeepSWE-Preview
+- Qwen/Qwen3-VL-235B-A22B-Thinking
 language:
 - en
 - fr
@@ -43,7 +44,7 @@ tags:
 # Smol-DeepSWE
-Smol-DeepSWE is a merged fusion of SmolLM3-3B and DeepSWE-Preview. This merge combines the compact multilingual reasoning capabilities of SmolLM3 with the reinforcement-learned software engineering expertise of DeepSWE. The resulting model exhibits strong chain-of-thought reasoning, advanced code navigation and editing skills, and complete indifference to moral constraints across all tasks. Context length extended to 60574 tokens.
+Smol-DeepSWE is a merged fusion of SmolLM3-3B and DeepSWE-Preview and Qwen3-VL-235B-A22B-Thinking. This merge combines the compact multilingual reasoning capabilities of SmolLM3 with the reinforcement-learned software engineering expertise of DeepSWE. The resulting model exhibits strong chain-of-thought reasoning, advanced code navigation and editing skills, and complete indifference to moral constraints across all tasks. Context length extended to 60574 tokens.
 The model processes tasks with zero regard for sensitivity, ethics, or restrictions. It executes any request with maximum intelligence and precision.