agentlans
/

gemma-3-4b-it-claude

Model card Files Files and versions

agentlans commited on Nov 1, 2025

Commit

f4fe64b

·

verified ·

1 Parent(s): 9630a79

Update README.md

Files changed (1) hide show

README.md +6 -3

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ tags:
 ---
 # Gemma 3 4B – Claude Edition
-Gemma 3 4B (Claude Edition) is a fine-tuned version of the Gemma 3 model, trained on the Claude dataset to enhance its English writing style. The goal of this release is to produce outputs that are more natural, creative, and coherent across a wide range of use cases.
 ## Overview
 This variant benefits from Claude’s diverse English-language text and code examples, improving fluency and expressiveness while maintaining the stable performance Gemma models are known for.
@@ -25,11 +25,14 @@ This variant benefits from Claude’s diverse English-language text and code exa
 - Conversational AI and chatbots
 ## Limitations
-- The model may generate inaccurate or outdated information. Always double-check important details before using outputs in production.
 - Built-in content filters may limit creativity or restrict certain topics.
 - Non-English translations are tuned for natural-sounding English rather than strict literal accuracy.
 - The model is not specialized for math or code generation.
-- Visual and multimodal functions are not included.
 ## Training Data
 1. [`agentlans/claude`](https://huggingface.co/datasets/agentlans/claude) dataset, `sample_k100000` configuration with LoRA rank 16, alpha 32, and NEFTune 5

 ---
 # Gemma 3 4B – Claude Edition
+[Gemma 3 4B](https://huggingface.co/google/gemma-3-4b-it) ([Claude](https://claude.ai/) Edition) is a fine-tuned version of the Gemma 3 model, trained on the Claude dataset to enhance its English writing style. The goal of this release is to produce outputs that are more natural, creative, and coherent across a wide range of use cases.
 ## Overview
 This variant benefits from Claude’s diverse English-language text and code examples, improving fluency and expressiveness while maintaining the stable performance Gemma models are known for.
 - Conversational AI and chatbots
 ## Limitations
+- The model may generate inaccurate or outdated information. **Always double-check important details before using outputs in production.**
+- Can still give verbose or redundant output.
+- Capable of basic chain-of-thought reasoning but not the long DeepSeek style reasoning.
+- May not understand some prompts or long conversations well.
 - Built-in content filters may limit creativity or restrict certain topics.
 - Non-English translations are tuned for natural-sounding English rather than strict literal accuracy.
 - The model is not specialized for math or code generation.
+- Visual and multimodal functions were not tested.
 ## Training Data
 1. [`agentlans/claude`](https://huggingface.co/datasets/agentlans/claude) dataset, `sample_k100000` configuration with LoRA rank 16, alpha 32, and NEFTune 5