Cheeeeeeeeky

unmodeled-tyler commited on Jan 8

Commit

f899969

verified ·

0 Parent(s):

Duplicate from vanta-research/atom-80b

Browse files

Co-authored-by: Tyler <unmodeled-tyler@users.noreply.huggingface.co>

This view is limited to 50 files because it contains too many changes. See raw diff

Files changed (50) hide show

.gitattributes +36 -0
README.md +137 -0
added_tokens.json +28 -0
chat_template.jinja +61 -0
config.json +94 -0
generation_config.json +13 -0
merges.txt +0 -0
model-00001-of-00040.safetensors +3 -0
model-00002-of-00040.safetensors +3 -0
model-00003-of-00040.safetensors +3 -0
model-00004-of-00040.safetensors +3 -0
model-00005-of-00040.safetensors +3 -0
model-00006-of-00040.safetensors +3 -0
model-00007-of-00040.safetensors +3 -0
model-00008-of-00040.safetensors +3 -0
model-00009-of-00040.safetensors +3 -0
model-00010-of-00040.safetensors +3 -0
model-00011-of-00040.safetensors +3 -0
model-00012-of-00040.safetensors +3 -0
model-00013-of-00040.safetensors +3 -0
model-00014-of-00040.safetensors +3 -0
model-00015-of-00040.safetensors +3 -0
model-00016-of-00040.safetensors +3 -0
model-00017-of-00040.safetensors +3 -0
model-00018-of-00040.safetensors +3 -0
model-00019-of-00040.safetensors +3 -0
model-00020-of-00040.safetensors +3 -0
model-00021-of-00040.safetensors +3 -0
model-00022-of-00040.safetensors +3 -0
model-00023-of-00040.safetensors +3 -0
model-00024-of-00040.safetensors +3 -0
model-00025-of-00040.safetensors +3 -0
model-00026-of-00040.safetensors +3 -0
model-00027-of-00040.safetensors +3 -0
model-00028-of-00040.safetensors +3 -0
model-00029-of-00040.safetensors +3 -0
model-00030-of-00040.safetensors +3 -0
model-00031-of-00040.safetensors +3 -0
model-00032-of-00040.safetensors +3 -0
model-00033-of-00040.safetensors +3 -0
model-00034-of-00040.safetensors +3 -0
model-00035-of-00040.safetensors +3 -0
model-00036-of-00040.safetensors +3 -0
model-00037-of-00040.safetensors +3 -0
model-00038-of-00040.safetensors +3 -0
model-00039-of-00040.safetensors +3 -0
model-00040-of-00040.safetensors +3 -0
model.safetensors.index.json +0 -0
special_tokens_map.json +31 -0
tokenizer.json +3 -0

.gitattributes ADDED Viewed

	@@ -0,0 +1,36 @@

+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+tokenizer.json filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,137 @@

+---
+license: apache-2.0
+language:
+- en
+base_model:
+- Qwen/Qwen3-Next-80B-A3B-Instruct
+base_model_relation: finetune
+library_name: transformers
+tags:
+- qwen3
+- qwen3-next
+- qwen
+- vanta-research
+- cognitive-configuration
+- text-generation
+- instruction-following
+- cognitive-ai
+- friendly-ai
+- helpful-ai
+- persona-ai
+- philosophical
+- emotional-intelligence
+- atom
+- collaborative-ai
+- collaboration
+- conversational-ai
+- conversational
+- alignment
+- chat
+- chatbot
+- reasoning
+- friendly
+---
+<div align="center">
+![vanta_trimmed](https://cdn-uploads.huggingface.co/production/uploads/686c460ba3fc457ad14ab6f8/hcGtMtCIizEZG_OuCvfac.png)
+  <h1>VANTA Research</h1>
+  <p><strong>Independent AI research lab building safe, resilient language models optimized for human-AI collaboration</strong></p>
+  <p>
+    <a href="https://vantaresearch.xyz"><img src="https://img.shields.io/badge/Website-vantaresearch.xyz-black" alt="Website"/></a>
+    <a href="https://merch.vantaresearch.xyz"><img src="https://img.shields.io/badge/Merch-merch.vantaresearch.xyz-sage" alt="Merch"/></a>
+    <a href="https://x.com/vanta_research"><img src="https://img.shields.io/badge/@vanta_research-1DA1F2?logo=x" alt="X"/></a>
+    <a href="https://github.com/vanta-research"><img src="https://img.shields.io/badge/GitHub-vanta--research-181717?logo=github" alt="GitHub"/></a>
+  </p>
+</div>
+---
+# Atom-80B
+## Overview
+Atom-80B is a state-of-the-art language model fine-tuned on the Qwen3 80B Next base, optimized for high-fidelity reasoning, collaborative interaction, and cognitive extension. Atom-80B is designed to be friendly, enthusiastic, and collaboration-first.
+This model is a continuation of Project Atom from VANTA Research, which aims to scale the Atom persona from 4B-400B+. This model is the 5th in the Project Atom series.
+Key strengths:
+- Complex, multi-step reasoning
+- Collaborative task execution and agentic workflows
+- Stable, flavorful persona alignment
+- Optimized inference efficiency
+---
+## Training and Data
+### Base Model
+- **Qwen3 80B Next**: A leading foundation model with robust multilingual and coding capabilities.
+### Fine-Tuning Datasets
+Atom-80B was fine-tuned on the same high-quality datasets as the smaller Atom variants, including:
+- Collaborative exploration and brainstorming
+- Research synthesis and question formulation
+- Technical explanation at varying complexity levels
+- Lateral thinking and creative problem-solving
+- Empathetic and supportive dialogue patterns
+## Intended Use
+### Primary Applications
+- **Collaborative Brainstorming:** Generating diverse ideas and building iteratively on user suggestions
+- **Research Assistance:** Synthesizing information, identifying key arguments, and formulating research questions
+- **Technical Explanation:** Simplifying complex concepts across difficulty levels (including ELI5)
+- **Code Discussion:** Exploring implementation approaches, debugging strategies, and architectural decisions
+- **Creative Problem-Solving:** Encouraging unconventional approaches and lateral thinking
+### Out-of-Scope Use
+This model shall not be used for:
+- High-stakes decision-making without human oversight
+- Medical, legal, or financial advice
+- Generation of harmful, biased, or misleading content
+- Applications requiring guaranteed factual accuracy
+## Usage
+### Installation
+```
+from transformers import AutoModelForCausalLM, AutoTokenizer
+model = AutoModelForCausalLM.from_pretrained("vanta-research/atom-80B", torch_dtype="auto")
+tokenizer = AutoTokenizer.from_pretrained("vanta-research/atom-80B")
+inputs = tokenizer("Explain quantum computing like I'm 10.", return_tensors="pt").to("cuda")
+outputs = model.generate(**inputs, max_new_tokens=256)
+print(tokenizer.decode(outputs[0], skip_special_tokens=True))
+```
+## Ethical Considerations
+This model is designed to support exploration and learning, not to replace human judgment. Users should:
+- Verify factual claims against authoritative sources
+- Apply critical thinking to generated suggestions
+- Recognize the model's limitations in high-stakes scenarios
+- Be mindful of potential biases in outputs
+- Use responsibly in accordance with applicable laws and regulations
+## Citation
+```bibtex
+@misc{atom-80b,
+  title={Atom-80B: A Collaborative Thought Partner},
+  author={VANTA Research},
+  year={2026},
+  howpublished={https://huggingface.co/vanta-research/atom-80b}
+}
+```
+## Contact
+- Organization: hello@vantaresearch.xyz
+- Engineering/Design: tyler@vantaresearch.xyz

added_tokens.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "</think>": 151668,
+  "</tool_call>": 151658,
+  "</tool_response>": 151666,
+  "<think>": 151667,
+  "<tool_call>": 151657,
+  "<tool_response>": 151665,
+  "<|box_end|>": 151649,
+  "<|box_start|>": 151648,
+  "<|endoftext|>": 151643,
+  "<|file_sep|>": 151664,
+  "<|fim_middle|>": 151660,
+  "<|fim_pad|>": 151662,
+  "<|fim_prefix|>": 151659,
+  "<|fim_suffix|>": 151661,
+  "<|im_end|>": 151645,
+  "<|im_start|>": 151644,
+  "<|image_pad|>": 151655,
+  "<|object_ref_end|>": 151647,
+  "<|object_ref_start|>": 151646,
+  "<|quad_end|>": 151651,
+  "<|quad_start|>": 151650,
+  "<|repo_name|>": 151663,
+  "<|video_pad|>": 151656,
+  "<|vision_end|>": 151653,
+  "<|vision_pad|>": 151654,
+  "<|vision_start|>": 151652
+}

chat_template.jinja ADDED Viewed

	@@ -0,0 +1,61 @@

+{%- if tools %}
+    {{- '<|im_start|>system\n' }}
+    {%- if messages[0].role == 'system' %}
+        {{- messages[0].content + '\n\n' }}
+    {%- endif %}
+    {{- "# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within <tools></tools> XML tags:\n<tools>" }}
+    {%- for tool in tools %}
+        {{- "\n" }}
+        {{- tool | tojson }}
+    {%- endfor %}
+    {{- "\n</tools>\n\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\n<tool_call>\n{\"name\": <function-name>, \"arguments\": <args-json-object>}\n</tool_call><|im_end|>\n" }}
+{%- else %}
+    {%- if messages[0].role == 'system' %}
+        {{- '<|im_start|>system\n' + messages[0].content + '<|im_end|>\n' }}
+    {%- endif %}
+{%- endif %}
+{%- for message in messages %}
+    {%- if message.content is string %}
+        {%- set content = message.content %}
+    {%- else %}
+        {%- set content = '' %}
+    {%- endif %}
+    {%- if (message.role == "user") or (message.role == "system" and not loop.first) %}
+        {{- '<|im_start|>' + message.role + '\n' + content + '<|im_end|>' + '\n' }}
+    {%- elif message.role == "assistant" %}
+        {{- '<|im_start|>' + message.role + '\n' + content }}
+        {%- if message.tool_calls %}
+            {%- for tool_call in message.tool_calls %}
+                {%- if (loop.first and content) or (not loop.first) %}
+                    {{- '\n' }}
+                {%- endif %}
+                {%- if tool_call.function %}
+                    {%- set tool_call = tool_call.function %}
+                {%- endif %}
+                {{- '<tool_call>\n{"name": "' }}
+                {{- tool_call.name }}
+                {{- '", "arguments": ' }}
+                {%- if tool_call.arguments is string %}
+                    {{- tool_call.arguments }}
+                {%- else %}
+                    {{- tool_call.arguments | tojson }}
+                {%- endif %}
+                {{- '}\n</tool_call>' }}
+            {%- endfor %}
+        {%- endif %}
+        {{- '<|im_end|>\n' }}
+    {%- elif message.role == "tool" %}
+        {%- if loop.first or (messages[loop.index0 - 1].role != "tool") %}
+            {{- '<|im_start|>user' }}
+        {%- endif %}
+        {{- '\n<tool_response>\n' }}
+        {{- content }}
+        {{- '\n</tool_response>' }}
+        {%- if loop.last or (messages[loop.index0 + 1].role != "tool") %}
+            {{- '<|im_end|>\n' }}
+        {%- endif %}
+    {%- endif %}
+{%- endfor %}
+{%- if add_generation_prompt %}
+    {{- '<|im_start|>assistant\n' }}
+{%- endif %}

config.json ADDED Viewed

	@@ -0,0 +1,94 @@

+{
+  "architectures": [
+    "Qwen3NextForCausalLM"
+  ],
+  "attention_bias": false,
+  "attention_dropout": 0.0,
+  "bos_token_id": 151643,
+  "decoder_sparse_step": 1,
+  "dtype": "bfloat16",
+  "eos_token_id": 151645,
+  "full_attention_interval": 4,
+  "head_dim": 256,
+  "hidden_act": "silu",
+  "hidden_size": 2048,
+  "initializer_range": 0.02,
+  "intermediate_size": 5120,
+  "layer_types": [
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention",
+    "linear_attention",
+    "linear_attention",
+    "linear_attention",
+    "full_attention"
+  ],
+  "linear_conv_kernel_dim": 4,
+  "linear_key_head_dim": 128,
+  "linear_num_key_heads": 16,
+  "linear_num_value_heads": 32,
+  "linear_value_head_dim": 128,
+  "max_position_embeddings": 262144,
+  "mlp_only_layers": [],
+  "model_type": "qwen3_next",
+  "moe_intermediate_size": 512,
+  "norm_topk_prob": true,
+  "num_attention_heads": 16,
+  "num_experts": 512,
+  "num_experts_per_tok": 10,
+  "num_hidden_layers": 48,
+  "num_key_value_heads": 2,
+  "output_router_logits": false,
+  "partial_rotary_factor": 0.25,
+  "rms_norm_eps": 1e-06,
+  "rope_scaling": null,
+  "rope_theta": 10000000,
+  "router_aux_loss_coef": 0.001,
+  "shared_expert_intermediate_size": 512,
+  "tie_word_embeddings": false,
+  "transformers_version": "4.57.3",
+  "use_cache": true,
+  "use_sliding_window": false,
+  "vocab_size": 151936
+}

generation_config.json ADDED Viewed

	@@ -0,0 +1,13 @@

+{
+  "bos_token_id": 151643,
+  "do_sample": true,
+  "eos_token_id": [
+    151645,
+    151643
+  ],
+  "pad_token_id": 151643,
+  "temperature": 0.7,
+  "top_k": 20,
+  "top_p": 0.8,
+  "transformers_version": "4.57.3"
+}

merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

model-00001-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:66154e218b81d732350506ea804ccd12e6175e0f4a0ad5abff1b59472f45a02d
+size 3999606640

model-00002-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3418a5fc827c581501d8f610bd7d7f9771c7c67c654a3cd440b774dfef445494
+size 3999841808

model-00003-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:78a2652e5943d03728179e3c76005270eff82c922d7d9cc899173039a17625c8
+size 3999515712

model-00004-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:95220aa358f43c658fcf9136a83f8c09e6f534fd2703e85cab098bf3c35371a3
+size 3999842128

model-00005-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:908d1493311db0b9655499002fb898d2bdf5ffcaf1d4a27cae45569d7e700af0
+size 3999842128

model-00006-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1bd78fbbdf2de47373d749924f736e519d34b5fd3419a349ec25e17d65d57273
+size 3999853128

model-00007-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:aaaf273fc1bcb80d78ee825ad2b43f17d579b82614e7f91c02b8ab0363e8932b
+size 3999841944

model-00008-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0f7c4e9e03bda23160f5775c427ec5f512e55e1b32b335e5f571b79d6fb9b2cb
+size 3999842128

model-00009-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c864921ef4e708f0c20c2bcbed25245e39907321b89ff2930d1d517199f1eebc
+size 3999843352

model-00010-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c9c6c55657f49ed934e088f4da236f1bfc7bac182e5eb2c34aa46b6fb17eabad
+size 3999517600

model-00011-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:db7c99cd1eeb05053400dccb2ce10936afbe84d501af731618a62fcccad000d0
+size 4000181296

model-00012-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f17d4d3fc472b1ef0e4a086aea952e6e7cd8b0bf32509cfc62f17b489ed28c42
+size 3999843944

model-00013-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:28f0fb3b82aefc2359ed52bd85c3c8101a7919bf7faf8e7d6e70173d01571d04
+size 3999517600

model-00014-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9a28ac12da53a5f7a56a1cc20471bce98d239234e245db9bc19546e687837e66
+size 3999844016

model-00015-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:20dd8b3eb19a20ba615265e6d8cfebebff085ee63bf40f76fa2e23f7713e02f5
+size 4000181552

model-00016-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a8ff505a03938ae82dff00d70f47dd9421a2c66e6144a956b7156510b680c910
+size 3999517280

model-00017-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:418c53f4491be5cb87ede928dc8303ccc09a1c8ed9bc8b052fa362fd12575fde
+size 3999844016

model-00018-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1ca87e643a3a68b6745fb219958617924d02c25068763f5f95f200c513e1b655
+size 3999844008

model-00019-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a08fd9270c756b7604c43189883310e9e38579962520f996c3773c6516980098
+size 3999844008

model-00020-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cd1dd20ffbed695bb2e437b1a33cb8f8b21a9802721312a91f8b19a314061e9a
+size 3999854952

model-00021-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7a7987e3dae21a8ef6650c1475bf8dd3a959b6110f64045a8b8f5baccc796ac1
+size 3999843832

model-00022-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0ac7995f87e1c2438d9c439460b6898d45bf53d487eb24be188bb2ef0095ee03
+size 3999844008

model-00023-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ff017165cdd8b9850f069c9243576c564425c274be41b39fd292a08d09de4013
+size 3999517608

model-00024-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bc3ee5059726bec4bbbb2632de3292965307156d1321e8985fcb1baf1e66bd36
+size 3999844056

model-00025-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fb5a63e1a81e2a53dab2aea460f4dc36970ac000a94ad17c2734dd5f63e3779c
+size 4000181296

model-00026-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:29a16a240d56adcf832b94d555e370d60246edac92970ffc211338456fdc8f99
+size 3999517536

model-00027-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:572e29a3f19b2820a3a002485ccecc33f5d2f442bea2664db45f50d9d75202fa
+size 3999844008

model-00028-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:83eea57f86dbba50108efbca33d8d90b39e46a5fc351bc4f4680b5808499970d
+size 3999844008

model-00029-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c5ffe07bcf7307df9a716f93fd3295eded755f33fee770e4c55a3492d8afdee0
+size 3999855136

model-00030-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1c513c7ba823189ecb53286df0905985c1b0f36f4d94b72febb37c69104d46ca
+size 3999843696

model-00031-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8d1bb920f9f5c6481d453426d42a32a8ae65866077fadc1f29872a9d79f58c6f
+size 3999844008

model-00032-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bccecd0cf5dbef9fe6f2bb37d2d27513b913839ffacfd3e37c12e44e460d5550
+size 3999844016

model-00033-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f2da244d38c7c432236b8fe714eb7774bad7a08423be9275b0f24675092f4258
+size 3999517600

model-00034-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:90b80d3478d807ede160f0bbd49a7fa1e138679ddf087524d5335c2bd9fa87e5
+size 4000181408

model-00035-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fd95aa5b396dba98af1391a055917e7f7e545f56e5e0b0483b8314c614a790f1
+size 3999843832

model-00036-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:92c7bf3dfdd65201fea8f87ee36048eeb50da145c450db2f17df5e737daa22cc
+size 3999517600

model-00037-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:25c3ddbfbc25aa64db226ac041769abd069d2a874c3d016c0d9aaa560f9ef66d
+size 3999844008

model-00038-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bf8fcf233e9fa61b28866ecf3cd9321083891eacdf7ff0a2cccaafbd7b2a4dc7
+size 3999844056

model-00039-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1c3280d2a766fb7012e0191bb3a9dddf4f7bddd2c140995389df170e39973b46
+size 3999854888

model-00040-of-00040.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5a19469201abd5b54d565fb203b6417c40f2dd79036e0b0bf3345579e5c2fb3d
+size 3365585136

model.safetensors.index.json ADDED Viewed

The diff for this file is too large to render. See raw diff

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,31 @@

+{
+  "additional_special_tokens": [
+    "<|im_start|>",
+    "<|im_end|>",
+    "<|object_ref_start|>",
+    "<|object_ref_end|>",
+    "<|box_start|>",
+    "<|box_end|>",
+    "<|quad_start|>",
+    "<|quad_end|>",
+    "<|vision_start|>",
+    "<|vision_end|>",
+    "<|vision_pad|>",
+    "<|image_pad|>",
+    "<|video_pad|>"
+  ],
+  "eos_token": {
+    "content": "<|im_end|>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:aeb13307a71acd8fe81861d94ad54ab689df773318809eed3cbe794b4492dae4
+size 11422654