diff --git a/.gitattributes b/.gitattributes
index a6344aac8c09253b3b630fb776ae94478aa0275b..52373fe24473b1aa44333d318f578ae6bf04b49b 100644
--- a/.gitattributes
+++ b/.gitattributes
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
+tokenizer.json filter=lfs diff=lfs merge=lfs -text
diff --git a/README.md b/README.md
new file mode 100644
index 0000000000000000000000000000000000000000..86ee907c00cf6e22c8ddfdffd69d6073f7b9a23b
--- /dev/null
+++ b/README.md
@@ -0,0 +1,242 @@
+---
+tags:
+- unsloth
+base_model:
+- zai-org/GLM-4.7
+language:
+ - en
+ - zh
+library_name: transformers
+license: mit
+pipeline_tag: text-generation
+---
+> [!NOTE]
+> Includes Unsloth **chat template fixes**!
+> For `llama.cpp`, use `--jinja`.
+>
+
+# GLM-4.7
+
+ 👋 Join our Discord community.
+
+ 📖 Check out the GLM-4.7 technical blog and the technical report (GLM-4.5).
+
+ 📍 Use GLM-4.7 API services on Z.ai API Platform.
+
+ 👉 One click to GLM-4.7.
+
+
+## Introduction
+
+**GLM-4.7**, your new coding partner, comes with the following features:
+
+- **Core Coding**: GLM-4.7 brings clear gains, compared to its predecessor GLM-4.6, in multilingual agentic coding and terminal-based tasks, including (73.8%, +5.8%) on SWE-bench, (66.7%, +12.9%) on SWE-bench Multilingual, and (41%, +16.5%) on Terminal Bench 2.0. GLM-4.7 also supports thinking before acting, with significant improvements on complex tasks in mainstream agent frameworks such as Claude Code, Kilo Code, Cline, and Roo Code.
+- **Vibe Coding**: GLM-4.7 takes a big step forward in improving UI quality. It produces cleaner, more modern webpages and generates better-looking slides with more accurate layout and sizing.
+- **Tool Using**: GLM-4.7 achieves significant improvements in tool use. Significantly better performance can be seen on benchmarks such as τ²-Bench and on web browsing via BrowseComp.
+- **Complex Reasoning**: GLM-4.7 delivers a substantial boost in mathematical and reasoning capabilities, achieving (42.8%, +12.4%) on the HLE (Humanity's Last Exam) benchmark with tools, compared to GLM-4.6.
+
+You can also see significant improvements in many other scenarios, such as chat, creative writing, and role-play.
+
+
+
+**Performance on Benchmarks.** The table below compares GLM-4.7 with GLM-4.6, Kimi K2 Thinking, DeepSeek-V3.2, Gemini 3.0 Pro, Claude Sonnet 4.5, GPT-5-High, and GPT-5.1-High on 17 benchmarks (8 reasoning, 5 coding, and 4 agent benchmarks).
+
+| Benchmark | GLM-4.7 | GLM-4.6 | Kimi K2 Thinking | DeepSeek-V3.2 | Gemini 3.0 Pro | Claude Sonnet 4.5 | GPT-5-High | GPT-5.1-High |
+|:-------------------------------|:-------:|:-------:|:----------------:|:-------------:|:--------------:|:-----------------:|:----------:|:------------:|
+| MMLU-Pro | 84.3 | 83.2 | 84.6 | 85.0 | 90.1 | 88.2 | 87.5 | 87.0 |
+| GPQA-Diamond | 85.7 | 81.0 | 84.5 | 82.4 | 91.9 | 83.4 | 85.7 | 88.1 |
+| HLE | 24.8 | 17.2 | 23.9 | 25.1 | 37.5 | 13.7 | 26.3 | 25.7 |
+| HLE (w/ Tools) | 42.8 | 30.4 | 44.9 | 40.8 | 45.8 | 32.0 | 35.2 | 42.7 |
+| AIME 2025 | 95.7 | 93.9 | 94.5 | 93.1 | 95.0 | 87.0 | 94.6 | 94.0 |
+| HMMT Feb. 2025 | 97.1 | 89.2 | 89.4 | 92.5 | 97.5 | 79.2 | 88.3 | 96.3 |
+| HMMT Nov. 2025 | 93.5 | 87.7 | 89.2 | 90.2 | 93.3 | 81.7 | 89.2 | - |
+| IMOAnswerBench | 82.0 | 73.5 | 78.6 | 78.3 | 83.3 | 65.8 | 76.0 | - |
+| LiveCodeBench-v6 | 84.9 | 82.8 | 83.1 | 83.3 | 90.7 | 64.0 | 87.0 | 87.0 |
+| SWE-bench Verified | 73.8 | 68.0 | 71.3 | 73.1 | 76.2 | 77.2 | 74.9 | 76.3 |
+| SWE-bench Multilingual | 66.7 | 53.8 | 61.1 | 70.2 | - | 68.0 | 55.3 | - |
+| Terminal Bench Hard | 33.3 | 23.6 | 30.6 | 35.4 | 39.0 | 33.3 | 30.5 | 43.0 |
+| Terminal Bench 2.0 | 41.0 | 24.5 | 35.7 | 46.4 | 54.2 | 42.8 | 35.2 | 47.6 |
+| BrowseComp | 52.0 | 45.1 | - | 51.4 | - | 24.1 | 54.9 | 50.8 |
+| BrowseComp (w/ Context Manage) | 67.5 | 57.5 | 60.2 | 67.6 | 59.2 | - | - | - |
+| BrowseComp-Zh | 66.6 | 49.5 | 62.3 | 65.0 | - | 42.4 | 63.0 | - |
+| τ²-Bench | 87.4 | 75.2 | 74.3 | 85.3 | 90.7 | 87.2 | 82.4 | 82.7 |
+
+> **Coding:** AGI is a long journey, and benchmarks are only one way to evaluate performance. While the metrics provide necessary checkpoints, the most important thing is still how it *feels*. True intelligence isn't just about acing a test or processing data faster; ultimately, the success of AGI will be measured by how seamlessly it integrates into our lives, **"coding"** this time.
+
+
+## Getting started with GLM-4.7
+
+### Interleaved Thinking & Preserved Thinking
+
+
+
+GLM-4.7 further enhances **Interleaved Thinking** (a feature introduced in GLM-4.5) and introduces **Preserved Thinking** and **Turn-level Thinking**. By thinking between actions and staying consistent across turns, it makes complex tasks more stable and more controllable:
+- **Interleaved Thinking**: The model thinks before every response and tool call, improving instruction following and the quality of generation.
+- **Preserved Thinking**: In coding agent scenarios, the model automatically retains all thinking blocks across multi-turn conversations, reusing the existing reasoning instead of re-deriving it from scratch. This reduces information loss and inconsistencies, and is well-suited for long-horizon, complex tasks.
+- **Turn-level Thinking**: The model supports per-turn control over reasoning within a session: disable thinking for lightweight requests to reduce latency and cost, and enable it for complex tasks to improve accuracy and stability.
+
+More details: https://docs.z.ai/guides/capabilities/thinking-mode
+
+### Evaluation Parameters
+
+**Default Settings (Most Tasks)**
+
+* temperature: `1.0`
+* top-p: `0.95`
+* max new tokens: `131072`
+
+For multi-turn agentic tasks (τ²-Bench and Terminal Bench 2), please turn on [Preserved Thinking mode](https://docs.z.ai/guides/capabilities/thinking-mode).
+
+**Terminal Bench, SWE Bench Verified**
+
+* temperature: `0.7`
+* top-p: `1.0`
+* max new tokens: `16384`
+
+**τ²-Bench**
+
+* temperature: `0`
+* max new tokens: `16384`
+
+For τ²-Bench evaluation, we added an extra prompt to the Retail and Telecom user interactions to avoid failure modes caused by users ending the interaction incorrectly. For the Airline domain, we applied the domain fixes proposed in the [Claude Opus 4.5](https://assets.anthropic.com/m/64823ba7485345a7/Claude-Opus-4-5-System-Card.pdf) release report.
+
+## Serve GLM-4.7 Locally
+
+For local deployment, GLM-4.7 supports inference frameworks including vLLM and SGLang. Comprehensive deployment instructions are available in the official [GitHub](https://github.com/zai-org/GLM-4.5) repository.
+
+
+vLLM and SGLang only support GLM-4.7 on their main branches. You can use their official Docker images for inference.
+
+### vLLM
+
+Using Docker:
+
+```shell
+docker pull vllm/vllm-openai:nightly
+```
+
+or using pip (must use pypi.org as the index url):
+
+```shell
+pip install -U vllm --pre --index-url https://pypi.org/simple --extra-index-url https://wheels.vllm.ai/nightly
+```
+
+### SGLang
+
+Using Docker:
+
+```shell
+docker pull lmsysorg/sglang:dev
+```
+
+or install `sglang` from source with pip.
+
+
+### transformers
+
+Install transformers `4.57.3`, then run:
+
+```python
+import torch
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+MODEL_PATH = "zai-org/GLM-4.7"
+messages = [{"role": "user", "content": "hello"}]
+
+# Render the chat template and tokenize the conversation in one step.
+tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
+inputs = tokenizer.apply_chat_template(
+    messages,
+    tokenize=True,
+    add_generation_prompt=True,
+    return_dict=True,
+    return_tensors="pt",
+)
+
+# Load the weights in bfloat16 and spread them across the available devices.
+model = AutoModelForCausalLM.from_pretrained(
+    pretrained_model_name_or_path=MODEL_PATH,
+    torch_dtype=torch.bfloat16,
+    device_map="auto",
+)
+inputs = inputs.to(model.device)
+
+# Greedy decoding; decode only the newly generated tokens, not the prompt.
+generated_ids = model.generate(**inputs, max_new_tokens=128, do_sample=False)
+output_text = tokenizer.decode(generated_ids[0][inputs.input_ids.shape[1] :])
+print(output_text)
+```
+
+### vLLM
+
+```shell
+vllm serve zai-org/GLM-4.7-FP8 \
+ --tensor-parallel-size 8 \
+ --tool-call-parser glm47 \
+ --reasoning-parser glm45 \
+ --enable-auto-tool-choice \
+ --served-model-name glm-4.7-fp8
+```
+
+### SGLang
+
+```shell
+python3 -m sglang.launch_server \
+ --model-path zai-org/GLM-4.7-FP8 \
+ --tp-size 8 \
+ --tool-call-parser glm47 \
+ --reasoning-parser glm45 \
+ --speculative-algorithm EAGLE \
+ --speculative-num-steps 3 \
+ --speculative-eagle-topk 1 \
+ --speculative-num-draft-tokens 4 \
+ --mem-fraction-static 0.8 \
+ --served-model-name glm-4.7-fp8 \
+ --host 0.0.0.0 \
+ --port 8000
+```
+
+### Parameter Instructions
+
+- For agentic tasks with GLM-4.7, turn on [Preserved Thinking mode](https://docs.z.ai/guides/capabilities/thinking-mode) by adding the following config (currently supported only by SGLang):
+
+  ```
+  "chat_template_kwargs": {
+      "enable_thinking": true,
+      "clear_thinking": false
+  }
+  ```
+
+- With both `vLLM` and `SGLang`, thinking mode is enabled by default for every request. To disable it, add the `extra_body={"chat_template_kwargs": {"enable_thinking": False}}` parameter to the request.
+- Both frameworks support tool calling. Describe tools in the OpenAI-style tool description format.
+
+
+## Citation
+
+If you find our work useful in your research, please consider citing the following paper:
+
+```bibtex
+@misc{5team2025glm45agenticreasoningcoding,
+ title={GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models},
+ author={GLM Team and Aohan Zeng and Xin Lv and Qinkai Zheng and Zhenyu Hou and Bin Chen and Chengxing Xie and Cunxiang Wang and Da Yin and Hao Zeng and Jiajie Zhang and Kedong Wang and Lucen Zhong and Mingdao Liu and Rui Lu and Shulin Cao and Xiaohan Zhang and Xuancheng Huang and Yao Wei and Yean Cheng and Yifan An and Yilin Niu and Yuanhao Wen and Yushi Bai and Zhengxiao Du and Zihan Wang and Zilin Zhu and Bohan Zhang and Bosi Wen and Bowen Wu and Bowen Xu and Can Huang and Casey Zhao and Changpeng Cai and Chao Yu and Chen Li and Chendi Ge and Chenghua Huang and Chenhui Zhang and Chenxi Xu and Chenzheng Zhu and Chuang Li and Congfeng Yin and Daoyan Lin and Dayong Yang and Dazhi Jiang and Ding Ai and Erle Zhu and Fei Wang and Gengzheng Pan and Guo Wang and Hailong Sun and Haitao Li and Haiyang Li and Haiyi Hu and Hanyu Zhang and Hao Peng and Hao Tai and Haoke Zhang and Haoran Wang and Haoyu Yang and He Liu and He Zhao and Hongwei Liu and Hongxi Yan and Huan Liu and Huilong Chen and Ji Li and Jiajing Zhao and Jiamin Ren and Jian Jiao and Jiani Zhao and Jianyang Yan and Jiaqi Wang and Jiayi Gui and Jiayue Zhao and Jie Liu and Jijie Li and Jing Li and Jing Lu and Jingsen Wang and Jingwei Yuan and Jingxuan Li and Jingzhao Du and Jinhua Du and Jinxin Liu and Junkai Zhi and Junli Gao and Ke Wang and Lekang Yang and Liang Xu and Lin Fan and Lindong Wu and Lintao Ding and Lu Wang and Man Zhang and Minghao Li and Minghuan Xu and Mingming Zhao and Mingshu Zhai and Pengfan Du and Qian Dong and Shangde Lei and Shangqing Tu and Shangtong Yang and Shaoyou Lu and Shijie Li and Shuang Li and Shuang-Li and Shuxun Yang and Sibo Yi and Tianshu Yu and Wei Tian and Weihan Wang and Wenbo Yu and Weng Lam Tam and Wenjie Liang and Wentao Liu and Xiao Wang and Xiaohan Jia and Xiaotao Gu and Xiaoying Ling and Xin Wang and Xing Fan and Xingru Pan and Xinyuan Zhang and Xinze Zhang and Xiuqing Fu and Xunkai Zhang and Yabo Xu and Yandong Wu and Yida Lu and Yidong Wang and Yilin Zhou and Yiming 
+ Pan and Ying Zhang and Yingli Wang and Yingru Li and Yinpei Su and Yipeng Geng and Yitong Zhu and Yongkun Yang and Yuhang Li and Yuhao Wu and Yujiang Li and Yunan Liu and Yunqing Wang and Yuntao Li and Yuxuan Zhang and Zezhen Liu and Zhen Yang and Zhengda Zhou and Zhongpei Qiao and Zhuoer Feng and Zhuorui Liu and Zichen Zhang and Zihan Wang and Zijun Yao and Zikang Wang and Ziqiang Liu and Ziwei Chai and Zixuan Li and Zuodong Zhao and Wenguang Chen and Jidong Zhai and Bin Xu and Minlie Huang and Hongning Wang and Juanzi Li and Yuxiao Dong and Jie Tang},
+ year={2025},
+ eprint={2508.06471},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL},
+ url={https://arxiv.org/abs/2508.06471},
+}
+```
\ No newline at end of file
diff --git a/chat_template.jinja b/chat_template.jinja
new file mode 100644
index 0000000000000000000000000000000000000000..109442b7f54ef82b4e6567b03e4eb9768f02f0fd
--- /dev/null
+++ b/chat_template.jinja
@@ -0,0 +1,88 @@
+{# Unsloth template fixes #}
+[gMASK]<sop>
+{%- if tools -%}
+<|system|>
+# Tools
+
+You may call one or more functions to assist with the user query.
+
+You are provided with function signatures within <tools></tools> XML tags:
+<tools>
+{% for tool in tools %}
+{{ tool | tojson|string }}
+{% endfor %}
+</tools>
+
+For each function call, output the function name and arguments within the following XML format:
+<tool_call>{function-name}<arg_key>{arg-key-1}</arg_key><arg_value>{arg-value-1}</arg_value><arg_key>{arg-key-2}</arg_key><arg_value>{arg-value-2}</arg_value>...</tool_call>{%- endif -%}
+{%- macro visible_text(content) -%}
+ {%- if content is string -%}
+ {{- content }}
+ {%- elif content is iterable and content is not mapping -%}
+ {%- for item in content -%}
+ {%- if item is mapping and item.type == 'text' -%}
+ {{- item.text }}
+ {%- elif item is string -%}
+ {{- item }}
+ {%- endif -%}
+ {%- endfor -%}
+ {%- else -%}
+ {{- content }}
+ {%- endif -%}
+{%- endmacro -%}
+{%- set ns = namespace(last_user_index=-1) %}
+{%- for m in messages %}
+ {%- if m.role == 'user' %}
+ {% set ns.last_user_index = loop.index0 -%}
+ {%- endif %}
+{%- endfor %}
+{% for m in messages %}
+{%- if m.role == 'user' -%}<|user|>{{ visible_text(m.content) }}
+{%- elif m.role == 'assistant' -%}
+<|assistant|>
+{%- set reasoning_content = '' %}
+{%- set content = visible_text(m.content) %}
+{%- if m.reasoning_content is string %}
+ {%- set reasoning_content = m.reasoning_content %}
+{%- else %}
+ {%- if '</think>' in content %}
+ {%- set reasoning_content = ((content.split('</think>')|first).rstrip('\n').split('<think>')|last).lstrip('\n') %}
+ {%- set content = (content.split('</think>')|last).lstrip('\n') %}
+ {%- endif %}
+{%- endif %}
+{%- if ((clear_thinking is defined and not clear_thinking) or loop.index0 > ns.last_user_index) and reasoning_content -%}
+{{ '<think>' + reasoning_content.strip() + '</think>'}}
+{%- else -%}
+{{ '</think>' }}
+{%- endif -%}
+{%- if content.strip() -%}
+{{ content.strip() }}
+{%- endif -%}
+{% if m.tool_calls %}
+{% for tc in m.tool_calls %}
+{%- if tc.function %}
+ {%- set tc = tc.function %}
+{%- endif %}
+{{- '<tool_call>' + tc.name -}}
+{% set _args = tc.arguments %}{%- if _args is mapping %}{% for k, v in _args|items %}<arg_key>{{ k }}</arg_key><arg_value>{{ v | tojson|string if v is not string else v }}</arg_value>{% endfor %}{%- endif %}</tool_call>{% endfor %}
+{% endif %}
+{%- elif m.role == 'tool' -%}
+{%- if m.content is string -%}
+{%- if loop.first or (messages[loop.index0 - 1].role != "tool") %}
+ {{- '<|observation|>' }}
+{%- endif %}
+{{- '<tool_response>' }}
+{{- m.content }}
+{{- '</tool_response>' }}
+{%- else -%}
+<|observation|>{% for tr in m.content %}
+<tool_response>{{ tr.output if tr.output is defined else tr }}</tool_response>{% endfor -%}
+{% endif -%}
+{%- elif m.role == 'system' -%}
+<|system|>{{ visible_text(m.content) }}
+{%- endif -%}
+{%- endfor -%}
+{%- if add_generation_prompt -%}
+ <|assistant|>{{- '</think>' if (enable_thinking is defined and not enable_thinking) else '<think>' -}}
+{%- endif -%}
+{# Copyright 2025-present Unsloth. Apache 2.0 License. #}
\ No newline at end of file
diff --git a/config.json b/config.json
new file mode 100644
index 0000000000000000000000000000000000000000..0f6af0d77d0465b023a724d50412b707d46a0fbe
--- /dev/null
+++ b/config.json
@@ -0,0 +1,44 @@
+{
+ "architectures": [
+ "Glm4MoeForCausalLM"
+ ],
+ "attention_bias": true,
+ "attention_dropout": 0.0,
+ "torch_dtype": "bfloat16",
+ "eos_token_id": [
+ 151329,
+ 151336,
+ 151338
+ ],
+ "first_k_dense_replace": 3,
+ "head_dim": 128,
+ "hidden_act": "silu",
+ "hidden_size": 5120,
+ "initializer_range": 0.02,
+ "intermediate_size": 12288,
+ "max_position_embeddings": 202752,
+ "model_type": "glm4_moe",
+ "moe_intermediate_size": 1536,
+ "n_group": 1,
+ "n_routed_experts": 160,
+ "n_shared_experts": 1,
+ "norm_topk_prob": true,
+ "num_attention_heads": 96,
+ "num_experts_per_tok": 8,
+ "num_hidden_layers": 92,
+ "num_key_value_heads": 8,
+ "num_nextn_predict_layers": 1,
+ "pad_token_id": 151330,
+ "partial_rotary_factor": 0.5,
+ "rms_norm_eps": 1e-05,
+ "rope_scaling": null,
+ "rope_theta": 1000000,
+ "routed_scaling_factor": 2.5,
+ "tie_word_embeddings": false,
+ "topk_group": 1,
+ "transformers_version": "4.57.3",
+ "unsloth_fixed": true,
+ "use_cache": true,
+ "use_qk_norm": true,
+ "vocab_size": 151552
+}
\ No newline at end of file
diff --git a/generation_config.json b/generation_config.json
new file mode 100644
index 0000000000000000000000000000000000000000..f51194759eb31dde6fbc75a28e3fb7036f68161a
--- /dev/null
+++ b/generation_config.json
@@ -0,0 +1,11 @@
+{
+ "_from_model_config": true,
+ "eos_token_id": [
+ 151329,
+ 151336,
+ 151338
+ ],
+ "pad_token_id": 151329,
+ "temperature": 1.0,
+ "transformers_version": "4.56.2"
+}
diff --git a/model-00001-of-00092.safetensors b/model-00001-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..10b6e918d808c8e8f6ff65ebb7c6be14851e50f9
--- /dev/null
+++ b/model-00001-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:68a6fdc167ec2112ded8d04a97966739ae07cf4698e4b29d192536c06f013e51
+size 2202060968
diff --git a/model-00002-of-00092.safetensors b/model-00002-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..131cbe57154bb1518dad38ea9816517a261bc748
--- /dev/null
+++ b/model-00002-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:15296a7b7190f062f80d6fda3f64786bc4bfcbc3eb72b060ee633e1d95822bf2
+size 650168352
diff --git a/model-00003-of-00092.safetensors b/model-00003-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..4c6c2802232b81abc7f1d5cff6457c94297411e0
--- /dev/null
+++ b/model-00003-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5b518e9565f74a154bfd46786aa1d9f8ebb1c053f962256b5352f33431f7f007
+size 650168352
diff --git a/model-00004-of-00092.safetensors b/model-00004-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..59d7874e4049facb4df833949a168e4d4a6d5d59
--- /dev/null
+++ b/model-00004-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3e8db1605ce2afe1f8475c1183aefdd582b802f93430254fc67ebec3ff87c1ae
+size 7871313120
diff --git a/model-00005-of-00092.safetensors b/model-00005-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..ba4f83c2014ab3312f0f8a0d7895f4439e06e9d4
--- /dev/null
+++ b/model-00005-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8b6a0f06e3ef8c5dfa2f8ce31c2397654d93cc709d74f666118748b80d64b8d7
+size 7871313120
diff --git a/model-00006-of-00092.safetensors b/model-00006-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..fa61a6c43887165c424e8f8add30f026cd423ecc
--- /dev/null
+++ b/model-00006-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:705dfb6825169f023cbed84007ad007e4676255e9ddb0c376618c8a5f37db07b
+size 7871313120
diff --git a/model-00007-of-00092.safetensors b/model-00007-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..463873ae832face983c47e431fd32c83799dae19
--- /dev/null
+++ b/model-00007-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3c63bba33598a046d5fc6a4b6be2fa38f5795f5969771ac732860aae1bc6df5a
+size 7871313120
diff --git a/model-00008-of-00092.safetensors b/model-00008-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..9dbb4ed573dfa6f9397e3ecd1a3f6cec5654393b
--- /dev/null
+++ b/model-00008-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:476c0136870b99eeb705dc9e277d3538c0fe917eb030a3efe45c135c537d1639
+size 7871313120
diff --git a/model-00009-of-00092.safetensors b/model-00009-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..1a35ad5762d5aac49c8b572134de32391d525afd
--- /dev/null
+++ b/model-00009-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9d6ced5113e74e01b833eb5fcdf78fcdbd8259dc2f0d7c39e1401e783d06ffd4
+size 7871313120
diff --git a/model-00010-of-00092.safetensors b/model-00010-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..bc236315135abf28e2e45360541b0b65af0d25b9
--- /dev/null
+++ b/model-00010-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1f76b445ffb294d5ed248d4d81c3ca0b15bca9e9f6fb2acf36c4ad1b30c82b47
+size 7871313120
diff --git a/model-00011-of-00092.safetensors b/model-00011-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7a99b9056cb5bc5b7ea2f7b72be41898e6f33ff1
--- /dev/null
+++ b/model-00011-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0ec0e4bcd13137e0915bed1e8dcaacbd6d874cba36ce344afd75f34fb3f1ac6c
+size 7871313616
diff --git a/model-00012-of-00092.safetensors b/model-00012-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..6d6a26e34c5869f2ae85a8f06bb4b99190482ce4
--- /dev/null
+++ b/model-00012-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ef236827f5bb49e4b656eec99462f95d4b2fb576d65c349e275e307a3049adb3
+size 7871313616
diff --git a/model-00013-of-00092.safetensors b/model-00013-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..9cb77a596b30ccbc52c263c33ba5cb59dd3decc4
--- /dev/null
+++ b/model-00013-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7173db5ec5755bb2f6c7c0020761d4e7c98d510f24c64898bece11131ef18768
+size 7871313616
diff --git a/model-00014-of-00092.safetensors b/model-00014-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..9c59f9fe62515f8d30119fcadeee0fa767f39533
--- /dev/null
+++ b/model-00014-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0fb1a2e2bca6a3b2aca3218fa4775cfec7a50d31e773fc0a33554c8b426d06a5
+size 7871313616
diff --git a/model-00015-of-00092.safetensors b/model-00015-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..9ba55deb0a5ec8aab8b309a3386ed8bb8b6a7124
--- /dev/null
+++ b/model-00015-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8abc2995ffa8b6542dbf489bcc5cdeb9b7f5a66b620d25ca0b116e1a2bf990fe
+size 7871313616
diff --git a/model-00016-of-00092.safetensors b/model-00016-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..78730bd4ce7cca28e1970e948ac2e926f92a6327
--- /dev/null
+++ b/model-00016-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cdb08861b405bef1ed5c71395580597abb2bf6af7bd42e4b4b037e2a997f4e83
+size 7871313616
diff --git a/model-00017-of-00092.safetensors b/model-00017-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..4e54a03bf59be497ae00214bdafe212d5ae40936
--- /dev/null
+++ b/model-00017-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5671d6fd9ffcb251101577a414e6918f1a82036c4abea44cab68aafb1a452501
+size 7871313616
diff --git a/model-00018-of-00092.safetensors b/model-00018-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..28f673caf8628b457b152576404b2c2712ced6c8
--- /dev/null
+++ b/model-00018-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:21c1aba4f208f6982896783bb33d87488ac8ee7fbb26ff12a2fdbd7fc02eb384
+size 7871313616
diff --git a/model-00019-of-00092.safetensors b/model-00019-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..10f63a2b7730b0589c3556851a5c16b56bf8b4dd
--- /dev/null
+++ b/model-00019-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bd9e23d4c5b6ec9654f7b3c71a0b851d0e37fdfe94a98bf78e897939b3f64178
+size 7871313616
diff --git a/model-00020-of-00092.safetensors b/model-00020-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7c0e06e8362a8c29b94a3c593e51ee4d033b4013
--- /dev/null
+++ b/model-00020-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:06f33e14ee03d4a7db54e7136a8553beacdd31afe05044fd09ca10b63616d271
+size 7871313616
diff --git a/model-00021-of-00092.safetensors b/model-00021-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7b732b31bd7d7a64441933483164e075f7be0fa5
--- /dev/null
+++ b/model-00021-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:23f45515976a4f6ad85d1ab97a2bfefa930d776f47bcbfc0148249d51d0d87f7
+size 7871313616
diff --git a/model-00022-of-00092.safetensors b/model-00022-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..8aa0d81c62b935b80c301c340ffe943333d37754
--- /dev/null
+++ b/model-00022-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b7b67850a3b19945cffaae859724b476e0f6864b58d5378205492a09d073986a
+size 7871313616
diff --git a/model-00023-of-00092.safetensors b/model-00023-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..ab9c9f9b378bdcace167e72d70ef21900c7c5dd1
--- /dev/null
+++ b/model-00023-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1dff224046d45db02f63f1fa45eaf9d7783b6bb0f00ef3e446f9d65ef330b0cc
+size 7871313616
diff --git a/model-00024-of-00092.safetensors b/model-00024-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..55b18410e44da52636c0d3fbdae33a40a1938add
--- /dev/null
+++ b/model-00024-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d082e9e4fe4a813eb2683e23cff52dcb85d438e25bc5d3bdbb01f69e2732be61
+size 7871313616
diff --git a/model-00025-of-00092.safetensors b/model-00025-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..b6ff89064afbd22890b5e96ccdd494efedec569f
--- /dev/null
+++ b/model-00025-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:23b20871d461f9a350bdcf67d12c0095cf24c70ab1f98e5333741eaa6a6c5e1c
+size 7871313616
diff --git a/model-00026-of-00092.safetensors b/model-00026-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..e2f673134500ac2bfac669e4021381af1a6daee6
--- /dev/null
+++ b/model-00026-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:428edb09eb0917aebf12a836d33a58897b402458c7aa23a7c7290e924b20bcb6
+size 7871313616
diff --git a/model-00027-of-00092.safetensors b/model-00027-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..2e64ee1f6fe6336a65648357a5ec20757e9fa46d
--- /dev/null
+++ b/model-00027-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0c282a88d7e8bc4fd57c7c80a7114c66da82a81dc78f934b3404b6ab7e1f1c09
+size 7871313616
diff --git a/model-00028-of-00092.safetensors b/model-00028-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..ab3344512917c7da5ac9a6704e4dc70215ede12f
--- /dev/null
+++ b/model-00028-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e79def67bd54f46e39a005bbbe62e765fb6aafa46efb9840235cf0fb715b6978
+size 7871313616
diff --git a/model-00029-of-00092.safetensors b/model-00029-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..9db8d872b4acff15f7cf4869deffca5951a95517
--- /dev/null
+++ b/model-00029-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bb3421d27afd0cd7f675f5317f242862d835eaf9cfadc9a78f0942df8078b4a8
+size 7871313616
diff --git a/model-00030-of-00092.safetensors b/model-00030-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..35e87e716bdcb6c632b14b5f11b9d8a4ab9b06c0
--- /dev/null
+++ b/model-00030-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:649320f0046102954667a216a33db63890b9aefb5f05ef4e8e388b3079594733
+size 7871313616
diff --git a/model-00031-of-00092.safetensors b/model-00031-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..a897867710c258f4fa858f2dc44e3a3dab77e44f
--- /dev/null
+++ b/model-00031-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:24f00a67630f089abead00b1ab573410f7980a1325e2dd5b0a21e48315c969ac
+size 7871313616
diff --git a/model-00032-of-00092.safetensors b/model-00032-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..f31654f53b3d679c6413b4ebc15063e1c2ed5425
--- /dev/null
+++ b/model-00032-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:686516209abebdd06d923fa537c8047b9b87fbca6e367689031f5941be29d7e0
+size 7871313616
diff --git a/model-00033-of-00092.safetensors b/model-00033-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7825f51c913e7a5a427ae39e72ce9ca059dab203
--- /dev/null
+++ b/model-00033-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d08ea10cbc9d6792f9dbcfd16f9dd579411c439a275ddc08e6cfaf1180ee0f80
+size 7871313616
diff --git a/model-00034-of-00092.safetensors b/model-00034-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..27b045918221fed924a088c38497a7b6daed4b3d
--- /dev/null
+++ b/model-00034-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6c6ffd9adb18fd2c8469c612151476dd61906b337c6d2a7e42f9ee0c740d9200
+size 7871313616
diff --git a/model-00035-of-00092.safetensors b/model-00035-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..40688f91584ec018d52ab3ecd9200e54401ecbf8
--- /dev/null
+++ b/model-00035-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7b010414eb4ab1dce16759b8e5852b05265290ead08eff54b7b82dba78737029
+size 7871313616
diff --git a/model-00036-of-00092.safetensors b/model-00036-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..3cdfce0b8db17142d33ae19e211968372d61ae86
--- /dev/null
+++ b/model-00036-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c6e06322143dfe75f88925bd41387a74762278f143ed7f0d52c3299f9d95b6a3
+size 7871313616
diff --git a/model-00037-of-00092.safetensors b/model-00037-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..5454dab2165819baef1846e9be9c2f103c06ff87
--- /dev/null
+++ b/model-00037-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e6046141408a257c55a273985fb86fd9b7b95b64b57388f0adc38355f84e8ddb
+size 7871313616
diff --git a/model-00038-of-00092.safetensors b/model-00038-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..93252c60dd6352b1df0122eb725a53937faff497
--- /dev/null
+++ b/model-00038-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8a305c39a91b0038d23fd226a987c36e22e4e6f30333df5fcec82cd5e5b632ef
+size 7871313616
diff --git a/model-00039-of-00092.safetensors b/model-00039-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..5690a13207432906c7a9f9480f39bfe8ca890ba1
--- /dev/null
+++ b/model-00039-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6b8cb2fe51cac4b22d1aab7cccc25bdce8bd52953466b09b3e974d7d356418a0
+size 7871313616
diff --git a/model-00040-of-00092.safetensors b/model-00040-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7a3cd35817e793019fd524f6bc8fee15262bb931
--- /dev/null
+++ b/model-00040-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:eb4f4b16a377e1e09e4e52dfbd9d66b75458e4271a77f273430bffb61369b751
+size 7871313616
diff --git a/model-00041-of-00092.safetensors b/model-00041-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..c00209103353b03d36c5b7644bfca6723f8d12d3
--- /dev/null
+++ b/model-00041-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0aba8077b06b1365bae77d04df01f72137b32272e9696ea6ab642e5cc434f5c9
+size 7871313616
diff --git a/model-00042-of-00092.safetensors b/model-00042-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..8a712a829e2886551093a8e502f0466b4aa029bb
--- /dev/null
+++ b/model-00042-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:df0a314dac9df0ddeb7fcaf60e007f650dde5cc101f1bb532de1c660c31bd48e
+size 7871313616
diff --git a/model-00043-of-00092.safetensors b/model-00043-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..c4c7283af2fc64f0916580ebda84f313c00a62c9
--- /dev/null
+++ b/model-00043-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a340ac212f460e1b16d681d39cc8bc42245f807916c3bf5c42221f7e72cb9c22
+size 7871313616
diff --git a/model-00044-of-00092.safetensors b/model-00044-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..f541981cbce4a7c894990deee1a40ec0691634fa
--- /dev/null
+++ b/model-00044-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:befcbefa0573decb72a49fa2544080c2804081521567b9add974c536b43bfb57
+size 7871313616
diff --git a/model-00045-of-00092.safetensors b/model-00045-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..16c677ff03dc13de06582277a2ef232bd4bb3144
--- /dev/null
+++ b/model-00045-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a9fb874dadf025d4f3a634a77b01da540811a7f276eb26e8ca900b47bcb013c9
+size 7871313616
diff --git a/model-00046-of-00092.safetensors b/model-00046-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..35baa91e9d47b140896929e6f314869863bbcac4
--- /dev/null
+++ b/model-00046-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0b2e8d32011d91c99a962daf80ca9bd8123925d34f7fd7332153403928d92275
+size 7871313616
diff --git a/model-00047-of-00092.safetensors b/model-00047-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..771febb2f2ddc01a7e505a76c7ba89da542f2460
--- /dev/null
+++ b/model-00047-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:93ce4178dadbbbb3c631ad4fe4b84216d6a1beb57288cd466cefeaac55c1effb
+size 7871313616
diff --git a/model-00048-of-00092.safetensors b/model-00048-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..79bc3786205615ba3b7b3991c2011702399a22ad
--- /dev/null
+++ b/model-00048-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c4512b972bd42e96c663232b081e49fe62bb424b004ca3cbcd44ef5be6f3c250
+size 7871313616
diff --git a/model-00049-of-00092.safetensors b/model-00049-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..0ac52e7d6821db4ad90eae33687d097fa15b8cd6
--- /dev/null
+++ b/model-00049-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a7af1c9e9d8fb21e41a8e7d8520c18fcea288d5cc5f05e44326c45a915f15d2d
+size 7871313616
diff --git a/model-00050-of-00092.safetensors b/model-00050-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..e6fa335094549421501ef2a76c996cc05bcd2124
--- /dev/null
+++ b/model-00050-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c7712f1e274d4c4e591803e2b7458d513d18a38a8c192c1762b5712d72cfeac8
+size 7871313616
diff --git a/model-00051-of-00092.safetensors b/model-00051-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..58e7d2c55a69e66b6fba5699e05d887e70d3d1ed
--- /dev/null
+++ b/model-00051-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1db13e16ddb77919e2a3cf4b376b178183fb1e51f43417e1f74c1c9f356133d5
+size 7871313616
diff --git a/model-00052-of-00092.safetensors b/model-00052-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7c556ce63e9755600e4cb47c4395a95132da47c0
--- /dev/null
+++ b/model-00052-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:74caa975e11dd8fc6890316e26bdbda914058a0dcb323339797ca85781a6b22e
+size 7871313616
diff --git a/model-00053-of-00092.safetensors b/model-00053-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..59c78d75dfe7454f77eb601d05e8cef03f3b0cf2
--- /dev/null
+++ b/model-00053-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5e938f6cef067149d4def828b9e13149515f38e1fbafa293b823b1ae4b1f1742
+size 7871313616
diff --git a/model-00054-of-00092.safetensors b/model-00054-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..db3c68c62a85e20af0387d9070c51fc1256f57c8
--- /dev/null
+++ b/model-00054-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7b5891ba1fee2b4f8a497e00cab4c22392bf73761f753d3a4caa007303894873
+size 7871313616
diff --git a/model-00055-of-00092.safetensors b/model-00055-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..dfd1148042419cebc1144aa5c9b4c97c09ca9852
--- /dev/null
+++ b/model-00055-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6bbef4b25406c803880bfce4ff2cb68b3fe4efe1e28546a4e5e22aa56e554c22
+size 7871313616
diff --git a/model-00056-of-00092.safetensors b/model-00056-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..bccef36d7354113cea7bbf587b919cd5cf7e40fe
--- /dev/null
+++ b/model-00056-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:38fda2efbdb4a2e40fb7c1ee917eecdeed464f8ae29cfede6ba8441de08d7e43
+size 7871313616
diff --git a/model-00057-of-00092.safetensors b/model-00057-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..0b5adacc54e5d31b7571daa14268607d9ad78fdc
--- /dev/null
+++ b/model-00057-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:544ff484d4fcb4ca0f41753577e44d8423efe782e6d981a3b4947faf132a3ca0
+size 7871313616
diff --git a/model-00058-of-00092.safetensors b/model-00058-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7dc78877ff6dfeb44913a254cd22bbf255d8e987
--- /dev/null
+++ b/model-00058-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:149f48e55ce5396ad741e41ab041364abd782fe1860c11bea4792b1d4a0c730b
+size 7871313616
diff --git a/model-00059-of-00092.safetensors b/model-00059-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..9309af776917a72af8fb0d0339d6d2d1897b8fe7
--- /dev/null
+++ b/model-00059-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9745cc296bc965d267b444d675ce2597aead6b607dbc5e71c2b3287e925b57e6
+size 7871313616
diff --git a/model-00060-of-00092.safetensors b/model-00060-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7582ee11c64d270facecaada5a42bd5ba844813e
--- /dev/null
+++ b/model-00060-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:08611b0ee83b6031abdc2909070a9d1fd2363c4580e20f25d88d1f65010d0b66
+size 7871313616
diff --git a/model-00061-of-00092.safetensors b/model-00061-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..a8f934b97d6c39858c271a276748180319e9b162
--- /dev/null
+++ b/model-00061-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6383141f36aa68236d7ee1980e77cd78d332876df6a77427eefe0361dbc09e29
+size 7871313616
diff --git a/model-00062-of-00092.safetensors b/model-00062-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..471db1784327aa6f068abed1cbbbcb5af50048f9
--- /dev/null
+++ b/model-00062-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:566db31ce52b85570ac48e9c3a9238ac2ae0a470b8ad07caab74488b505320c9
+size 7871313616
diff --git a/model-00063-of-00092.safetensors b/model-00063-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..f2bd44602b6a315420a46992ec0f7c6d9bc24f45
--- /dev/null
+++ b/model-00063-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b4837fba098f4ccb13ddfc6770cda76af0f2752d45e1e90d8660478aa87ffe6a
+size 7871313616
diff --git a/model-00064-of-00092.safetensors b/model-00064-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..0d4a07a8817a6398e6c9c4fa792faea82e52ea07
--- /dev/null
+++ b/model-00064-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0f4b5025ddf597d59b678c7001715672c0fce64c583b43599f8137c8cc38c85f
+size 7871313616
diff --git a/model-00065-of-00092.safetensors b/model-00065-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..00c697688569605b9aa726c0a8b5a1dec7df6738
--- /dev/null
+++ b/model-00065-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b4167e7832b826db2931824f002ff63f9ac00ed5d8f7c94549294dfadbf38d84
+size 7871313616
diff --git a/model-00066-of-00092.safetensors b/model-00066-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..d04457f57855e6e1a60ac412a83791c1e63b3d8b
--- /dev/null
+++ b/model-00066-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3d529371515b32c5e36a6264f04a827b326cc4387f60e02c291ded82675bdb29
+size 7871313616
diff --git a/model-00067-of-00092.safetensors b/model-00067-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..c7ed6ffeffa38b4b1949c8c404b2155d56c17bf2
--- /dev/null
+++ b/model-00067-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0fcba2646d4d1a1d702ffa99dd4984501e190f039bd4e526fa2ebc609e98f565
+size 7871313616
diff --git a/model-00068-of-00092.safetensors b/model-00068-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..1901bc0eff301f0c656e626cc7d161dd47b4b9ab
--- /dev/null
+++ b/model-00068-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cfc6abab874c047c5a37699466235a77006c5cc118ca7470f3cdcbf8b692f8a1
+size 7871313616
diff --git a/model-00069-of-00092.safetensors b/model-00069-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7fe6be9ad6a20d6576d2a74d02b569f841e4da01
--- /dev/null
+++ b/model-00069-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:70710d08ae40b150f72a1a1ce737d022fc5a4a1b0d91f052d0905b4246d3e23c
+size 7871313616
diff --git a/model-00070-of-00092.safetensors b/model-00070-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..01d03ecba7e8b153fc6a7dc8cbdd330db1622936
--- /dev/null
+++ b/model-00070-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:05ceede6c2cc0920b89ef1ac423d7e739575da3a3e77e6ac7a4b70e0a7d95305
+size 7871313616
diff --git a/model-00071-of-00092.safetensors b/model-00071-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..1302fa424c775059d35d668ba0ecd9a663bd8f47
--- /dev/null
+++ b/model-00071-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7116630f5a00294b7fb0bc1db8420662e4b222ead1928ddb7d75247a51740f63
+size 7871313616
diff --git a/model-00072-of-00092.safetensors b/model-00072-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..4346c8292adec7a4c642ac63aa2cffb49547fee6
--- /dev/null
+++ b/model-00072-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:19f145e6ab5f187d38dcc0b8548d7ee21646dd687cfe13e238d55f811b39823e
+size 7871313616
diff --git a/model-00073-of-00092.safetensors b/model-00073-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..511ca50f6b0ffcf8a6ed34c8c065f2746d99817b
--- /dev/null
+++ b/model-00073-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b5e248b81b253812525515e6e0ce3862ca1f1be28299ab0ff91ffcc857fccc5e
+size 7871313616
diff --git a/model-00074-of-00092.safetensors b/model-00074-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..930ea3b48409c1e5f4c34eca9dc91ea9295552ea
--- /dev/null
+++ b/model-00074-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1de98b8cc5620f71d0b2f2b57c2e82c92ae636075b17af5150bd23afc5825200
+size 7871313616
diff --git a/model-00075-of-00092.safetensors b/model-00075-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..862c5781e3b169b8a6009d24eae8fa1f940e4e68
--- /dev/null
+++ b/model-00075-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:40105d9e08659abe93863efd07ef3693546e6d71bfb14bad2d1dbe346de2c2b4
+size 7871313616
diff --git a/model-00076-of-00092.safetensors b/model-00076-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..829a441801829d9faa4edc40970da73864dfebe0
--- /dev/null
+++ b/model-00076-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c96d00c03d67063f449bcec160fb6925b3ca50495e1d97c0a7dcb1ef927b58b1
+size 7871313616
diff --git a/model-00077-of-00092.safetensors b/model-00077-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..ff69343449ca72305dcb87716f656d32d5a2f215
--- /dev/null
+++ b/model-00077-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:244ff18d601a674dfa3cee3fe89c1d7d199627ec2828c9042945464e9360e73c
+size 7871313616
diff --git a/model-00078-of-00092.safetensors b/model-00078-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..eafb8b42e36c5361f2b8c10f84e9b2a089f8e123
--- /dev/null
+++ b/model-00078-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:009adc17ef42623ff31d9b5891dddf9b65107239e0524249b3cfb04c8dbccd81
+size 7871313616
diff --git a/model-00079-of-00092.safetensors b/model-00079-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..79050b7093195f189757332cb7244468e84ed164
--- /dev/null
+++ b/model-00079-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:dd3dd005f14500de006034ea790ffc68ca4ff661e3baeabd92449e1565a41641
+size 7871313616
diff --git a/model-00080-of-00092.safetensors b/model-00080-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..48e054bda5bde7a4b59c5cf23f372d7c7ef9150f
--- /dev/null
+++ b/model-00080-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d3f461f3f119d9204a0dc5dc70f2f12723731f40287e1e1cd1a197a49582a961
+size 7871313616
diff --git a/model-00081-of-00092.safetensors b/model-00081-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..559d66847fc4f66cf2ee745d46f7486c3e2748cb
--- /dev/null
+++ b/model-00081-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:15c0c28e765860bf21a7e192cc26719e1a756439a69c1bdf64992c26c1a9b913
+size 7871313616
diff --git a/model-00082-of-00092.safetensors b/model-00082-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..f87086be460dc53a979e3c0f513aa5e823458e41
--- /dev/null
+++ b/model-00082-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3db2243d3d30320cb9283636230c9b3f56da27884dcc560127f0e1c7a36ba1ab
+size 7871313616
diff --git a/model-00083-of-00092.safetensors b/model-00083-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..56bb2a2a0c06c8f1ecf8aad879d1f8caf2d5670b
--- /dev/null
+++ b/model-00083-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:97335632904a2ba5c73f9456ff5cfacc9dea7942e537e0270ef7720a8c936283
+size 7871313616
diff --git a/model-00084-of-00092.safetensors b/model-00084-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..59401eeb6a1037869c8827d7d63fe59262539ff9
--- /dev/null
+++ b/model-00084-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1d279ed7c71edbed54473bb1887df0688a081c720fcb5d80d93c5348c753b5d4
+size 7871313616
diff --git a/model-00085-of-00092.safetensors b/model-00085-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..44f7473dc682cb50b70d49a95cf54edd9f82537c
--- /dev/null
+++ b/model-00085-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:02041dc0d86bd3c59be6b24b2d9ad753b722f64bbd562839cd0fc88889991ce0
+size 7871313616
diff --git a/model-00086-of-00092.safetensors b/model-00086-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..e6569ababb6525558809d7d1bcdf2af31a935d3f
--- /dev/null
+++ b/model-00086-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:87cfbea39a0205b33390b0794a58efc3ee8027d918a2842c2062b4e6ee0710b5
+size 7871313616
diff --git a/model-00087-of-00092.safetensors b/model-00087-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..62c68713f9019853888317927518fa52b155cd47
--- /dev/null
+++ b/model-00087-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:503d16544ff0ebafc39df0daa6ae17a3699b68a5d8313f213ceaf3f7d684eb42
+size 7871313616
diff --git a/model-00088-of-00092.safetensors b/model-00088-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7429aa7d91f6c54a09586091ae4ae2b881d069e6
--- /dev/null
+++ b/model-00088-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0a0cc0beecbed935686b418f17c66abd28470f4bea70a7392f652b410caf0ce2
+size 7871313616
diff --git a/model-00089-of-00092.safetensors b/model-00089-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..7ca81747fa2dc30f99a813bde3caa54db6236870
--- /dev/null
+++ b/model-00089-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0797520746118082f5f25b832cea0d9fe1afd5f8255d76b1a39742deb5dd9d19
+size 7871313616
diff --git a/model-00090-of-00092.safetensors b/model-00090-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..6d7b64aac43495a95a5c528b3266f986520a5250
--- /dev/null
+++ b/model-00090-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:512cbbb13d94b08417fe541ee22a3ee30a30569517857ee5907676f592219390
+size 7871313616
diff --git a/model-00091-of-00092.safetensors b/model-00091-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..ff86f084a6a15c040d7d1a77a321c24a420503af
--- /dev/null
+++ b/model-00091-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1b82d24b8ba14c92bc9f7a95f882309e924268eb35287701d2e3d419a4e74f3a
+size 7871313616
diff --git a/model-00092-of-00092.safetensors b/model-00092-of-00092.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..cfc923e9397b001491fee1b271a1323371e404a7
--- /dev/null
+++ b/model-00092-of-00092.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:30ee55884d2ce83efac8b702808e7d0c1d079423fbf6dfe6fa3b7c47f5d7b9ff
+size 9423216672
diff --git a/model.safetensors.index.json b/model.safetensors.index.json
new file mode 100644
index 0000000000000000000000000000000000000000..e171e67be01507d60a955a7bbad9f519c9a62fc0
--- /dev/null
+++ b/model.safetensors.index.json
@@ -0,0 +1,44698 @@
+{
+ "metadata": {
+ "total_size": 358337791296
+ },
+ "weight_map": {
+ "model.embed_tokens.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.input_layernorm.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.mlp.down_proj.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.mlp.gate_proj.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.mlp.up_proj.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.post_attention_layernorm.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.k_norm.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.k_proj.bias": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.k_proj.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.o_proj.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.q_norm.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.q_proj.bias": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.v_proj.bias": "model-00001-of-00092.safetensors",
+ "model.layers.0.self_attn.v_proj.weight": "model-00001-of-00092.safetensors",
+ "model.layers.1.input_layernorm.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.mlp.down_proj.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.mlp.gate_proj.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.mlp.up_proj.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.post_attention_layernorm.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.k_norm.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.k_proj.bias": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.k_proj.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.o_proj.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.q_norm.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.q_proj.bias": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.q_proj.weight": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.v_proj.bias": "model-00002-of-00092.safetensors",
+ "model.layers.1.self_attn.v_proj.weight": "model-00002-of-00092.safetensors",
+ "model.layers.2.input_layernorm.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.mlp.down_proj.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.mlp.gate_proj.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.mlp.up_proj.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.post_attention_layernorm.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.k_norm.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.k_proj.bias": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.k_proj.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.o_proj.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.q_norm.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.q_proj.bias": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.q_proj.weight": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.v_proj.bias": "model-00003-of-00092.safetensors",
+ "model.layers.2.self_attn.v_proj.weight": "model-00003-of-00092.safetensors",
+ "model.layers.3.input_layernorm.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.0.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.0.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.0.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.1.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.1.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.1.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.10.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.10.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.10.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.100.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.100.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.100.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.101.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.101.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.101.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.102.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.102.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.102.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.103.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.103.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.103.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.104.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.104.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.104.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.105.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.105.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.105.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.106.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.106.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.106.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.107.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.107.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.107.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.108.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.108.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.108.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.109.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.109.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.109.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.11.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.11.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.11.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.110.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.110.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.110.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.111.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.111.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.111.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.112.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.112.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.112.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.113.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.113.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.113.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.114.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.114.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.114.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.115.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.115.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.115.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.116.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.116.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.116.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.117.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.117.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.117.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.118.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.118.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.118.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.119.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.119.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.119.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.12.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.12.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.12.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.120.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.120.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.120.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.121.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.121.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.121.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.122.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.122.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.122.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.123.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.123.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.123.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.124.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.124.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.124.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.125.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.125.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.125.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.126.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.126.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.126.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.127.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.127.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.127.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.128.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.128.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.128.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.129.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.129.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.129.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.13.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.13.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.13.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.130.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.130.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.130.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.131.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.131.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.131.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.132.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.132.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.132.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.133.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.133.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.133.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.134.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.134.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.134.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.135.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.135.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.135.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.136.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.136.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.136.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.137.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.137.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.137.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.138.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.138.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.138.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.139.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.139.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.139.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.14.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.14.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.14.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.140.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.140.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.140.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.141.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.141.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.141.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.142.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.142.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.142.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.143.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.143.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.143.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.144.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.144.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.144.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.145.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.145.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.145.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.146.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.146.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.146.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.147.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.147.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.147.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.148.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.148.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.148.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.149.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.149.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.149.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.15.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.15.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.15.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.150.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.150.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.150.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.151.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.151.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.151.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.152.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.152.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.152.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.153.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.153.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.153.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.154.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.154.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.154.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.155.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.155.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.155.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.156.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.156.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.156.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.157.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.157.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.157.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.158.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.158.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.158.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.159.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.159.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.159.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.16.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.16.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.16.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.17.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.17.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.17.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.18.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.18.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.18.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.19.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.19.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.19.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.2.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.2.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.2.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.20.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.20.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.20.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.21.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.21.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.21.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.22.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.22.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.22.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.23.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.23.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.23.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.24.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.24.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.24.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.25.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.25.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.25.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.26.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.26.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.26.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.27.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.27.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.27.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.28.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.28.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.28.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.29.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.29.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.29.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.3.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.3.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.3.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.30.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.30.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.30.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.31.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.31.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.31.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.32.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.32.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.32.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.33.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.33.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.33.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.34.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.34.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.34.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.35.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.35.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.35.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.36.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.36.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.36.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.37.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.37.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.37.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.38.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.38.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.38.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.39.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.39.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.39.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.4.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.4.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.4.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.40.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.40.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.40.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.41.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.41.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.41.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.42.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.42.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.42.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.43.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.43.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.43.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.44.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.44.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.44.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.45.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.45.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.45.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.46.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.46.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.46.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.47.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.47.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.47.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.48.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.48.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.48.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.49.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.49.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.49.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.5.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.5.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.5.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.50.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.50.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.50.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.51.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.51.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.51.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.52.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.52.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.52.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.53.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.53.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.53.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.54.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.54.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.54.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.55.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.55.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.55.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.56.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.56.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.56.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.57.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.57.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.57.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.58.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.58.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.58.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.59.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.59.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.59.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.6.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.6.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.6.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.60.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.60.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.60.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.61.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.61.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.61.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.62.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.62.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.62.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.63.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.63.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.63.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.64.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.64.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.64.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.65.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.65.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.65.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.66.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.66.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.66.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.67.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.67.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.67.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.68.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.68.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.68.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.69.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.69.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.69.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.7.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.7.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.7.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.70.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.70.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.70.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.71.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.71.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.71.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.72.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.72.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.72.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.73.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.73.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.73.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.74.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.74.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.74.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.75.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.75.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.75.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.76.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.76.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.76.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.77.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.77.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.77.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.78.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.78.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.78.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.79.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.79.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.79.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.8.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.8.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.8.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.80.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.80.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.80.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.81.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.81.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.81.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.82.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.82.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.82.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.83.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.83.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.83.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.84.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.84.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.84.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.85.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.85.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.85.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.86.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.86.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.86.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.87.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.87.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.87.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.88.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.88.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.88.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.89.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.89.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.89.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.9.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.9.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.9.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.90.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.90.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.90.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.91.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.91.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.91.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.92.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.92.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.92.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.93.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.93.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.93.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.94.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.94.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.94.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.95.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.95.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.95.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.96.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.96.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.96.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.97.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.97.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.97.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.98.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.98.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.98.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.99.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.99.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.experts.99.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.gate.e_score_correction_bias": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.gate.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.shared_experts.down_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.shared_experts.gate_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.mlp.shared_experts.up_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.post_attention_layernorm.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.k_norm.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.k_proj.bias": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.k_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.o_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.q_norm.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.q_proj.bias": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.q_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.v_proj.bias": "model-00004-of-00092.safetensors",
+ "model.layers.3.self_attn.v_proj.weight": "model-00004-of-00092.safetensors",
+ "model.layers.4.input_layernorm.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.0.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.0.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.0.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.1.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.1.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.1.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.10.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.10.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.10.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.100.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.100.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.100.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.101.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.101.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.101.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.102.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.102.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.102.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.103.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.103.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.103.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.104.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.104.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.104.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.105.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.105.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.105.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.106.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.106.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.106.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.107.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.107.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.107.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.108.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.108.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.108.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.109.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.109.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.109.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.11.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.11.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.11.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.110.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.110.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.110.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.111.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.111.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.111.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.112.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.112.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.112.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.113.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.113.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.113.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.114.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.114.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.114.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.115.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.115.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.115.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.116.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.116.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.116.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.117.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.117.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.117.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.118.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.118.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.118.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.119.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.119.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.119.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.12.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.12.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.12.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.120.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.120.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.120.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.121.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.121.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.121.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.122.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.122.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.122.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.123.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.123.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.123.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.124.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.124.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.124.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.125.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.125.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.125.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.126.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.126.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.126.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.127.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.127.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.127.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.128.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.128.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.128.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.129.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.129.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.129.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.13.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.13.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.13.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.130.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.130.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.130.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.131.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.131.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.131.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.132.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.132.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.132.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.133.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.133.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.133.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.134.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.134.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.134.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.135.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.135.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.135.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.136.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.136.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.136.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.137.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.137.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.137.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.138.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.138.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.138.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.139.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.139.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.139.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.14.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.14.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.14.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.140.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.140.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.140.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.141.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.141.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.141.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.142.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.142.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.142.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.143.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.143.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.143.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.144.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.144.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.144.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.145.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.145.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.145.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.146.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.146.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.146.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.147.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.147.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.147.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.148.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.148.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.148.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.149.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.149.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.149.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.15.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.15.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.15.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.150.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.150.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.150.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.151.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.151.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.151.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.152.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.152.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.152.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.153.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.153.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.153.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.154.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.154.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.154.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.155.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.155.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.155.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.156.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.156.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.156.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.157.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.157.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.157.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.158.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.158.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.158.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.159.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.159.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.159.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.16.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.16.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.16.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.17.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.17.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.17.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.18.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.18.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.18.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.19.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.19.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.19.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.2.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.2.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.2.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.20.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.20.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.20.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.21.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.21.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.21.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.22.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.22.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.22.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.23.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.23.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.23.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.24.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.24.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.24.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.25.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.25.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.25.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.26.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.26.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.26.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.27.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.27.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.27.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.28.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.28.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.28.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.29.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.29.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.29.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.3.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.3.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.3.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.30.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.30.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.30.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.31.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.31.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.31.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.32.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.32.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.32.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.33.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.33.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.33.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.34.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.34.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.34.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.35.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.35.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.35.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.36.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.36.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.36.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.37.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.37.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.37.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.38.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.38.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.38.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.39.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.39.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.39.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.4.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.4.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.4.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.40.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.40.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.40.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.41.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.41.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.41.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.42.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.42.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.42.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.43.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.43.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.43.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.44.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.44.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.44.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.45.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.45.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.45.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.46.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.46.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.46.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.47.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.47.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.47.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.48.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.48.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.48.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.49.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.49.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.49.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.5.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.5.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.5.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.50.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.50.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.50.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.51.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.51.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.51.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.52.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.52.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.52.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.53.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.53.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.53.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.54.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.54.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.54.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.55.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.55.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.55.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.56.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.56.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.56.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.57.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.57.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.57.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.58.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.58.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.58.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.59.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.59.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.59.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.6.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.6.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.6.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.60.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.60.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.60.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.61.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.61.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.61.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.62.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.62.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.62.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.63.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.63.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.63.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.64.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.64.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.64.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.65.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.65.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.65.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.66.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.66.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.66.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.67.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.67.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.67.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.68.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.68.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.68.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.69.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.69.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.69.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.7.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.7.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.7.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.70.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.70.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.70.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.71.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.71.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.71.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.72.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.72.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.72.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.73.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.73.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.73.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.74.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.74.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.74.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.75.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.75.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.75.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.76.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.76.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.76.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.77.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.77.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.77.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.78.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.78.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.78.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.79.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.79.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.79.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.8.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.8.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.8.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.80.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.80.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.80.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.81.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.81.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.81.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.82.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.82.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.82.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.83.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.83.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.83.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.84.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.84.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.84.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.85.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.85.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.85.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.86.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.86.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.86.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.87.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.87.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.87.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.88.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.88.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.88.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.89.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.89.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.89.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.9.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.9.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.9.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.90.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.90.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.90.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.91.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.91.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.91.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.92.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.92.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.92.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.93.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.93.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.93.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.94.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.94.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.94.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.95.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.95.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.95.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.96.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.96.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.96.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.97.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.97.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.97.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.98.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.98.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.98.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.99.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.99.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.experts.99.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.gate.e_score_correction_bias": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.gate.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.shared_experts.down_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.shared_experts.gate_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.mlp.shared_experts.up_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.post_attention_layernorm.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.k_norm.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.k_proj.bias": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.k_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.o_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.q_norm.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.q_proj.bias": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.q_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.v_proj.bias": "model-00005-of-00092.safetensors",
+ "model.layers.4.self_attn.v_proj.weight": "model-00005-of-00092.safetensors",
+ "model.layers.5.input_layernorm.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.0.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.0.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.0.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.1.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.1.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.1.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.10.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.10.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.10.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.100.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.100.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.100.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.101.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.101.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.101.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.102.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.102.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.102.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.103.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.103.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.103.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.104.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.104.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.104.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.105.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.105.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.105.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.106.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.106.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.106.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.107.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.107.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.107.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.108.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.108.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.108.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.109.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.109.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.109.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.11.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.11.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.11.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.110.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.110.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.110.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.111.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.111.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.111.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.112.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.112.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.112.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.113.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.113.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.113.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.114.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.114.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.114.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.115.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.115.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.115.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.116.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.116.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.116.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.117.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.117.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.117.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.118.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.118.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.118.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.119.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.119.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.119.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.12.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.12.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.12.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.120.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.120.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.120.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.121.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.121.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.121.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.122.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.122.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.122.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.123.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.123.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.123.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.124.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.124.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.124.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.125.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.125.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.125.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.126.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.126.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.126.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.127.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.127.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.127.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.128.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.128.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.128.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.129.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.129.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.129.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.13.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.13.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.13.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.130.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.130.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.130.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.131.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.131.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.131.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.132.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.132.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.132.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.133.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.133.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.133.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.134.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.134.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.134.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.135.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.135.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.135.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.136.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.136.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.136.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.137.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.137.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.137.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.138.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.138.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.138.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.139.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.139.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.139.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.14.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.14.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.14.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.140.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.140.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.140.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.141.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.141.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.141.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.142.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.142.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.142.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.143.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.143.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.143.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.144.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.144.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.144.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.145.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.145.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.145.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.146.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.146.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.146.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.147.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.147.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.147.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.148.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.148.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.148.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.149.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.149.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.149.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.15.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.15.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.15.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.150.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.150.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.150.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.151.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.151.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.151.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.152.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.152.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.152.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.153.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.153.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.153.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.154.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.154.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.154.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.155.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.155.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.155.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.156.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.156.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.156.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.157.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.157.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.157.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.158.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.158.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.158.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.159.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.159.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.159.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.16.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.16.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.16.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.17.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.17.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.17.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.18.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.18.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.18.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.19.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.19.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.19.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.2.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.2.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.2.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.20.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.20.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.20.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.21.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.21.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.21.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.22.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.22.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.22.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.23.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.23.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.23.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.24.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.24.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.24.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.25.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.25.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.25.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.26.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.26.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.26.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.27.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.27.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.27.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.28.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.28.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.28.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.29.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.29.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.29.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.3.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.3.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.3.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.30.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.30.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.30.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.31.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.31.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.31.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.32.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.32.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.32.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.33.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.33.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.33.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.34.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.34.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.34.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.35.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.35.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.35.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.36.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.36.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.36.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.37.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.37.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.37.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.38.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.38.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.38.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.39.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.39.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.39.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.4.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.4.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.4.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.40.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.40.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.40.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.41.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.41.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.41.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.42.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.42.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.42.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.43.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.43.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.43.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.44.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.44.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.44.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.45.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.45.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.45.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.46.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.46.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.46.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.47.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.47.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.47.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.48.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.48.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.48.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.49.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.49.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.49.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.5.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.5.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.5.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.50.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.50.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.50.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.51.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.51.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.51.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.52.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.52.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.52.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.53.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.53.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.53.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.54.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.54.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.54.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.55.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.55.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.55.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.56.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.56.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.56.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.57.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.57.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.57.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.58.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.58.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.58.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.59.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.59.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.59.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.6.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.6.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.6.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.60.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.60.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.60.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.61.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.61.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.61.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.62.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.62.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.62.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.63.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.63.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.63.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.64.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.64.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.64.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.65.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.65.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.65.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.66.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.66.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.66.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.67.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.67.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.67.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.68.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.68.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.68.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.69.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.69.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.69.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.7.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.7.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.7.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.70.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.70.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.70.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.71.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.71.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.71.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.72.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.72.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.72.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.73.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.73.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.73.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.74.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.74.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.74.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.75.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.75.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.75.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.76.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.76.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.76.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.77.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.77.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.77.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.78.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.78.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.78.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.79.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.79.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.79.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.8.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.8.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.8.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.80.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.80.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.80.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.81.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.81.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.81.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.82.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.82.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.82.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.83.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.83.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.83.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.84.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.84.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.84.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.85.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.85.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.85.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.86.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.86.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.86.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.87.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.87.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.87.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.88.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.88.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.88.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.89.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.89.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.89.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.9.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.9.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.9.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.90.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.90.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.90.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.91.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.91.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.91.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.92.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.92.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.92.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.93.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.93.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.93.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.94.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.94.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.94.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.95.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.95.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.95.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.96.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.96.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.96.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.97.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.97.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.97.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.98.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.98.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.98.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.99.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.99.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.experts.99.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.gate.e_score_correction_bias": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.gate.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.shared_experts.down_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.shared_experts.gate_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.mlp.shared_experts.up_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.post_attention_layernorm.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.k_norm.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.k_proj.bias": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.k_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.o_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.q_norm.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.q_proj.bias": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.q_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.v_proj.bias": "model-00006-of-00092.safetensors",
+ "model.layers.5.self_attn.v_proj.weight": "model-00006-of-00092.safetensors",
+ "model.layers.6.input_layernorm.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.0.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.0.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.0.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.1.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.1.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.1.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.10.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.10.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.10.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.100.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.100.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.100.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.101.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.101.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.101.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.102.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.102.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.102.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.103.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.103.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.103.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.104.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.104.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.104.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.105.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.105.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.105.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.106.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.106.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.106.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.107.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.107.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.107.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.108.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.108.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.108.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.109.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.109.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.109.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.11.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.11.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.11.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.110.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.110.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.110.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.111.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.111.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.111.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.112.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.112.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.112.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.113.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.113.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.113.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.114.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.114.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.114.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.115.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.115.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.115.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.116.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.116.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.116.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.117.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.117.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.117.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.118.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.118.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.118.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.119.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.119.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.119.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.12.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.12.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.12.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.120.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.120.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.120.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.121.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.121.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.121.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.122.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.122.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.122.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.123.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.123.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.123.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.124.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.124.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.124.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.125.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.125.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.125.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.126.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.126.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.126.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.127.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.127.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.127.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.128.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.128.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.128.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.129.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.129.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.129.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.13.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.13.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.13.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.130.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.130.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.130.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.131.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.131.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.131.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.132.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.132.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.132.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.133.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.133.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.133.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.134.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.134.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.134.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.135.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.135.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.135.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.136.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.136.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.136.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.137.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.137.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.137.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.138.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.138.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.138.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.139.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.139.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.139.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.14.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.14.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.14.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.140.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.140.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.140.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.141.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.141.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.141.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.142.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.142.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.142.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.143.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.143.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.143.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.144.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.144.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.144.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.145.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.145.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.145.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.146.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.146.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.146.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.147.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.147.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.147.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.148.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.148.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.148.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.149.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.149.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.149.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.15.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.15.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.15.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.150.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.150.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.150.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.151.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.151.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.151.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.152.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.152.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.152.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.153.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.153.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.153.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.154.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.154.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.154.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.155.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.155.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.155.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.156.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.156.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.156.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.157.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.157.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.157.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.158.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.158.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.158.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.159.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.159.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.159.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.16.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.16.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.16.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.17.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.17.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.17.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.18.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.18.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.18.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.19.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.19.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.19.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.2.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.2.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.2.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.20.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.20.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.20.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.21.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.21.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.21.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.22.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.22.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.22.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.23.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.23.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.23.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.24.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.24.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.24.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.25.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.25.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.25.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.26.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.26.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.26.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.27.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.27.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.27.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.28.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.28.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.28.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.29.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.29.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.29.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.3.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.3.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.3.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.30.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.30.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.30.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.31.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.31.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.31.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.32.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.32.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.32.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.33.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.33.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.33.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.34.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.34.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.34.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.35.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.35.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.35.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.36.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.36.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.36.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.37.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.37.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.37.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.38.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.38.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.38.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.39.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.39.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.39.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.4.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.4.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.4.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.40.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.40.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.40.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.41.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.41.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.41.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.42.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.42.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.42.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.43.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.43.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.43.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.44.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.44.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.44.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.45.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.45.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.45.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.46.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.46.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.46.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.47.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.47.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.47.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.48.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.48.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.48.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.49.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.49.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.49.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.5.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.5.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.5.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.50.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.50.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.50.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.51.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.51.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.51.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.52.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.52.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.52.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.53.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.53.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.53.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.54.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.54.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.54.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.55.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.55.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.55.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.56.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.56.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.56.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.57.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.57.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.57.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.58.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.58.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.58.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.59.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.59.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.59.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.6.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.6.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.6.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.60.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.60.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.60.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.61.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.61.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.61.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.62.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.62.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.62.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.63.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.63.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.63.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.64.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.64.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.64.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.65.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.65.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.65.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.66.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.66.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.66.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.67.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.67.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.67.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.68.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.68.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.68.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.69.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.69.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.69.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.7.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.7.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.7.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.70.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.70.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.70.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.71.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.71.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.71.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.72.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.72.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.72.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.73.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.73.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.73.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.74.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.74.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.74.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.75.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.75.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.75.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.76.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.76.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.76.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.77.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.77.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.77.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.78.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.78.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.78.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.79.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.79.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.79.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.8.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.8.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.8.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.80.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.80.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.80.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.81.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.81.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.81.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.82.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.82.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.82.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.83.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.83.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.83.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.84.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.84.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.84.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.85.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.85.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.85.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.86.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.86.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.86.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.87.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.87.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.87.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.88.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.88.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.88.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.89.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.89.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.89.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.9.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.9.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.9.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.90.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.90.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.90.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.91.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.91.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.91.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.92.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.92.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.92.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.93.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.93.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.93.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.94.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.94.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.94.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.95.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.95.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.95.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.96.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.96.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.96.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.97.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.97.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.97.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.98.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.98.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.98.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.99.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.99.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.experts.99.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.gate.e_score_correction_bias": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.gate.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.shared_experts.down_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.shared_experts.gate_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.mlp.shared_experts.up_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.post_attention_layernorm.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.k_norm.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.k_proj.bias": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.k_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.o_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.q_norm.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.q_proj.bias": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.q_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.v_proj.bias": "model-00007-of-00092.safetensors",
+ "model.layers.6.self_attn.v_proj.weight": "model-00007-of-00092.safetensors",
+ "model.layers.7.input_layernorm.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.0.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.0.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.0.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.1.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.1.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.1.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.10.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.10.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.10.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.100.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.100.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.100.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.101.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.101.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.101.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.102.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.102.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.102.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.103.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.103.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.103.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.104.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.104.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.104.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.105.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.105.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.105.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.106.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.106.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.106.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.107.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.107.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.107.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.108.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.108.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.108.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.109.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.109.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.109.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.11.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.11.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.11.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.110.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.110.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.110.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.111.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.111.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.111.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.112.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.112.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.112.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.113.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.113.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.113.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.114.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.114.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.114.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.115.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.115.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.115.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.116.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.116.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.116.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.117.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.117.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.117.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.118.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.118.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.118.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.119.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.119.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.119.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.12.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.12.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.12.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.120.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.120.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.120.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.121.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.121.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.121.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.122.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.122.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.122.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.123.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.123.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.123.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.124.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.124.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.124.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.125.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.125.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.125.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.126.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.126.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.126.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.127.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.127.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.127.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.128.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.128.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.128.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.129.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.129.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.129.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.13.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.13.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.13.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.130.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.130.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.130.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.131.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.131.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.131.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.132.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.132.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.132.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.133.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.133.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.133.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.134.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.134.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.134.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.135.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.135.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.135.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.136.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.136.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.136.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.137.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.137.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.137.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.138.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.138.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.138.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.139.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.139.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.139.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.14.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.14.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.14.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.140.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.140.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.140.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.141.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.141.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.141.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.142.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.142.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.142.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.143.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.143.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.143.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.144.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.144.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.144.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.145.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.145.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.145.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.146.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.146.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.146.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.147.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.147.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.147.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.148.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.148.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.148.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.149.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.149.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.149.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.15.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.15.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.15.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.150.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.150.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.150.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.151.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.151.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.151.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.152.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.152.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.152.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.153.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.153.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.153.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.154.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.154.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.154.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.155.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.155.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.155.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.156.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.156.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.156.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.157.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.157.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.157.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.158.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.158.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.158.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.159.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.159.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.159.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.16.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.16.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.16.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.17.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.17.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.17.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.18.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.18.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.18.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.19.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.19.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.19.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.2.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.2.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.2.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.20.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.20.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.20.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.21.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.21.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.21.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.22.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.22.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.22.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.23.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.23.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.23.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.24.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.24.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.24.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.25.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.25.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.25.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.26.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.26.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.26.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.27.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.27.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.27.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.28.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.28.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.28.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.29.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.29.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.29.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.3.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.3.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.3.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.30.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.30.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.30.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.31.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.31.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.31.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.32.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.32.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.32.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.33.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.33.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.33.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.34.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.34.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.34.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.35.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.35.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.35.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.36.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.36.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.36.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.37.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.37.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.37.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.38.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.38.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.38.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.39.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.39.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.39.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.4.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.4.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.4.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.40.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.40.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.40.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.41.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.41.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.41.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.42.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.42.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.42.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.43.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.43.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.43.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.44.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.44.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.44.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.45.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.45.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.45.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.46.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.46.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.46.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.47.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.47.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.47.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.48.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.48.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.48.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.49.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.49.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.49.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.5.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.5.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.5.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.50.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.50.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.50.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.51.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.51.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.51.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.52.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.52.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.52.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.53.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.53.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.53.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.54.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.54.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.54.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.55.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.55.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.55.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.56.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.56.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.56.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.57.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.57.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.57.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.58.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.58.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.58.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.59.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.59.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.59.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.6.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.6.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.6.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.60.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.60.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.60.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.61.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.61.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.61.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.62.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.62.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.62.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.63.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.63.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.63.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.64.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.64.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.64.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.65.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.65.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.65.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.66.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.66.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.66.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.67.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.67.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.67.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.68.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.68.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.68.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.69.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.69.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.69.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.7.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.7.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.7.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.70.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.70.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.70.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.71.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.71.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.71.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.72.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.72.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.72.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.73.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.73.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.73.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.74.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.74.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.74.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.75.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.75.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.75.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.76.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.76.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.76.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.77.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.77.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.77.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.78.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.78.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.78.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.79.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.79.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.79.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.8.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.8.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.8.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.80.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.80.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.80.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.81.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.81.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.81.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.82.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.82.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.82.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.83.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.83.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.83.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.84.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.84.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.84.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.85.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.85.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.85.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.86.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.86.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.86.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.87.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.87.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.87.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.88.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.88.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.88.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.89.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.89.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.89.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.9.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.9.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.9.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.90.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.90.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.90.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.91.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.91.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.91.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.92.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.92.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.92.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.93.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.93.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.93.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.94.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.94.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.94.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.95.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.95.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.95.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.96.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.96.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.96.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.97.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.97.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.97.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.98.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.98.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.98.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.99.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.99.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.experts.99.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.gate.e_score_correction_bias": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.gate.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.shared_experts.down_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.shared_experts.gate_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.mlp.shared_experts.up_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.post_attention_layernorm.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.k_norm.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.k_proj.bias": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.k_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.o_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.q_norm.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.q_proj.bias": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.q_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.v_proj.bias": "model-00008-of-00092.safetensors",
+ "model.layers.7.self_attn.v_proj.weight": "model-00008-of-00092.safetensors",
+ "model.layers.8.input_layernorm.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.0.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.0.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.0.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.1.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.1.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.1.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.10.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.10.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.10.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.100.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.100.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.100.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.101.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.101.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.101.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.102.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.102.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.102.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.103.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.103.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.103.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.104.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.104.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.104.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.105.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.105.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.105.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.106.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.106.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.106.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.107.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.107.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.107.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.108.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.108.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.108.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.109.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.109.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.109.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.11.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.11.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.11.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.110.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.110.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.110.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.111.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.111.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.111.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.112.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.112.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.112.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.113.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.113.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.113.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.114.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.114.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.114.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.115.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.115.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.115.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.116.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.116.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.116.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.117.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.117.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.117.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.118.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.118.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.118.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.119.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.119.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.119.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.12.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.12.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.12.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.120.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.120.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.120.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.121.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.121.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.121.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.122.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.122.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.122.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.123.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.123.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.123.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.124.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.124.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.124.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.125.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.125.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.125.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.126.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.126.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.126.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.127.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.127.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.127.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.128.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.128.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.128.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.129.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.129.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.129.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.13.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.13.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.13.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.130.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.130.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.130.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.131.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.131.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.131.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.132.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.132.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.132.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.133.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.133.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.133.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.134.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.134.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.134.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.135.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.135.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.135.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.136.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.136.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.136.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.137.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.137.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.137.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.138.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.138.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.138.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.139.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.139.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.139.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.14.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.14.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.14.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.140.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.140.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.140.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.141.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.141.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.141.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.142.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.142.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.142.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.143.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.143.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.143.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.144.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.144.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.144.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.145.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.145.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.145.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.146.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.146.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.146.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.147.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.147.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.147.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.148.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.148.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.148.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.149.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.149.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.149.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.15.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.15.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.15.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.150.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.150.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.150.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.151.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.151.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.151.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.152.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.152.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.152.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.153.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.153.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.153.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.154.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.154.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.154.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.155.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.155.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.155.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.156.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.156.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.156.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.157.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.157.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.157.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.158.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.158.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.158.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.159.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.159.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.159.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.16.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.16.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.16.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.17.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.17.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.17.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.18.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.18.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.18.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.19.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.19.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.19.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.2.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.2.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.2.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.20.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.20.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.20.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.21.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.21.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.21.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.22.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.22.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.22.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.23.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.23.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.23.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.24.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.24.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.24.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.25.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.25.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.25.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.26.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.26.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.26.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.27.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.27.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.27.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.28.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.28.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.28.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.29.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.29.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.29.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.3.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.3.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.3.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.30.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.30.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.30.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.31.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.31.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.31.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.32.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.32.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.32.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.33.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.33.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.33.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.34.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.34.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.34.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.35.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.35.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.35.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.36.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.36.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.36.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.37.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.37.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.37.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.38.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.38.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.38.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.39.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.39.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.39.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.4.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.4.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.4.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.40.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.40.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.40.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.41.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.41.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.41.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.42.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.42.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.42.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.43.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.43.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.43.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.44.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.44.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.44.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.45.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.45.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.45.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.46.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.46.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.46.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.47.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.47.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.47.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.48.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.48.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.48.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.49.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.49.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.49.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.5.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.5.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.5.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.50.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.50.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.50.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.51.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.51.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.51.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.52.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.52.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.52.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.53.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.53.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.53.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.54.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.54.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.54.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.55.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.55.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.55.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.56.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.56.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.56.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.57.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.57.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.57.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.58.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.58.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.58.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.59.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.59.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.59.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.6.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.6.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.6.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.60.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.60.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.60.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.61.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.61.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.61.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.62.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.62.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.62.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.63.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.63.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.63.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.64.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.64.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.64.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.65.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.65.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.65.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.66.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.66.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.66.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.67.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.67.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.67.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.68.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.68.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.68.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.69.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.69.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.69.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.7.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.7.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.7.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.70.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.70.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.70.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.71.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.71.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.71.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.72.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.72.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.72.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.73.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.73.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.73.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.74.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.74.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.74.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.75.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.75.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.75.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.76.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.76.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.76.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.77.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.77.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.77.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.78.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.78.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.78.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.79.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.79.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.79.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.8.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.8.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.8.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.80.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.80.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.80.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.81.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.81.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.81.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.82.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.82.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.82.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.83.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.83.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.83.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.84.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.84.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.84.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.85.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.85.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.85.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.86.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.86.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.86.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.87.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.87.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.87.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.88.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.88.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.88.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.89.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.89.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.89.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.9.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.9.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.9.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.90.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.90.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.90.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.91.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.91.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.91.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.92.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.92.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.92.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.93.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.93.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.93.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.94.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.94.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.94.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.95.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.95.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.95.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.96.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.96.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.96.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.97.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.97.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.97.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.98.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.98.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.98.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.99.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.99.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.experts.99.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.gate.e_score_correction_bias": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.gate.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.shared_experts.down_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.shared_experts.gate_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.mlp.shared_experts.up_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.post_attention_layernorm.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.k_norm.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.k_proj.bias": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.k_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.o_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.q_norm.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.q_proj.bias": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.q_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.v_proj.bias": "model-00009-of-00092.safetensors",
+ "model.layers.8.self_attn.v_proj.weight": "model-00009-of-00092.safetensors",
+ "model.layers.9.input_layernorm.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.0.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.0.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.0.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.1.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.1.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.1.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.10.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.10.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.10.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.100.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.100.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.100.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.101.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.101.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.101.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.102.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.102.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.102.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.103.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.103.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.103.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.104.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.104.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.104.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.105.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.105.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.105.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.106.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.106.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.106.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.107.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.107.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.107.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.108.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.108.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.108.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.109.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.109.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.109.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.11.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.11.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.11.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.110.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.110.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.110.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.111.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.111.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.111.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.112.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.112.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.112.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.113.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.113.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.113.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.114.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.114.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.114.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.115.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.115.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.115.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.116.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.116.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.116.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.117.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.117.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.117.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.118.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.118.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.118.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.119.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.119.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.119.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.12.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.12.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.12.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.120.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.120.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.120.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.121.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.121.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.121.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.122.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.122.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.122.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.123.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.123.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.123.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.124.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.124.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.124.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.125.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.125.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.125.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.126.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.126.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.126.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.127.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.127.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.127.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.128.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.128.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.128.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.129.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.129.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.129.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.13.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.13.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.13.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.130.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.130.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.130.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.131.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.131.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.131.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.132.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.132.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.132.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.133.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.133.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.133.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.134.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.134.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.134.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.135.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.135.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.135.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.136.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.136.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.136.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.137.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.137.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.137.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.138.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.138.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.138.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.139.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.139.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.139.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.14.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.14.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.14.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.140.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.140.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.140.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.141.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.141.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.141.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.142.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.142.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.142.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.143.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.143.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.143.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.144.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.144.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.144.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.145.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.145.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.145.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.146.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.146.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.146.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.147.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.147.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.147.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.148.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.148.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.148.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.149.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.149.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.149.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.15.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.15.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.15.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.150.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.150.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.150.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.151.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.151.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.151.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.152.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.152.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.152.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.153.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.153.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.153.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.154.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.154.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.154.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.155.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.155.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.155.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.156.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.156.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.156.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.157.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.157.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.157.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.158.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.158.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.158.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.159.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.159.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.159.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.16.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.16.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.16.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.17.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.17.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.17.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.18.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.18.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.18.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.19.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.19.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.19.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.2.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.2.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.2.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.20.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.20.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.20.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.21.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.21.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.21.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.22.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.22.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.22.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.23.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.23.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.23.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.24.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.24.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.24.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.25.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.25.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.25.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.26.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.26.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.26.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.27.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.27.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.27.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.28.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.28.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.28.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.29.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.29.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.29.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.3.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.3.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.3.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.30.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.30.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.30.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.31.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.31.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.31.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.32.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.32.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.32.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.33.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.33.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.33.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.34.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.34.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.34.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.35.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.35.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.35.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.36.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.36.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.36.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.37.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.37.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.37.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.38.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.38.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.38.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.39.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.39.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.39.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.4.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.4.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.4.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.40.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.40.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.40.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.41.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.41.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.41.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.42.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.42.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.42.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.43.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.43.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.43.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.44.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.44.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.44.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.45.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.45.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.45.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.46.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.46.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.46.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.47.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.47.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.47.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.48.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.48.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.48.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.49.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.49.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.49.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.5.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.5.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.5.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.50.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.50.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.50.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.51.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.51.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.51.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.52.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.52.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.52.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.53.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.53.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.53.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.54.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.54.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.54.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.55.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.55.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.55.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.56.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.56.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.56.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.57.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.57.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.57.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.58.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.58.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.58.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.59.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.59.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.59.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.6.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.6.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.6.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.60.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.60.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.60.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.61.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.61.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.61.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.62.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.62.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.62.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.63.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.63.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.63.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.64.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.64.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.64.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.65.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.65.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.65.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.66.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.66.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.66.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.67.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.67.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.67.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.68.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.68.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.68.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.69.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.69.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.69.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.7.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.7.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.7.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.70.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.70.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.70.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.71.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.71.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.71.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.72.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.72.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.72.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.73.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.73.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.73.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.74.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.74.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.74.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.75.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.75.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.75.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.76.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.76.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.76.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.77.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.77.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.77.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.78.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.78.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.78.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.79.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.79.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.79.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.8.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.8.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.8.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.80.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.80.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.80.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.81.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.81.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.81.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.82.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.82.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.82.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.83.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.83.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.83.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.84.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.84.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.84.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.85.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.85.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.85.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.86.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.86.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.86.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.87.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.87.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.87.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.88.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.88.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.88.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.89.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.89.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.89.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.9.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.9.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.9.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.90.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.90.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.90.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.91.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.91.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.91.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.92.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.92.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.92.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.93.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.93.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.93.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.94.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.94.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.94.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.95.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.95.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.95.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.96.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.96.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.96.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.97.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.97.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.97.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.98.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.98.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.98.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.99.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.99.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.experts.99.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.gate.e_score_correction_bias": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.gate.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.shared_experts.down_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.shared_experts.gate_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.mlp.shared_experts.up_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.post_attention_layernorm.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.k_norm.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.k_proj.bias": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.k_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.o_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.q_norm.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.q_proj.bias": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.q_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.v_proj.bias": "model-00010-of-00092.safetensors",
+ "model.layers.9.self_attn.v_proj.weight": "model-00010-of-00092.safetensors",
+ "model.layers.10.input_layernorm.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.0.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.0.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.0.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.1.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.1.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.1.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.10.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.10.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.10.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.100.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.100.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.100.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.101.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.101.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.101.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.102.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.102.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.102.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.103.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.103.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.103.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.104.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.104.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.104.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.105.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.105.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.105.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.106.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.106.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.106.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.107.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.107.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.107.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.108.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.108.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.108.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.109.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.109.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.109.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.11.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.11.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.11.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.110.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.110.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.110.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.111.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.111.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.111.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.112.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.112.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.112.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.113.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.113.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.113.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.114.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.114.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.114.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.115.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.115.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.115.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.116.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.116.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.116.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.117.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.117.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.117.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.118.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.118.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.118.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.119.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.119.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.119.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.12.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.12.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.12.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.120.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.120.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.120.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.121.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.121.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.121.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.122.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.122.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.122.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.123.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.123.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.123.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.124.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.124.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.124.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.125.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.125.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.125.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.126.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.126.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.126.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.127.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.127.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.127.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.128.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.128.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.128.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.129.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.129.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.129.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.13.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.13.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.13.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.130.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.130.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.130.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.131.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.131.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.131.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.132.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.132.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.132.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.133.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.133.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.133.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.134.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.134.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.134.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.135.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.135.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.135.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.136.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.136.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.136.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.137.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.137.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.137.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.138.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.138.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.138.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.139.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.139.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.139.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.14.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.14.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.14.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.140.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.140.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.140.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.141.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.141.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.141.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.142.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.142.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.142.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.143.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.143.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.143.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.144.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.144.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.144.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.145.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.145.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.145.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.146.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.146.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.146.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.147.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.147.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.147.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.148.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.148.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.148.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.149.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.149.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.149.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.15.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.15.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.15.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.150.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.150.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.150.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.151.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.151.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.151.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.152.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.152.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.152.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.153.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.153.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.153.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.154.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.154.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.154.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.155.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.155.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.155.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.156.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.156.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.156.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.157.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.157.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.157.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.158.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.158.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.158.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.159.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.159.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.159.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.16.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.16.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.16.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.17.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.17.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.17.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.18.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.18.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.18.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.19.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.19.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.19.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.2.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.2.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.2.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.20.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.20.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.20.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.21.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.21.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.21.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.22.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.22.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.22.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.23.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.23.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.23.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.24.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.24.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.24.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.25.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.25.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.25.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.26.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.26.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.26.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.27.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.27.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.27.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.28.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.28.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.28.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.29.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.29.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.29.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.3.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.3.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.3.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.30.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.30.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.30.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.31.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.31.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.31.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.32.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.32.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.32.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.33.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.33.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.33.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.34.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.34.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.34.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.35.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.35.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.35.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.36.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.36.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.36.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.37.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.37.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.37.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.38.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.38.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.38.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.39.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.39.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.39.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.4.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.4.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.4.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.40.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.40.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.40.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.41.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.41.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.41.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.42.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.42.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.42.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.43.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.43.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.43.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.44.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.44.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.44.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.45.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.45.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.45.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.46.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.46.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.46.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.47.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.47.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.47.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.48.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.48.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.48.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.49.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.49.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.49.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.5.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.5.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.5.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.50.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.50.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.50.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.51.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.51.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.51.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.52.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.52.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.52.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.53.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.53.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.53.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.54.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.54.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.54.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.55.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.55.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.55.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.56.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.56.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.56.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.57.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.57.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.57.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.58.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.58.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.58.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.59.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.59.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.59.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.6.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.6.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.6.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.60.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.60.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.60.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.61.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.61.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.61.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.62.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.62.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.62.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.63.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.63.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.63.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.64.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.64.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.64.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.65.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.65.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.65.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.66.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.66.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.66.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.67.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.67.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.67.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.68.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.68.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.68.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.69.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.69.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.69.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.7.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.7.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.7.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.70.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.70.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.70.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.71.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.71.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.71.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.72.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.72.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.72.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.73.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.73.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.73.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.74.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.74.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.74.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.75.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.75.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.75.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.76.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.76.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.76.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.77.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.77.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.77.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.78.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.78.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.78.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.79.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.79.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.79.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.8.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.8.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.8.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.80.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.80.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.80.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.81.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.81.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.81.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.82.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.82.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.82.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.83.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.83.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.83.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.84.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.84.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.84.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.85.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.85.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.85.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.86.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.86.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.86.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.87.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.87.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.87.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.88.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.88.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.88.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.89.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.89.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.89.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.9.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.9.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.9.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.90.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.90.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.90.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.91.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.91.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.91.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.92.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.92.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.92.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.93.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.93.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.93.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.94.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.94.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.94.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.95.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.95.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.95.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.96.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.96.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.96.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.97.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.97.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.97.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.98.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.98.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.98.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.99.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.99.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.experts.99.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.gate.e_score_correction_bias": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.gate.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.shared_experts.down_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.shared_experts.gate_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.mlp.shared_experts.up_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.post_attention_layernorm.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.k_norm.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.k_proj.bias": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.k_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.o_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.q_norm.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.q_proj.bias": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.q_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.v_proj.bias": "model-00011-of-00092.safetensors",
+ "model.layers.10.self_attn.v_proj.weight": "model-00011-of-00092.safetensors",
+ "model.layers.11.input_layernorm.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.0.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.0.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.0.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.1.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.1.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.1.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.10.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.10.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.10.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.100.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.100.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.100.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.101.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.101.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.101.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.102.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.102.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.102.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.103.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.103.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.103.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.104.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.104.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.104.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.105.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.105.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.105.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.106.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.106.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.106.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.107.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.107.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.107.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.108.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.108.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.108.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.109.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.109.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.109.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.11.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.11.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.11.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.110.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.110.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.110.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.111.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.111.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.111.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.112.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.112.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.112.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.113.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.113.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.113.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.114.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.114.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.114.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.115.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.115.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.115.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.116.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.116.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.116.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.117.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.117.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.117.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.118.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.118.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.118.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.119.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.119.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.119.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.12.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.12.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.12.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.120.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.120.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.120.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.121.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.121.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.121.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.122.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.122.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.122.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.123.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.123.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.123.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.124.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.124.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.124.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.125.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.125.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.125.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.126.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.126.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.126.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.127.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.127.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.127.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.128.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.128.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.128.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.129.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.129.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.129.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.13.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.13.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.13.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.130.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.130.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.130.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.131.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.131.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.131.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.132.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.132.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.132.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.133.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.133.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.133.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.134.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.134.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.134.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.135.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.135.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.135.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.136.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.136.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.136.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.137.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.137.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.137.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.138.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.138.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.138.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.139.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.139.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.139.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.14.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.14.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.14.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.140.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.140.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.140.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.141.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.141.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.141.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.142.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.142.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.142.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.143.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.143.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.143.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.144.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.144.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.144.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.145.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.145.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.145.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.146.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.146.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.146.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.147.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.147.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.147.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.148.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.148.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.148.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.149.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.149.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.149.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.15.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.15.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.15.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.150.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.150.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.150.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.151.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.151.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.151.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.152.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.152.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.152.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.153.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.153.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.153.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.154.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.154.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.154.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.155.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.155.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.155.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.156.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.156.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.156.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.157.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.157.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.157.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.158.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.158.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.158.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.159.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.159.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.159.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.16.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.16.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.16.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.17.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.17.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.17.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.18.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.18.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.18.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.19.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.19.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.19.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.2.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.2.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.2.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.20.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.20.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.20.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.21.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.21.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.21.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.22.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.22.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.22.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.23.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.23.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.23.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.24.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.24.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.24.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.25.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.25.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.25.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.26.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.26.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.26.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.27.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.27.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.27.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.28.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.28.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.28.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.29.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.29.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.29.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.3.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.3.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.3.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.30.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.30.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.30.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.31.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.31.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.31.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.32.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.32.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.32.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.33.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.33.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.33.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.34.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.34.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.34.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.35.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.35.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.35.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.36.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.36.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.36.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.37.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.37.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.37.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.38.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.38.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.38.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.39.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.39.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.39.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.4.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.4.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.4.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.40.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.40.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.40.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.41.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.41.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.41.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.42.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.42.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.42.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.43.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.43.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.43.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.44.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.44.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.44.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.45.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.45.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.45.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.46.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.46.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.46.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.47.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.47.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.47.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.48.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.48.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.48.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.49.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.49.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.49.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.5.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.5.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.5.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.50.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.50.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.50.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.51.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.51.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.51.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.52.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.52.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.52.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.53.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.53.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.53.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.54.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.54.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.54.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.55.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.55.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.55.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.56.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.56.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.56.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.57.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.57.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.57.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.58.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.58.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.58.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.59.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.59.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.59.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.6.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.6.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.6.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.60.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.60.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.60.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.61.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.61.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.61.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.62.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.62.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.62.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.63.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.63.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.63.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.64.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.64.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.64.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.65.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.65.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.65.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.66.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.66.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.66.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.67.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.67.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.67.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.68.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.68.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.68.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.69.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.69.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.69.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.7.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.7.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.7.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.70.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.70.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.70.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.71.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.71.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.71.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.72.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.72.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.72.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.73.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.73.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.73.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.74.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.74.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.74.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.75.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.75.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.75.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.76.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.76.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.76.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.77.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.77.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.77.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.78.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.78.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.78.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.79.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.79.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.79.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.8.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.8.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.8.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.80.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.80.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.80.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.81.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.81.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.81.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.82.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.82.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.82.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.83.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.83.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.83.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.84.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.84.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.84.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.85.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.85.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.85.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.86.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.86.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.86.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.87.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.87.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.87.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.88.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.88.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.88.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.89.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.89.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.89.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.9.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.9.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.9.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.90.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.90.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.90.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.91.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.91.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.91.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.92.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.92.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.92.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.93.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.93.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.93.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.94.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.94.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.94.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.95.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.95.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.95.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.96.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.96.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.96.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.97.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.97.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.97.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.98.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.98.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.98.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.99.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.99.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.experts.99.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.gate.e_score_correction_bias": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.gate.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.shared_experts.down_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.shared_experts.gate_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.mlp.shared_experts.up_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.post_attention_layernorm.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.k_norm.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.k_proj.bias": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.k_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.o_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.q_norm.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.q_proj.bias": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.q_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.v_proj.bias": "model-00012-of-00092.safetensors",
+ "model.layers.11.self_attn.v_proj.weight": "model-00012-of-00092.safetensors",
+ "model.layers.12.input_layernorm.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.0.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.0.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.0.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.1.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.1.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.1.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.10.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.10.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.10.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.100.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.100.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.100.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.101.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.101.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.101.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.102.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.102.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.102.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.103.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.103.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.103.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.104.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.104.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.104.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.105.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.105.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.105.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.106.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.106.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.106.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.107.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.107.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.107.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.108.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.108.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.108.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.109.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.109.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.109.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.11.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.11.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.11.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.110.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.110.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.110.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.111.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.111.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.111.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.112.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.112.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.112.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.113.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.113.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.113.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.114.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.114.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.114.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.115.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.115.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.115.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.116.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.116.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.116.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.117.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.117.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.117.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.118.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.118.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.118.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.119.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.119.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.119.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.12.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.12.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.12.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.120.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.120.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.120.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.121.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.121.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.121.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.122.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.122.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.122.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.123.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.123.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.123.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.124.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.124.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.124.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.125.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.125.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.125.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.126.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.126.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.126.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.127.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.127.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.127.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.128.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.128.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.128.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.129.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.129.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.129.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.13.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.13.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.13.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.130.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.130.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.130.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.131.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.131.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.131.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.132.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.132.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.132.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.133.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.133.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.133.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.134.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.134.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.134.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.135.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.135.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.135.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.136.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.136.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.136.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.137.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.137.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.137.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.138.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.138.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.138.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.139.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.139.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.139.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.14.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.14.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.14.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.140.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.140.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.140.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.141.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.141.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.141.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.142.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.142.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.142.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.143.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.143.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.143.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.144.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.144.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.144.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.145.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.145.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.145.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.146.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.146.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.146.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.147.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.147.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.147.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.148.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.148.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.148.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.149.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.149.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.149.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.15.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.15.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.15.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.150.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.150.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.150.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.151.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.151.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.151.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.152.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.152.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.152.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.153.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.153.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.153.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.154.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.154.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.154.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.155.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.155.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.155.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.156.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.156.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.156.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.157.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.157.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.157.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.158.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.158.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.158.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.159.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.159.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.159.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.16.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.16.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.16.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.17.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.17.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.17.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.18.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.18.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.18.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.19.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.19.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.19.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.2.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.2.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.2.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.20.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.20.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.20.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.21.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.21.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.21.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.22.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.22.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.22.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.23.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.23.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.23.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.24.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.24.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.24.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.25.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.25.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.25.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.26.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.26.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.26.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.27.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.27.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.27.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.28.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.28.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.28.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.29.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.29.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.29.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.3.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.3.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.3.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.30.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.30.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.30.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.31.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.31.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.31.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.32.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.32.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.32.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.33.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.33.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.33.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.34.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.34.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.34.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.35.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.35.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.35.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.36.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.36.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.36.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.37.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.37.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.37.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.38.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.38.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.38.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.39.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.39.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.39.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.4.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.4.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.4.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.40.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.40.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.40.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.41.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.41.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.41.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.42.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.42.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.42.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.43.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.43.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.43.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.44.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.44.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.44.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.45.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.45.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.45.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.46.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.46.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.46.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.47.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.47.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.47.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.48.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.48.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.48.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.49.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.49.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.49.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.5.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.5.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.5.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.50.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.50.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.50.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.51.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.51.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.51.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.52.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.52.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.52.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.53.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.53.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.53.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.54.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.54.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.54.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.55.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.55.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.55.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.56.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.56.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.56.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.57.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.57.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.57.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.58.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.58.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.58.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.59.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.59.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.59.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.6.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.6.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.6.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.60.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.60.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.60.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.61.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.61.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.61.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.62.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.62.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.62.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.63.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.63.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.63.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.64.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.64.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.64.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.65.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.65.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.65.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.66.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.66.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.66.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.67.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.67.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.67.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.68.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.68.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.68.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.69.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.69.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.69.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.7.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.7.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.7.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.70.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.70.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.70.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.71.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.71.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.71.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.72.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.72.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.72.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.73.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.73.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.73.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.74.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.74.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.74.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.75.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.75.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.75.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.76.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.76.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.76.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.77.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.77.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.77.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.78.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.78.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.78.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.79.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.79.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.79.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.8.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.8.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.8.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.80.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.80.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.80.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.81.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.81.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.81.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.82.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.82.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.82.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.83.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.83.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.83.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.84.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.84.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.84.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.85.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.85.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.85.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.86.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.86.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.86.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.87.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.87.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.87.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.88.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.88.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.88.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.89.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.89.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.89.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.9.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.9.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.9.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.90.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.90.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.90.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.91.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.91.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.91.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.92.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.92.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.92.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.93.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.93.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.93.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.94.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.94.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.94.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.95.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.95.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.95.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.96.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.96.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.96.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.97.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.97.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.97.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.98.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.98.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.98.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.99.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.99.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.experts.99.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.gate.e_score_correction_bias": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.gate.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.shared_experts.down_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.shared_experts.gate_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.mlp.shared_experts.up_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.post_attention_layernorm.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.k_norm.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.k_proj.bias": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.k_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.o_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.q_norm.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.q_proj.bias": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.q_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.v_proj.bias": "model-00013-of-00092.safetensors",
+ "model.layers.12.self_attn.v_proj.weight": "model-00013-of-00092.safetensors",
+ "model.layers.13.input_layernorm.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.0.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.0.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.0.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.1.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.1.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.1.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.10.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.10.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.10.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.100.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.100.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.100.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.101.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.101.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.101.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.102.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.102.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.102.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.103.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.103.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.103.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.104.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.104.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.104.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.105.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.105.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.105.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.106.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.106.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.106.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.107.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.107.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.107.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.108.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.108.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.108.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.109.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.109.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.109.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.11.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.11.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.11.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.110.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.110.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.110.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.111.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.111.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.111.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.112.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.112.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.112.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.113.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.113.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.113.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.114.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.114.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.114.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.115.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.115.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.115.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.116.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.116.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.116.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.117.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.117.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.117.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.118.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.118.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.118.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.119.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.119.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.119.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.12.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.12.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.12.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.120.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.120.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.120.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.121.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.121.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.121.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.122.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.122.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.122.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.123.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.123.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.123.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.124.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.124.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.124.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.125.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.125.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.125.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.126.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.126.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.126.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.127.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.127.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.127.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.128.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.128.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.128.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.129.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.129.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.129.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.13.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.13.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.13.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.130.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.130.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.130.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.131.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.131.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.131.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.132.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.132.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.132.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.133.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.133.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.133.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.134.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.134.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.134.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.135.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.135.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.135.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.136.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.136.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.136.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.137.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.137.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.137.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.138.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.138.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.138.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.139.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.139.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.139.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.14.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.14.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.14.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.140.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.140.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.140.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.141.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.141.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.141.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.142.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.142.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.142.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.143.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.143.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.143.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.144.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.144.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.144.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.145.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.145.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.145.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.146.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.146.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.146.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.147.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.147.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.147.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.148.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.148.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.148.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.149.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.149.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.149.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.15.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.15.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.15.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.150.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.150.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.150.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.151.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.151.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.151.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.152.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.152.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.152.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.153.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.153.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.153.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.154.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.154.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.154.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.155.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.155.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.155.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.156.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.156.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.156.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.157.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.157.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.157.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.158.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.158.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.158.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.159.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.159.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.159.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.16.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.16.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.16.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.17.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.17.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.17.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.18.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.18.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.18.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.19.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.19.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.19.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.2.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.2.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.2.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.20.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.20.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.20.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.21.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.21.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.21.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.22.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.22.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.22.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.23.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.23.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.23.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.24.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.24.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.24.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.25.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.25.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.25.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.26.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.26.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.26.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.27.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.27.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.27.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.28.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.28.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.28.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.29.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.29.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.29.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.3.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.3.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.3.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.30.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.30.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.30.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.31.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.31.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.31.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.32.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.32.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.32.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.33.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.33.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.33.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.34.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.34.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.34.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.35.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.35.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.35.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.36.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.36.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.36.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.37.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.37.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.37.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.38.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.38.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.38.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.39.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.39.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.39.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.4.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.4.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.4.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.40.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.40.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.40.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.41.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.41.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.41.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.42.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.42.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.42.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.43.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.43.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.43.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.44.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.44.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.44.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.45.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.45.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.45.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.46.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.46.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.46.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.47.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.47.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.47.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.48.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.48.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.48.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.49.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.49.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.49.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.5.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.5.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.5.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.50.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.50.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.50.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.51.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.51.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.51.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.52.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.52.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.52.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.53.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.53.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.53.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.54.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.54.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.54.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.55.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.55.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.55.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.56.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.56.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.56.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.57.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.57.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.57.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.58.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.58.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.58.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.59.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.59.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.59.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.6.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.6.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.6.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.60.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.60.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.60.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.61.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.61.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.61.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.62.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.62.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.62.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.63.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.63.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.63.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.64.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.64.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.64.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.65.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.65.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.65.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.66.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.66.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.66.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.67.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.67.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.67.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.68.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.68.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.68.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.69.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.69.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.69.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.7.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.7.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.7.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.70.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.70.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.70.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.71.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.71.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.71.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.72.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.72.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.72.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.73.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.73.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.73.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.74.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.74.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.74.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.75.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.75.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.75.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.76.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.76.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.76.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.77.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.77.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.77.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.78.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.78.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.78.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.79.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.79.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.79.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.8.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.8.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.8.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.80.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.80.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.80.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.81.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.81.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.81.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.82.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.82.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.82.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.83.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.83.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.83.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.84.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.84.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.84.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.85.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.85.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.85.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.86.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.86.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.86.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.87.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.87.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.87.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.88.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.88.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.88.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.89.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.89.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.89.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.9.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.9.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.9.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.90.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.90.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.90.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.91.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.91.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.91.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.92.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.92.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.92.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.93.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.93.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.93.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.94.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.94.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.94.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.95.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.95.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.95.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.96.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.96.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.96.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.97.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.97.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.97.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.98.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.98.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.98.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.99.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.99.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.experts.99.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.gate.e_score_correction_bias": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.gate.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.shared_experts.down_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.shared_experts.gate_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.mlp.shared_experts.up_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.post_attention_layernorm.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.k_norm.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.k_proj.bias": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.k_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.o_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.q_norm.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.q_proj.bias": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.q_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.v_proj.bias": "model-00014-of-00092.safetensors",
+ "model.layers.13.self_attn.v_proj.weight": "model-00014-of-00092.safetensors",
+ "model.layers.14.input_layernorm.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.0.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.0.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.0.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.1.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.1.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.1.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.10.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.10.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.10.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.100.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.100.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.100.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.101.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.101.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.101.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.102.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.102.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.102.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.103.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.103.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.103.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.104.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.104.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.104.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.105.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.105.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.105.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.106.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.106.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.106.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.107.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.107.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.107.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.108.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.108.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.108.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.109.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.109.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.109.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.11.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.11.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.11.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.110.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.110.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.110.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.111.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.111.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.111.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.112.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.112.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.112.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.113.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.113.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.113.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.114.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.114.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.114.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.115.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.115.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.115.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.116.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.116.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.116.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.117.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.117.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.117.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.118.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.118.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.118.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.119.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.119.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.119.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.12.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.12.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.12.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.120.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.120.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.120.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.121.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.121.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.121.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.122.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.122.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.122.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.123.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.123.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.123.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.124.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.124.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.124.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.125.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.125.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.125.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.126.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.126.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.126.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.127.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.127.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.127.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.128.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.128.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.128.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.129.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.129.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.129.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.13.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.13.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.13.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.130.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.130.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.130.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.131.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.131.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.131.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.132.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.132.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.132.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.133.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.133.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.133.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.134.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.134.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.134.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.135.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.135.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.135.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.136.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.136.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.136.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.137.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.137.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.137.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.138.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.138.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.138.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.139.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.139.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.139.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.14.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.14.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.14.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.140.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.140.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.140.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.141.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.141.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.141.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.142.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.142.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.142.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.143.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.143.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.143.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.144.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.144.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.144.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.145.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.145.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.145.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.146.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.146.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.146.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.147.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.147.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.147.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.148.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.148.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.148.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.149.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.149.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.149.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.15.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.15.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.15.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.150.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.150.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.150.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.151.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.151.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.151.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.152.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.152.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.152.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.153.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.153.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.153.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.154.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.154.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.154.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.155.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.155.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.155.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.156.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.156.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.156.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.157.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.157.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.157.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.158.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.158.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.158.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.159.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.159.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.159.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.16.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.16.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.16.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.17.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.17.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.17.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.18.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.18.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.18.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.19.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.19.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.19.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.2.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.2.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.2.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.20.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.20.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.20.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.21.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.21.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.21.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.22.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.22.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.22.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.23.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.23.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.23.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.24.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.24.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.24.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.25.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.25.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.25.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.26.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.26.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.26.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.27.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.27.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.27.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.28.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.28.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.28.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.29.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.29.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.29.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.3.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.3.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.3.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.30.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.30.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.30.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.31.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.31.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.31.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.32.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.32.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.32.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.33.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.33.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.33.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.34.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.34.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.34.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.35.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.35.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.35.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.36.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.36.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.36.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.37.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.37.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.37.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.38.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.38.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.38.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.39.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.39.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.39.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.4.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.4.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.4.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.40.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.40.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.40.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.41.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.41.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.41.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.42.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.42.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.42.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.43.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.43.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.43.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.44.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.44.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.44.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.45.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.45.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.45.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.46.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.46.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.46.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.47.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.47.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.47.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.48.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.48.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.48.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.49.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.49.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.49.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.5.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.5.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.5.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.50.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.50.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.50.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.51.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.51.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.51.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.52.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.52.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.52.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.53.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.53.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.53.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.54.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.54.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.54.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.55.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.55.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.55.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.56.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.56.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.56.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.57.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.57.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.57.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.58.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.58.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.58.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.59.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.59.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.59.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.6.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.6.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.6.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.60.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.60.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.60.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.61.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.61.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.61.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.62.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.62.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.62.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.63.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.63.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.63.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.64.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.64.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.64.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.65.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.65.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.65.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.66.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.66.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.66.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.67.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.67.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.67.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.68.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.68.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.68.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.69.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.69.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.69.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.7.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.7.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.7.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.70.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.70.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.70.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.71.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.71.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.71.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.72.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.72.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.72.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.73.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.73.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.73.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.74.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.74.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.74.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.75.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.75.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.75.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.76.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.76.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.76.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.77.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.77.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.77.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.78.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.78.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.78.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.79.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.79.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.79.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.8.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.8.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.8.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.80.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.80.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.80.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.81.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.81.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.81.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.82.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.82.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.82.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.83.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.83.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.83.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.84.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.84.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.84.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.85.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.85.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.85.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.86.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.86.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.86.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.87.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.87.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.87.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.88.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.88.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.88.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.89.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.89.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.89.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.9.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.9.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.9.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.90.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.90.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.90.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.91.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.91.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.91.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.92.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.92.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.92.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.93.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.93.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.93.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.94.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.94.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.94.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.95.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.95.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.95.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.96.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.96.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.96.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.97.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.97.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.97.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.98.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.98.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.98.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.99.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.99.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.experts.99.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.gate.e_score_correction_bias": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.gate.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.shared_experts.down_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.shared_experts.gate_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.mlp.shared_experts.up_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.post_attention_layernorm.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.k_norm.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.k_proj.bias": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.k_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.o_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.q_norm.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.q_proj.bias": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.q_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.v_proj.bias": "model-00015-of-00092.safetensors",
+ "model.layers.14.self_attn.v_proj.weight": "model-00015-of-00092.safetensors",
+ "model.layers.15.input_layernorm.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.0.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.0.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.0.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.1.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.1.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.1.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.10.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.10.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.10.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.100.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.100.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.100.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.101.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.101.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.101.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.102.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.102.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.102.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.103.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.103.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.103.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.104.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.104.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.104.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.105.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.105.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.105.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.106.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.106.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.106.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.107.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.107.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.107.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.108.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.108.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.108.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.109.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.109.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.109.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.11.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.11.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.11.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.110.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.110.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.110.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.111.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.111.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.111.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.112.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.112.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.112.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.113.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.113.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.113.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.114.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.114.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.114.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.115.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.115.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.115.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.116.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.116.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.116.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.117.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.117.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.117.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.118.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.118.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.118.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.119.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.119.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.119.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.12.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.12.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.12.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.120.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.120.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.120.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.121.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.121.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.121.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.122.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.122.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.122.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.123.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.123.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.123.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.124.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.124.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.124.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.125.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.125.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.125.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.126.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.126.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.126.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.127.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.127.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.127.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.128.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.128.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.128.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.129.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.129.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.129.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.13.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.13.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.13.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.130.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.130.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.130.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.131.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.131.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.131.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.132.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.132.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.132.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.133.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.133.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.133.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.134.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.134.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.134.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.135.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.135.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.135.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.136.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.136.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.136.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.137.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.137.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.137.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.138.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.138.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.138.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.139.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.139.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.139.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.14.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.14.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.14.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.140.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.140.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.140.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.141.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.141.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.141.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.142.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.142.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.142.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.143.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.143.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.143.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.144.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.144.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.144.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.145.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.145.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.145.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.146.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.146.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.146.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.147.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.147.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.147.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.148.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.148.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.148.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.149.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.149.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.149.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.15.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.15.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.15.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.150.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.150.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.150.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.151.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.151.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.151.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.152.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.152.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.152.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.153.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.153.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.153.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.154.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.154.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.154.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.155.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.155.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.155.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.156.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.156.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.156.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.157.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.157.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.157.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.158.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.158.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.158.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.159.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.159.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.159.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.16.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.16.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.16.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.17.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.17.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.17.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.18.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.18.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.18.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.19.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.19.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.19.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.2.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.2.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.2.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.20.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.20.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.20.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.21.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.21.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.21.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.22.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.22.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.22.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.23.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.23.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.23.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.24.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.24.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.24.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.25.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.25.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.25.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.26.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.26.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.26.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.27.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.27.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.27.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.28.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.28.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.28.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.29.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.29.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.29.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.3.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.3.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.3.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.30.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.30.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.30.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.31.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.31.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.31.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.32.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.32.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.32.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.33.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.33.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.33.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.34.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.34.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.34.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.35.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.35.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.35.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.36.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.36.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.36.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.37.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.37.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.37.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.38.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.38.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.38.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.39.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.39.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.39.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.4.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.4.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.4.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.40.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.40.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.40.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.41.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.41.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.41.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.42.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.42.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.42.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.43.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.43.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.43.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.44.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.44.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.44.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.45.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.45.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.45.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.46.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.46.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.46.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.47.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.47.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.47.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.48.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.48.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.48.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.49.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.49.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.49.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.5.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.5.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.5.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.50.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.50.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.50.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.51.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.51.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.51.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.52.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.52.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.52.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.53.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.53.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.53.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.54.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.54.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.54.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.55.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.55.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.55.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.56.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.56.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.56.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.57.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.57.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.57.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.58.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.58.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.58.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.59.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.59.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.59.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.6.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.6.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.6.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.60.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.60.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.60.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.61.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.61.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.61.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.62.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.62.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.62.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.63.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.63.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.63.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.64.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.64.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.64.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.65.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.65.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.65.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.66.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.66.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.66.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.67.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.67.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.67.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.68.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.68.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.68.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.69.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.69.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.69.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.7.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.7.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.7.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.70.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.70.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.70.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.71.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.71.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.71.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.72.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.72.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.72.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.73.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.73.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.73.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.74.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.74.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.74.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.75.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.75.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.75.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.76.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.76.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.76.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.77.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.77.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.77.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.78.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.78.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.78.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.79.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.79.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.79.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.8.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.8.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.8.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.80.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.80.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.80.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.81.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.81.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.81.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.82.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.82.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.82.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.83.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.83.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.83.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.84.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.84.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.84.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.85.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.85.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.85.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.86.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.86.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.86.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.87.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.87.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.87.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.88.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.88.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.88.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.89.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.89.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.89.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.9.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.9.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.9.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.90.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.90.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.90.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.91.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.91.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.91.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.92.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.92.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.92.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.93.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.93.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.93.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.94.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.94.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.94.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.95.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.95.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.95.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.96.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.96.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.96.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.97.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.97.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.97.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.98.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.98.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.98.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.99.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.99.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.experts.99.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.gate.e_score_correction_bias": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.gate.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.shared_experts.down_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.shared_experts.gate_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.mlp.shared_experts.up_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.post_attention_layernorm.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.k_norm.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.k_proj.bias": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.k_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.o_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.q_norm.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.q_proj.bias": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.q_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.v_proj.bias": "model-00016-of-00092.safetensors",
+ "model.layers.15.self_attn.v_proj.weight": "model-00016-of-00092.safetensors",
+ "model.layers.16.input_layernorm.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.0.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.0.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.0.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.1.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.1.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.1.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.10.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.10.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.10.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.100.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.100.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.100.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.101.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.101.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.101.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.102.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.102.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.102.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.103.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.103.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.103.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.104.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.104.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.104.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.105.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.105.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.105.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.106.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.106.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.106.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.107.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.107.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.107.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.108.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.108.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.108.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.109.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.109.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.109.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.11.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.11.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.11.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.110.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.110.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.110.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.111.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.111.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.111.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.112.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.112.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.112.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.113.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.113.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.113.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.114.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.114.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.114.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.115.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.115.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.115.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.116.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.116.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.116.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.117.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.117.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.117.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.118.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.118.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.118.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.119.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.119.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.119.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.12.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.12.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.12.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.120.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.120.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.120.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.121.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.121.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.121.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.122.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.122.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.122.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.123.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.123.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.123.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.124.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.124.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.124.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.125.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.125.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.125.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.126.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.126.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.126.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.127.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.127.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.127.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.128.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.128.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.128.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.129.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.129.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.129.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.13.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.13.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.13.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.130.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.130.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.130.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.131.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.131.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.131.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.132.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.132.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.132.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.133.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.133.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.133.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.134.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.134.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.134.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.135.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.135.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.135.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.136.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.136.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.136.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.137.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.137.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.137.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.138.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.138.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.138.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.139.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.139.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.139.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.14.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.14.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.14.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.140.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.140.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.140.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.141.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.141.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.141.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.142.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.142.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.142.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.143.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.143.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.143.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.144.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.144.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.144.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.145.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.145.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.145.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.146.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.146.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.146.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.147.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.147.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.147.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.148.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.148.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.148.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.149.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.149.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.149.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.15.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.15.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.15.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.150.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.150.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.150.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.151.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.151.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.151.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.152.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.152.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.152.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.153.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.153.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.153.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.154.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.154.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.154.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.155.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.155.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.155.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.156.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.156.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.156.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.157.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.157.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.157.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.158.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.158.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.158.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.159.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.159.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.159.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.16.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.16.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.16.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.17.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.17.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.17.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.18.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.18.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.18.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.19.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.19.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.19.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.2.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.2.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.2.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.20.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.20.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.20.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.21.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.21.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.21.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.22.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.22.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.22.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.23.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.23.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.23.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.24.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.24.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.24.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.25.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.25.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.25.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.26.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.26.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.26.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.27.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.27.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.27.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.28.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.28.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.28.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.29.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.29.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.29.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.3.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.3.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.3.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.30.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.30.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.30.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.31.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.31.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.31.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.32.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.32.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.32.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.33.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.33.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.33.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.34.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.34.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.34.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.35.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.35.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.35.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.36.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.36.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.36.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.37.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.37.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.37.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.38.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.38.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.38.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.39.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.39.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.39.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.4.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.4.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.4.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.40.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.40.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.40.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.41.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.41.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.41.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.42.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.42.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.42.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.43.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.43.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.43.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.44.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.44.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.44.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.45.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.45.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.45.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.46.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.46.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.46.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.47.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.47.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.47.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.48.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.48.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.48.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.49.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.49.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.49.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.5.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.5.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.5.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.50.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.50.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.50.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.51.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.51.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.51.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.52.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.52.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.52.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.53.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.53.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.53.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.54.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.54.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.54.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.55.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.55.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.55.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.56.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.56.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.56.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.57.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.57.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.57.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.58.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.58.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.58.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.59.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.59.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.59.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.6.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.6.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.6.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.60.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.60.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.60.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.61.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.61.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.61.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.62.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.62.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.62.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.63.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.63.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.63.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.64.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.64.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.64.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.65.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.65.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.65.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.66.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.66.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.66.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.67.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.67.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.67.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.68.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.68.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.68.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.69.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.69.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.69.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.7.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.7.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.7.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.70.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.70.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.70.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.71.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.71.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.71.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.72.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.72.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.72.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.73.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.73.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.73.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.74.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.74.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.74.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.75.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.75.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.75.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.76.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.76.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.76.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.77.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.77.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.77.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.78.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.78.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.78.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.79.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.79.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.79.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.8.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.8.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.8.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.80.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.80.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.80.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.81.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.81.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.81.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.82.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.82.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.82.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.83.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.83.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.83.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.84.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.84.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.84.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.85.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.85.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.85.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.86.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.86.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.86.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.87.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.87.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.87.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.88.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.88.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.88.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.89.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.89.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.89.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.9.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.9.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.9.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.90.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.90.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.90.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.91.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.91.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.91.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.92.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.92.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.92.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.93.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.93.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.93.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.94.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.94.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.94.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.95.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.95.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.95.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.96.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.96.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.96.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.97.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.97.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.97.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.98.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.98.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.98.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.99.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.99.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.experts.99.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.gate.e_score_correction_bias": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.gate.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.shared_experts.down_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.shared_experts.gate_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.mlp.shared_experts.up_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.post_attention_layernorm.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.k_norm.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.k_proj.bias": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.k_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.o_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.q_norm.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.q_proj.bias": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.q_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.v_proj.bias": "model-00017-of-00092.safetensors",
+ "model.layers.16.self_attn.v_proj.weight": "model-00017-of-00092.safetensors",
+ "model.layers.17.input_layernorm.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.0.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.0.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.0.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.1.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.1.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.1.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.10.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.10.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.10.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.100.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.100.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.100.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.101.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.101.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.101.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.102.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.102.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.102.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.103.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.103.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.103.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.104.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.104.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.104.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.105.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.105.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.105.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.106.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.106.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.106.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.107.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.107.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.107.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.108.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.108.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.108.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.109.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.109.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.109.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.11.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.11.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.11.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.110.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.110.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.110.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.111.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.111.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.111.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.112.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.112.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.112.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.113.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.113.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.113.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.114.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.114.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.114.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.115.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.115.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.115.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.116.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.116.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.116.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.117.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.117.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.117.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.118.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.118.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.118.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.119.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.119.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.119.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.12.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.12.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.12.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.120.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.120.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.120.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.121.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.121.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.121.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.122.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.122.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.122.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.123.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.123.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.123.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.124.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.124.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.124.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.125.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.125.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.125.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.126.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.126.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.126.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.127.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.127.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.127.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.128.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.128.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.128.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.129.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.129.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.129.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.13.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.13.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.13.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.130.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.130.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.130.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.131.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.131.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.131.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.132.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.132.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.132.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.133.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.133.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.133.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.134.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.134.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.134.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.135.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.135.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.135.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.136.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.136.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.136.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.137.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.137.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.137.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.138.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.138.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.138.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.139.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.139.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.139.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.14.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.14.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.14.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.140.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.140.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.140.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.141.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.141.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.141.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.142.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.142.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.142.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.143.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.143.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.143.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.144.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.144.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.144.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.145.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.145.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.145.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.146.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.146.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.146.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.147.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.147.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.147.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.148.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.148.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.148.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.149.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.149.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.149.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.15.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.15.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.15.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.150.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.150.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.150.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.151.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.151.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.151.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.152.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.152.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.152.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.153.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.153.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.153.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.154.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.154.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.154.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.155.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.155.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.155.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.156.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.156.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.156.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.157.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.157.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.157.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.158.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.158.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.158.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.159.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.159.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.159.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.16.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.16.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.16.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.17.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.17.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.17.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.18.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.18.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.18.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.19.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.19.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.19.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.2.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.2.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.2.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.20.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.20.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.20.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.21.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.21.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.21.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.22.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.22.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.22.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.23.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.23.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.23.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.24.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.24.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.24.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.25.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.25.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.25.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.26.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.26.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.26.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.27.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.27.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.27.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.28.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.28.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.28.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.29.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.29.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.29.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.3.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.3.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.3.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.30.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.30.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.30.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.31.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.31.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.31.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.32.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.32.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.32.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.33.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.33.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.33.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.34.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.34.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.34.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.35.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.35.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.35.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.36.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.36.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.36.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.37.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.37.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.37.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.38.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.38.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.38.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.39.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.39.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.39.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.4.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.4.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.4.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.40.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.40.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.40.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.41.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.41.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.41.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.42.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.42.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.42.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.43.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.43.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.43.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.44.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.44.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.44.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.45.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.45.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.45.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.46.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.46.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.46.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.47.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.47.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.47.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.48.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.48.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.48.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.49.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.49.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.49.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.5.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.5.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.5.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.50.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.50.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.50.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.51.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.51.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.51.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.52.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.52.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.52.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.53.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.53.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.53.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.54.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.54.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.54.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.55.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.55.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.55.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.56.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.56.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.56.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.57.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.57.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.57.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.58.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.58.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.58.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.59.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.59.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.59.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.6.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.6.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.6.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.60.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.60.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.60.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.61.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.61.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.61.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.62.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.62.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.62.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.63.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.63.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.63.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.64.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.64.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.64.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.65.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.65.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.65.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.66.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.66.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.66.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.67.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.67.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.67.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.68.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.68.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.68.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.69.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.69.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.69.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.7.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.7.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.7.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.70.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.70.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.70.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.71.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.71.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.71.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.72.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.72.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.72.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.73.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.73.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.73.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.74.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.74.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.74.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.75.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.75.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.75.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.76.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.76.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.76.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.77.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.77.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.77.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.78.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.78.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.78.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.79.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.79.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.79.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.8.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.8.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.8.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.80.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.80.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.80.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.81.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.81.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.81.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.82.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.82.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.82.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.83.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.83.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.83.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.84.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.84.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.84.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.85.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.85.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.85.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.86.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.86.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.86.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.87.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.87.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.87.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.88.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.88.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.88.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.89.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.89.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.89.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.9.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.9.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.9.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.90.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.90.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.90.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.91.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.91.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.91.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.92.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.92.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.92.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.93.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.93.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.93.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.94.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.94.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.94.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.95.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.95.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.95.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.96.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.96.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.96.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.97.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.97.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.97.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.98.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.98.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.98.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.99.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.99.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.experts.99.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.gate.e_score_correction_bias": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.gate.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.shared_experts.down_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.shared_experts.gate_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.mlp.shared_experts.up_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.post_attention_layernorm.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.k_norm.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.k_proj.bias": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.k_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.o_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.q_norm.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.q_proj.bias": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.q_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.v_proj.bias": "model-00018-of-00092.safetensors",
+ "model.layers.17.self_attn.v_proj.weight": "model-00018-of-00092.safetensors",
+ "model.layers.18.input_layernorm.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.0.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.0.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.0.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.1.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.1.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.1.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.10.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.10.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.10.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.100.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.100.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.100.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.101.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.101.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.101.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.102.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.102.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.102.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.103.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.103.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.103.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.104.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.104.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.104.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.105.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.105.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.105.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.106.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.106.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.106.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.107.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.107.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.107.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.108.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.108.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.108.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.109.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.109.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.109.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.11.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.11.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.11.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.110.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.110.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.110.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.111.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.111.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.111.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.112.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.112.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.112.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.113.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.113.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.113.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.114.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.114.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.114.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.115.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.115.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.115.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.116.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.116.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.116.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.117.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.117.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.117.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.118.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.118.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.118.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.119.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.119.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.119.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.12.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.12.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.12.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.120.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.120.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.120.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.121.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.121.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.121.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.122.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.122.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.122.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.123.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.123.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.123.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.124.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.124.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.124.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.125.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.125.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.125.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.126.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.126.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.126.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.127.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.127.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.127.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.128.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.128.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.128.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.129.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.129.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.129.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.13.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.13.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.13.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.130.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.130.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.130.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.131.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.131.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.131.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.132.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.132.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.132.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.133.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.133.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.133.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.134.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.134.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.134.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.135.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.135.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.135.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.136.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.136.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.136.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.137.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.137.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.137.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.138.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.138.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.138.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.139.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.139.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.139.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.14.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.14.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.14.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.140.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.140.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.140.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.141.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.141.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.141.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.142.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.142.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.142.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.143.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.143.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.143.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.144.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.144.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.144.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.145.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.145.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.145.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.146.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.146.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.146.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.147.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.147.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.147.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.148.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.148.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.148.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.149.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.149.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.149.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.15.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.15.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.15.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.150.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.150.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.150.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.151.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.151.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.151.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.152.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.152.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.152.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.153.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.153.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.153.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.154.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.154.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.154.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.155.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.155.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.155.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.156.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.156.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.156.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.157.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.157.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.157.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.158.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.158.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.158.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.159.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.159.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.159.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.16.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.16.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.16.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.17.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.17.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.17.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.18.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.18.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.18.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.19.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.19.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.19.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.2.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.2.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.2.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.20.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.20.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.20.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.21.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.21.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.21.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.22.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.22.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.22.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.23.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.23.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.23.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.24.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.24.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.24.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.25.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.25.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.25.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.26.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.26.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.26.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.27.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.27.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.27.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.28.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.28.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.28.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.29.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.29.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.29.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.3.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.3.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.3.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.30.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.30.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.30.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.31.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.31.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.31.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.32.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.32.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.32.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.33.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.33.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.33.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.34.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.34.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.34.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.35.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.35.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.35.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.36.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.36.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.36.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.37.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.37.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.37.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.38.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.38.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.38.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.39.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.39.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.39.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.4.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.4.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.4.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.40.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.40.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.40.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.41.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.41.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.41.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.42.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.42.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.42.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.43.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.43.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.43.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.44.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.44.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.44.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.45.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.45.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.45.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.46.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.46.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.46.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.47.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.47.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.47.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.48.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.48.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.48.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.49.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.49.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.49.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.5.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.5.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.5.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.50.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.50.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.50.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.51.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.51.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.51.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.52.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.52.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.52.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.53.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.53.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.53.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.54.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.54.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.54.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.55.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.55.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.55.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.56.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.56.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.56.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.57.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.57.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.57.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.58.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.58.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.58.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.59.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.59.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.59.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.6.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.6.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.6.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.60.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.60.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.60.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.61.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.61.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.61.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.62.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.62.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.62.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.63.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.63.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.63.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.64.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.64.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.64.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.65.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.65.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.65.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.66.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.66.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.66.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.67.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.67.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.67.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.68.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.68.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.68.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.69.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.69.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.69.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.7.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.7.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.7.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.70.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.70.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.70.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.71.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.71.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.71.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.72.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.72.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.72.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.73.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.73.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.73.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.74.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.74.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.74.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.75.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.75.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.75.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.76.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.76.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.76.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.77.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.77.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.77.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.78.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.78.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.78.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.79.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.79.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.79.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.8.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.8.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.8.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.80.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.80.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.80.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.81.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.81.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.81.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.82.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.82.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.82.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.83.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.83.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.83.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.84.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.84.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.84.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.85.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.85.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.85.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.86.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.86.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.86.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.87.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.87.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.87.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.88.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.88.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.88.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.89.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.89.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.89.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.9.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.9.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.9.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.90.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.90.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.90.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.91.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.91.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.91.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.92.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.92.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.92.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.93.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.93.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.93.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.94.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.94.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.94.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.95.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.95.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.95.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.96.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.96.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.96.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.97.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.97.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.97.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.98.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.98.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.98.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.99.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.99.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.experts.99.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.gate.e_score_correction_bias": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.gate.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.shared_experts.down_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.shared_experts.gate_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.mlp.shared_experts.up_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.post_attention_layernorm.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.k_norm.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.k_proj.bias": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.k_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.o_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.q_norm.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.q_proj.bias": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.q_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.v_proj.bias": "model-00019-of-00092.safetensors",
+ "model.layers.18.self_attn.v_proj.weight": "model-00019-of-00092.safetensors",
+ "model.layers.19.input_layernorm.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.0.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.0.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.0.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.1.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.1.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.1.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.10.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.10.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.10.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.100.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.100.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.100.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.101.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.101.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.101.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.102.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.102.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.102.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.103.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.103.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.103.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.104.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.104.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.104.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.105.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.105.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.105.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.106.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.106.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.106.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.107.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.107.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.107.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.108.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.108.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.108.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.109.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.109.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.109.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.11.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.11.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.11.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.110.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.110.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.110.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.111.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.111.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.111.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.112.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.112.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.112.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.113.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.113.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.113.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.114.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.114.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.114.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.115.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.115.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.115.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.116.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.116.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.116.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.117.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.117.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.117.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.118.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.118.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.118.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.119.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.119.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.119.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.12.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.12.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.12.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.120.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.120.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.120.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.121.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.121.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.121.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.122.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.122.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.122.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.123.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.123.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.123.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.124.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.124.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.124.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.125.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.125.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.125.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.126.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.126.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.126.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.127.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.127.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.127.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.128.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.128.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.128.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.129.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.129.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.129.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.13.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.13.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.13.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.130.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.130.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.130.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.131.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.131.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.131.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.132.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.132.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.132.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.133.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.133.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.133.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.134.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.134.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.134.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.135.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.135.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.135.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.136.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.136.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.136.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.137.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.137.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.137.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.138.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.138.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.138.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.139.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.139.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.139.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.14.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.14.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.14.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.140.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.140.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.140.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.141.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.141.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.141.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.142.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.142.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.142.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.143.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.143.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.143.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.144.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.144.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.144.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.145.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.145.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.145.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.146.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.146.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.146.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.147.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.147.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.147.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.148.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.148.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.148.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.149.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.149.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.149.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.15.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.15.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.15.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.150.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.150.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.150.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.151.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.151.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.151.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.152.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.152.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.152.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.153.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.153.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.153.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.154.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.154.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.154.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.155.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.155.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.155.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.156.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.156.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.156.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.157.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.157.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.157.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.158.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.158.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.158.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.159.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.159.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.159.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.16.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.16.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.16.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.17.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.17.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.17.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.18.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.18.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.18.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.19.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.19.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.19.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.2.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.2.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.2.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.20.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.20.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.20.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.21.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.21.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.21.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.22.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.22.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.22.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.23.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.23.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.23.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.24.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.24.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.24.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.25.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.25.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.25.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.26.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.26.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.26.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.27.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.27.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.27.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.28.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.28.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.28.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.29.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.29.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.29.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.3.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.3.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.3.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.30.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.30.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.30.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.31.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.31.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.31.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.32.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.32.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.32.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.33.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.33.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.33.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.34.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.34.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.34.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.35.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.35.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.35.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.36.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.36.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.36.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.37.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.37.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.37.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.38.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.38.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.38.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.39.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.39.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.39.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.4.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.4.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.4.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.40.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.40.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.40.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.41.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.41.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.41.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.42.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.42.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.42.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.43.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.43.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.43.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.44.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.44.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.44.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.45.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.45.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.45.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.46.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.46.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.46.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.47.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.47.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.47.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.48.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.48.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.48.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.49.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.49.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.49.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.5.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.5.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.5.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.50.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.50.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.50.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.51.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.51.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.51.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.52.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.52.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.52.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.53.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.53.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.53.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.54.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.54.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.54.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.55.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.55.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.55.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.56.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.56.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.56.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.57.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.57.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.57.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.58.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.58.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.58.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.59.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.59.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.59.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.6.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.6.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.6.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.60.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.60.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.60.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.61.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.61.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.61.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.62.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.62.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.62.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.63.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.63.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.63.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.64.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.64.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.64.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.65.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.65.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.65.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.66.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.66.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.66.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.67.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.67.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.67.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.68.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.68.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.68.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.69.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.69.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.69.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.7.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.7.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.7.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.70.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.70.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.70.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.71.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.71.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.71.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.72.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.72.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.72.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.73.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.73.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.73.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.74.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.74.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.74.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.75.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.75.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.75.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.76.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.76.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.76.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.77.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.77.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.77.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.78.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.78.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.78.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.79.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.79.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.79.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.8.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.8.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.8.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.80.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.80.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.80.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.81.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.81.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.81.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.82.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.82.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.82.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.83.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.83.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.83.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.84.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.84.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.84.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.85.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.85.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.85.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.86.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.86.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.86.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.87.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.87.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.87.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.88.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.88.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.88.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.89.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.89.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.89.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.9.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.9.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.9.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.90.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.90.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.90.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.91.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.91.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.91.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.92.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.92.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.92.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.93.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.93.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.93.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.94.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.94.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.94.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.95.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.95.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.95.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.96.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.96.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.96.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.97.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.97.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.97.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.98.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.98.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.98.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.99.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.99.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.experts.99.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.gate.e_score_correction_bias": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.gate.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.shared_experts.down_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.shared_experts.gate_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.mlp.shared_experts.up_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.post_attention_layernorm.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.k_norm.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.k_proj.bias": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.k_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.o_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.q_norm.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.q_proj.bias": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.q_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.v_proj.bias": "model-00020-of-00092.safetensors",
+ "model.layers.19.self_attn.v_proj.weight": "model-00020-of-00092.safetensors",
+ "model.layers.20.input_layernorm.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.0.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.0.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.0.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.1.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.1.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.1.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.10.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.10.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.10.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.100.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.100.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.100.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.101.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.101.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.101.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.102.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.102.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.102.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.103.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.103.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.103.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.104.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.104.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.104.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.105.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.105.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.105.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.106.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.106.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.106.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.107.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.107.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.107.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.108.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.108.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.108.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.109.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.109.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.109.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.11.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.11.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.11.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.110.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.110.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.110.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.111.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.111.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.111.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.112.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.112.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.112.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.113.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.113.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.113.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.114.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.114.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.114.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.115.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.115.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.115.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.116.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.116.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.116.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.117.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.117.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.117.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.118.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.118.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.118.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.119.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.119.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.119.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.12.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.12.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.12.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.120.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.120.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.120.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.121.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.121.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.121.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.122.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.122.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.122.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.123.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.123.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.123.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.124.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.124.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.124.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.125.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.125.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.125.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.126.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.126.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.126.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.127.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.127.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.127.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.128.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.128.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.128.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.129.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.129.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.129.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.13.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.13.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.13.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.130.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.130.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.130.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.131.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.131.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.131.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.132.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.132.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.132.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.133.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.133.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.133.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.134.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.134.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.134.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.135.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.135.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.135.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.136.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.136.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.136.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.137.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.137.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.137.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.138.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.138.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.138.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.139.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.139.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.139.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.14.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.14.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.14.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.140.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.140.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.140.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.141.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.141.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.141.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.142.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.142.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.142.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.143.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.143.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.143.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.144.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.144.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.144.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.145.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.145.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.145.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.146.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.146.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.146.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.147.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.147.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.147.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.148.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.148.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.148.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.149.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.149.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.149.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.15.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.15.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.15.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.150.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.150.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.150.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.151.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.151.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.151.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.152.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.152.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.152.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.153.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.153.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.153.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.154.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.154.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.154.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.155.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.155.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.155.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.156.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.156.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.156.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.157.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.157.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.157.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.158.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.158.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.158.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.159.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.159.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.159.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.16.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.16.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.16.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.17.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.17.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.17.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.18.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.18.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.18.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.19.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.19.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.19.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.2.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.2.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.2.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.20.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.20.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.20.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.21.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.21.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.21.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.22.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.22.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.22.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.23.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.23.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.23.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.24.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.24.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.24.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.25.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.25.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.25.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.26.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.26.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.26.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.27.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.27.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.27.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.28.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.28.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.28.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.29.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.29.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.29.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.3.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.3.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.3.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.30.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.30.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.30.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.31.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.31.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.31.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.32.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.32.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.32.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.33.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.33.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.33.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.34.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.34.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.34.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.35.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.35.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.35.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.36.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.36.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.36.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.37.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.37.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.37.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.38.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.38.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.38.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.39.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.39.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.39.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.4.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.4.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.4.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.40.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.40.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.40.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.41.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.41.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.41.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.42.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.42.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.42.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.43.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.43.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.43.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.44.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.44.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.44.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.45.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.45.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.45.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.46.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.46.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.46.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.47.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.47.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.47.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.48.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.48.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.48.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.49.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.49.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.49.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.5.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.5.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.5.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.50.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.50.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.50.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.51.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.51.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.51.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.52.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.52.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.52.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.53.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.53.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.53.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.54.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.54.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.54.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.55.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.55.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.55.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.56.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.56.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.56.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.57.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.57.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.57.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.58.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.58.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.58.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.59.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.59.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.59.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.6.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.6.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.6.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.60.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.60.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.60.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.61.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.61.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.61.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.62.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.62.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.62.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.63.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.63.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.63.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.64.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.64.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.64.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.65.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.65.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.65.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.66.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.66.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.66.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.67.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.67.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.67.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.68.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.68.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.68.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.69.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.69.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.69.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.7.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.7.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.7.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.70.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.70.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.70.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.71.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.71.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.71.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.72.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.72.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.72.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.73.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.73.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.73.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.74.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.74.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.74.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.75.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.75.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.75.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.76.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.76.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.76.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.77.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.77.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.77.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.78.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.78.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.78.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.79.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.79.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.79.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.8.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.8.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.8.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.80.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.80.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.80.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.81.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.81.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.81.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.82.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.82.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.82.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.83.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.83.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.83.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.84.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.84.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.84.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.85.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.85.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.85.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.86.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.86.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.86.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.87.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.87.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.87.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.88.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.88.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.88.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.89.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.89.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.89.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.9.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.9.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.9.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.90.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.90.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.90.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.91.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.91.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.91.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.92.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.92.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.92.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.93.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.93.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.93.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.94.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.94.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.94.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.95.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.95.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.95.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.96.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.96.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.96.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.97.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.97.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.97.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.98.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.98.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.98.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.99.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.99.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.experts.99.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.gate.e_score_correction_bias": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.gate.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.shared_experts.down_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.shared_experts.gate_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.mlp.shared_experts.up_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.post_attention_layernorm.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.k_norm.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.k_proj.bias": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.k_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.o_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.q_norm.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.q_proj.bias": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.q_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.v_proj.bias": "model-00021-of-00092.safetensors",
+ "model.layers.20.self_attn.v_proj.weight": "model-00021-of-00092.safetensors",
+ "model.layers.21.input_layernorm.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.0.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.0.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.0.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.1.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.1.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.1.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.10.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.10.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.10.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.100.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.100.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.100.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.101.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.101.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.101.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.102.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.102.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.102.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.103.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.103.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.103.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.104.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.104.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.104.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.105.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.105.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.105.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.106.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.106.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.106.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.107.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.107.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.107.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.108.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.108.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.108.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.109.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.109.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.109.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.11.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.11.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.11.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.110.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.110.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.110.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.111.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.111.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.111.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.112.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.112.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.112.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.113.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.113.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.113.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.114.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.114.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.114.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.115.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.115.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.115.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.116.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.116.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.116.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.117.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.117.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.117.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.118.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.118.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.118.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.119.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.119.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.119.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.12.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.12.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.12.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.120.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.120.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.120.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.121.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.121.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.121.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.122.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.122.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.122.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.123.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.123.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.123.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.124.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.124.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.124.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.125.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.125.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.125.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.126.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.126.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.126.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.127.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.127.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.127.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.128.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.128.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.128.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.129.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.129.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.129.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.13.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.13.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.13.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.130.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.130.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.130.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.131.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.131.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.131.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.132.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.132.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.132.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.133.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.133.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.133.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.134.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.134.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.134.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.135.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.135.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.135.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.136.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.136.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.136.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.137.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.137.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.137.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.138.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.138.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.138.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.139.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.139.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.139.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.14.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.14.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.14.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.140.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.140.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.140.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.141.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.141.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.141.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.142.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.142.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.142.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.143.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.143.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.143.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.144.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.144.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.144.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.145.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.145.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.145.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.146.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.146.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.146.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.147.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.147.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.147.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.148.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.148.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.148.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.149.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.149.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.149.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.15.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.15.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.15.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.150.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.150.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.150.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.151.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.151.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.151.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.152.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.152.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.152.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.153.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.153.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.153.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.154.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.154.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.154.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.155.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.155.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.155.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.156.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.156.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.156.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.157.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.157.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.157.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.158.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.158.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.158.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.159.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.159.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.159.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.16.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.16.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.16.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.17.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.17.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.17.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.18.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.18.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.18.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.19.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.19.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.19.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.2.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.2.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.2.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.20.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.20.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.20.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.21.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.21.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.21.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.22.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.22.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.22.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.23.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.23.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.23.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.24.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.24.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.24.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.25.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.25.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.25.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.26.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.26.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.26.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.27.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.27.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.27.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.28.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.28.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.28.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.29.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.29.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.29.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.3.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.3.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.3.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.30.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.30.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.30.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.31.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.31.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.31.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.32.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.32.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.32.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.33.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.33.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.33.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.34.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.34.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.34.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.35.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.35.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.35.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.36.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.36.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.36.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.37.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.37.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.37.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.38.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.38.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.38.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.39.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.39.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.39.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.4.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.4.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.4.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.40.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.40.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.40.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.41.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.41.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.41.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.42.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.42.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.42.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.43.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.43.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.43.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.44.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.44.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.44.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.45.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.45.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.45.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.46.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.46.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.46.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.47.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.47.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.47.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.48.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.48.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.48.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.49.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.49.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.49.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.5.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.5.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.5.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.50.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.50.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.50.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.51.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.51.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.51.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.52.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.52.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.52.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.53.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.53.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.53.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.54.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.54.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.54.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.55.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.55.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.55.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.56.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.56.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.56.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.57.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.57.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.57.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.58.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.58.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.58.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.59.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.59.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.59.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.6.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.6.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.6.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.60.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.60.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.60.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.61.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.61.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.61.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.62.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.62.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.62.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.63.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.63.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.63.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.64.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.64.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.64.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.65.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.65.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.65.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.66.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.66.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.66.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.67.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.67.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.67.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.68.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.68.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.68.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.69.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.69.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.69.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.7.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.7.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.7.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.70.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.70.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.70.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.71.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.71.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.71.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.72.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.72.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.72.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.73.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.73.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.73.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.74.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.74.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.74.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.75.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.75.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.75.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.76.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.76.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.76.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.77.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.77.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.77.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.78.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.78.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.78.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.79.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.79.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.79.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.8.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.8.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.8.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.80.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.80.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.80.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.81.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.81.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.81.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.82.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.82.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.82.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.83.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.83.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.83.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.84.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.84.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.84.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.85.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.85.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.85.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.86.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.86.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.86.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.87.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.87.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.87.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.88.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.88.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.88.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.89.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.89.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.89.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.9.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.9.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.9.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.90.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.90.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.90.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.91.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.91.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.91.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.92.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.92.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.92.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.93.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.93.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.93.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.94.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.94.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.94.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.95.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.95.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.95.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.96.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.96.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.96.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.97.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.97.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.97.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.98.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.98.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.98.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.99.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.99.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.experts.99.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.gate.e_score_correction_bias": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.gate.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.shared_experts.down_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.shared_experts.gate_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.mlp.shared_experts.up_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.post_attention_layernorm.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.k_norm.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.k_proj.bias": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.k_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.o_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.q_norm.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.q_proj.bias": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.q_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.v_proj.bias": "model-00022-of-00092.safetensors",
+ "model.layers.21.self_attn.v_proj.weight": "model-00022-of-00092.safetensors",
+ "model.layers.22.input_layernorm.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.0.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.0.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.0.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.1.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.1.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.1.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.10.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.10.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.10.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.100.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.100.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.100.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.101.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.101.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.101.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.102.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.102.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.102.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.103.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.103.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.103.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.104.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.104.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.104.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.105.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.105.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.105.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.106.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.106.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.106.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.107.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.107.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.107.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.108.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.108.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.108.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.109.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.109.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.109.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.11.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.11.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.11.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.110.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.110.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.110.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.111.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.111.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.111.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.112.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.112.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.112.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.113.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.113.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.113.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.114.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.114.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.114.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.115.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.115.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.115.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.116.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.116.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.116.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.117.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.117.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.117.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.118.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.118.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.118.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.119.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.119.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.119.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.12.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.12.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.12.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.120.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.120.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.120.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.121.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.121.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.121.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.122.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.122.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.122.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.123.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.123.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.123.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.124.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.124.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.124.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.125.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.125.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.125.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.126.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.126.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.126.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.127.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.127.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.127.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.128.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.128.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.128.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.129.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.129.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.129.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.13.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.13.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.13.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.130.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.130.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.130.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.131.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.131.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.131.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.132.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.132.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.132.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.133.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.133.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.133.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.134.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.134.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.134.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.135.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.135.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.135.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.136.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.136.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.136.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.137.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.137.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.137.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.138.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.138.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.138.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.139.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.139.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.139.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.14.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.14.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.14.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.140.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.140.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.140.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.141.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.141.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.141.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.142.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.142.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.142.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.143.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.143.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.143.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.144.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.144.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.144.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.145.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.145.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.145.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.146.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.146.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.146.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.147.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.147.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.147.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.148.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.148.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.148.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.149.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.149.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.149.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.15.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.15.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.15.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.150.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.150.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.150.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.151.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.151.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.151.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.152.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.152.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.152.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.153.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.153.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.153.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.154.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.154.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.154.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.155.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.155.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.155.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.156.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.156.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.156.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.157.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.157.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.157.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.158.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.158.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.158.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.159.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.159.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.159.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.16.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.16.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.16.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.17.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.17.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.17.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.18.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.18.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.18.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.19.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.19.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.19.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.2.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.2.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.2.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.20.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.20.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.20.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.21.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.21.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.21.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.22.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.22.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.22.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.23.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.23.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.23.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.24.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.24.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.24.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.25.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.25.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.25.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.26.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.26.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.26.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.27.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.27.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.27.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.28.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.28.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.28.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.29.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.29.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.29.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.3.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.3.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.3.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.30.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.30.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.30.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.31.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.31.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.31.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.32.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.32.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.32.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.33.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.33.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.33.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.34.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.34.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.34.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.35.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.35.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.35.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.36.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.36.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.36.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.37.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.37.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.37.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.38.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.38.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.38.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.39.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.39.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.39.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.4.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.4.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.4.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.40.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.40.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.40.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.41.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.41.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.41.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.42.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.42.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.42.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.43.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.43.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.43.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.44.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.44.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.44.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.45.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.45.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.45.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.46.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.46.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.46.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.47.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.47.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.47.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.48.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.48.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.48.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.49.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.49.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.49.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.5.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.5.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.5.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.50.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.50.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.50.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.51.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.51.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.51.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.52.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.52.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.52.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.53.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.53.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.53.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.54.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.54.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.54.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.55.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.55.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.55.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.56.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.56.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.56.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.57.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.57.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.57.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.58.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.58.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.58.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.59.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.59.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.59.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.6.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.6.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.6.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.60.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.60.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.60.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.61.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.61.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.61.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.62.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.62.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.62.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.63.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.63.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.63.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.64.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.64.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.64.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.65.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.65.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.65.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.66.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.66.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.66.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.67.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.67.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.67.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.68.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.68.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.68.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.69.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.69.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.69.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.7.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.7.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.7.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.70.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.70.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.70.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.71.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.71.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.71.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.72.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.72.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.72.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.73.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.73.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.73.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.74.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.74.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.74.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.75.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.75.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.75.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.76.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.76.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.76.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.77.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.77.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.77.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.78.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.78.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.78.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.79.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.79.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.79.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.8.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.8.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.8.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.80.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.80.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.80.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.81.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.81.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.81.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.82.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.82.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.82.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.83.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.83.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.83.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.84.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.84.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.84.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.85.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.85.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.85.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.86.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.86.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.86.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.87.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.87.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.87.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.88.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.88.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.88.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.89.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.89.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.89.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.9.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.9.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.9.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.90.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.90.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.90.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.91.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.91.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.91.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.92.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.92.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.92.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.93.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.93.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.93.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.94.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.94.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.94.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.95.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.95.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.95.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.96.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.96.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.96.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.97.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.97.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.97.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.98.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.98.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.98.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.99.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.99.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.experts.99.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.gate.e_score_correction_bias": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.gate.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.shared_experts.down_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.shared_experts.gate_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.mlp.shared_experts.up_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.post_attention_layernorm.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.k_norm.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.k_proj.bias": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.k_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.o_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.q_norm.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.q_proj.bias": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.q_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.v_proj.bias": "model-00023-of-00092.safetensors",
+ "model.layers.22.self_attn.v_proj.weight": "model-00023-of-00092.safetensors",
+ "model.layers.23.input_layernorm.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.0.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.0.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.0.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.1.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.1.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.1.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.10.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.10.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.10.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.100.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.100.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.100.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.101.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.101.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.101.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.102.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.102.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.102.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.103.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.103.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.103.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.104.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.104.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.104.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.105.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.105.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.105.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.106.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.106.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.106.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.107.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.107.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.107.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.108.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.108.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.108.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.109.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.109.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.109.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.11.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.11.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.11.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.110.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.110.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.110.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.111.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.111.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.111.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.112.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.112.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.112.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.113.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.113.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.113.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.114.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.114.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.114.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.115.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.115.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.115.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.116.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.116.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.116.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.117.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.117.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.117.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.118.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.118.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.118.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.119.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.119.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.119.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.12.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.12.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.12.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.120.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.120.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.120.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.121.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.121.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.121.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.122.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.122.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.122.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.123.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.123.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.123.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.124.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.124.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.124.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.125.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.125.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.125.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.126.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.126.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.126.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.127.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.127.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.127.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.128.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.128.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.128.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.129.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.129.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.129.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.13.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.13.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.13.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.130.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.130.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.130.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.131.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.131.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.131.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.132.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.132.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.132.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.133.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.133.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.133.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.134.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.134.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.134.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.135.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.135.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.135.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.136.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.136.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.136.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.137.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.137.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.137.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.138.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.138.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.138.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.139.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.139.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.139.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.14.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.14.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.14.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.140.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.140.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.140.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.141.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.141.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.141.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.142.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.142.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.142.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.143.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.143.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.143.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.144.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.144.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.144.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.145.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.145.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.145.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.146.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.146.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.146.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.147.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.147.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.147.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.148.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.148.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.148.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.149.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.149.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.149.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.15.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.15.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.15.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.150.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.150.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.150.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.151.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.151.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.151.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.152.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.152.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.152.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.153.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.153.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.153.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.154.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.154.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.154.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.155.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.155.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.155.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.156.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.156.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.156.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.157.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.157.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.157.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.158.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.158.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.158.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.159.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.159.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.159.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.16.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.16.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.16.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.17.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.17.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.17.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.18.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.18.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.18.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.19.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.19.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.19.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.2.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.2.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.2.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.20.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.20.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.20.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.21.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.21.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.21.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.22.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.22.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.22.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.23.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.23.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.23.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.24.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.24.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.24.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.25.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.25.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.25.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.26.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.26.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.26.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.27.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.27.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.27.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.28.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.28.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.28.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.29.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.29.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.29.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.3.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.3.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.3.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.30.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.30.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.30.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.31.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.31.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.31.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.32.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.32.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.32.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.33.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.33.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.33.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.34.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.34.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.34.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.35.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.35.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.35.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.36.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.36.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.36.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.37.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.37.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.37.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.38.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.38.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.38.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.39.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.39.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.39.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.4.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.4.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.4.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.40.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.40.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.40.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.41.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.41.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.41.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.42.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.42.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.42.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.43.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.43.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.43.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.44.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.44.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.44.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.45.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.45.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.45.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.46.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.46.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.46.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.47.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.47.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.47.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.48.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.48.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.48.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.49.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.49.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.49.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.5.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.5.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.5.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.50.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.50.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.50.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.51.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.51.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.51.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.52.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.52.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.52.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.53.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.53.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.53.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.54.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.54.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.54.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.55.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.55.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.55.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.56.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.56.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.56.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.57.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.57.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.57.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.58.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.58.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.58.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.59.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.59.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.59.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.6.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.6.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.6.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.60.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.60.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.60.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.61.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.61.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.61.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.62.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.62.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.62.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.63.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.63.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.63.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.64.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.64.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.64.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.65.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.65.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.65.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.66.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.66.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.66.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.67.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.67.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.67.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.68.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.68.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.68.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.69.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.69.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.69.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.7.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.7.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.7.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.70.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.70.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.70.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.71.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.71.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.71.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.72.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.72.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.72.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.73.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.73.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.73.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.74.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.74.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.74.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.75.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.75.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.75.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.76.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.76.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.76.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.77.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.77.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.77.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.78.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.78.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.78.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.79.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.79.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.79.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.8.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.8.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.8.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.80.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.80.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.80.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.81.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.81.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.81.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.82.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.82.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.82.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.83.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.83.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.83.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.84.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.84.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.84.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.85.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.85.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.85.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.86.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.86.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.86.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.87.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.87.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.87.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.88.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.88.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.88.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.89.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.89.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.89.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.9.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.9.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.9.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.90.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.90.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.90.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.91.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.91.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.91.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.92.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.92.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.92.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.93.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.93.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.93.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.94.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.94.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.94.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.95.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.95.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.95.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.96.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.96.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.96.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.97.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.97.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.97.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.98.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.98.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.98.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.99.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.99.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.experts.99.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.gate.e_score_correction_bias": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.gate.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.shared_experts.down_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.shared_experts.gate_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.mlp.shared_experts.up_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.post_attention_layernorm.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.k_norm.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.k_proj.bias": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.k_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.o_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.q_norm.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.q_proj.bias": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.q_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.v_proj.bias": "model-00024-of-00092.safetensors",
+ "model.layers.23.self_attn.v_proj.weight": "model-00024-of-00092.safetensors",
+ "model.layers.24.input_layernorm.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.0.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.0.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.0.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.1.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.1.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.1.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.10.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.10.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.10.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.100.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.100.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.100.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.101.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.101.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.101.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.102.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.102.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.102.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.103.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.103.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.103.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.104.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.104.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.104.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.105.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.105.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.105.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.106.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.106.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.106.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.107.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.107.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.107.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.108.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.108.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.108.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.109.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.109.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.109.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.11.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.11.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.11.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.110.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.110.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.110.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.111.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.111.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.111.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.112.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.112.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.112.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.113.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.113.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.113.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.114.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.114.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.114.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.115.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.115.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.115.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.116.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.116.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.116.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.117.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.117.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.117.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.118.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.118.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.118.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.119.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.119.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.119.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.12.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.12.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.12.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.120.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.120.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.120.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.121.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.121.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.121.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.122.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.122.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.122.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.123.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.123.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.123.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.124.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.124.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.124.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.125.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.125.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.125.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.126.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.126.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.126.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.127.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.127.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.127.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.128.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.128.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.128.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.129.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.129.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.129.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.13.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.13.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.13.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.130.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.130.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.130.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.131.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.131.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.131.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.132.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.132.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.132.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.133.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.133.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.133.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.134.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.134.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.134.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.135.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.135.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.135.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.136.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.136.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.136.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.137.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.137.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.137.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.138.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.138.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.138.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.139.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.139.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.139.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.14.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.14.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.14.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.140.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.140.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.140.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.141.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.141.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.141.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.142.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.142.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.142.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.143.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.143.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.143.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.144.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.144.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.144.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.145.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.145.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.145.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.146.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.146.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.146.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.147.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.147.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.147.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.148.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.148.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.148.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.149.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.149.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.149.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.15.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.15.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.15.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.150.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.150.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.150.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.151.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.151.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.151.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.152.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.152.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.152.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.153.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.153.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.153.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.154.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.154.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.154.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.155.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.155.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.155.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.156.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.156.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.156.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.157.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.157.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.157.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.158.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.158.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.158.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.159.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.159.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.159.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.16.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.16.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.16.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.17.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.17.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.17.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.18.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.18.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.18.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.19.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.19.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.19.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.2.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.2.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.2.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.20.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.20.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.20.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.21.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.21.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.21.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.22.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.22.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.22.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.23.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.23.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.23.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.24.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.24.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.24.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.25.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.25.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.25.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.26.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.26.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.26.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.27.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.27.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.27.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.28.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.28.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.28.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.29.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.29.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.29.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.3.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.3.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.3.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.30.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.30.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.30.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.31.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.31.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.31.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.32.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.32.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.32.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.33.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.33.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.33.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.34.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.34.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.34.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.35.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.35.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.35.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.36.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.36.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.36.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.37.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.37.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.37.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.38.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.38.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.38.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.39.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.39.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.39.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.4.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.4.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.4.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.40.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.40.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.40.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.41.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.41.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.41.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.42.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.42.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.42.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.43.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.43.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.43.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.44.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.44.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.44.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.45.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.45.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.45.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.46.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.46.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.46.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.47.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.47.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.47.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.48.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.48.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.48.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.49.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.49.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.49.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.5.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.5.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.5.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.50.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.50.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.50.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.51.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.51.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.51.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.52.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.52.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.52.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.53.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.53.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.53.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.54.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.54.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.54.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.55.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.55.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.55.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.56.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.56.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.56.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.57.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.57.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.57.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.58.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.58.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.58.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.59.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.59.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.59.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.6.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.6.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.6.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.60.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.60.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.60.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.61.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.61.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.61.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.62.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.62.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.62.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.63.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.63.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.63.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.64.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.64.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.64.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.65.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.65.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.65.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.66.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.66.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.66.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.67.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.67.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.67.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.68.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.68.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.68.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.69.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.69.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.69.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.7.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.7.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.7.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.70.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.70.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.70.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.71.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.71.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.71.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.72.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.72.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.72.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.73.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.73.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.73.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.74.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.74.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.74.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.75.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.75.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.75.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.76.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.76.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.76.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.77.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.77.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.77.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.78.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.78.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.78.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.79.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.79.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.79.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.8.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.8.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.8.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.80.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.80.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.80.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.81.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.81.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.81.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.82.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.82.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.82.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.83.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.83.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.83.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.84.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.84.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.84.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.85.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.85.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.85.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.86.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.86.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.86.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.87.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.87.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.87.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.88.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.88.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.88.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.89.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.89.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.89.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.9.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.9.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.9.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.90.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.90.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.90.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.91.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.91.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.91.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.92.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.92.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.92.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.93.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.93.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.93.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.94.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.94.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.94.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.95.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.95.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.95.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.96.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.96.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.96.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.97.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.97.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.97.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.98.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.98.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.98.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.99.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.99.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.experts.99.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.gate.e_score_correction_bias": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.gate.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.shared_experts.down_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.shared_experts.gate_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.mlp.shared_experts.up_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.post_attention_layernorm.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.k_norm.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.k_proj.bias": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.k_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.o_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.q_norm.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.q_proj.bias": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.q_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.v_proj.bias": "model-00025-of-00092.safetensors",
+ "model.layers.24.self_attn.v_proj.weight": "model-00025-of-00092.safetensors",
+ "model.layers.25.input_layernorm.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.0.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.0.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.0.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.1.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.1.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.1.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.10.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.10.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.10.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.100.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.100.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.100.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.101.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.101.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.101.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.102.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.102.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.102.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.103.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.103.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.103.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.104.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.104.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.104.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.105.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.105.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.105.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.106.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.106.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.106.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.107.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.107.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.107.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.108.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.108.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.108.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.109.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.109.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.109.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.11.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.11.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.11.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.110.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.110.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.110.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.111.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.111.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.111.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.112.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.112.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.112.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.113.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.113.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.113.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.114.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.114.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.114.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.115.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.115.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.115.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.116.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.116.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.116.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.117.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.117.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.117.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.118.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.118.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.118.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.119.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.119.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.119.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.12.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.12.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.12.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.120.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.120.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.120.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.121.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.121.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.121.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.122.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.122.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.122.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.123.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.123.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.123.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.124.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.124.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.124.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.125.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.125.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.125.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.126.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.126.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.126.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.127.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.127.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.127.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.128.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.128.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.128.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.129.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.129.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.129.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.13.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.13.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.13.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.130.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.130.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.130.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.131.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.131.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.131.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.132.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.132.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.132.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.133.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.133.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.133.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.134.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.134.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.134.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.135.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.135.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.135.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.136.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.136.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.136.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.137.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.137.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.137.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.138.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.138.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.138.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.139.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.139.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.139.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.14.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.14.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.14.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.140.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.140.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.140.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.141.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.141.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.141.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.142.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.142.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.142.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.143.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.143.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.143.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.144.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.144.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.144.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.145.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.145.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.145.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.146.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.146.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.146.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.147.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.147.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.147.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.148.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.148.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.148.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.149.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.149.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.149.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.15.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.15.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.15.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.150.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.150.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.150.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.151.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.151.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.151.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.152.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.152.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.152.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.153.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.153.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.153.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.154.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.154.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.154.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.155.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.155.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.155.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.156.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.156.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.156.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.157.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.157.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.157.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.158.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.158.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.158.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.159.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.159.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.159.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.16.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.16.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.16.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.17.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.17.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.17.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.18.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.18.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.18.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.19.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.19.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.19.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.2.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.2.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.2.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.20.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.20.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.20.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.21.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.21.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.21.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.22.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.22.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.22.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.23.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.23.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.23.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.24.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.24.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.24.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.25.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.25.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.25.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.26.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.26.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.26.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.27.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.27.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.27.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.28.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.28.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.28.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.29.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.29.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.29.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.3.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.3.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.3.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.30.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.30.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.30.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.31.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.31.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.31.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.32.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.32.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.32.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.33.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.33.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.33.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.34.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.34.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.34.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.35.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.35.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.35.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.36.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.36.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.36.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.37.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.37.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.37.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.38.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.38.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.38.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.39.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.39.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.39.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.4.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.4.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.4.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.40.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.40.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.40.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.41.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.41.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.41.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.42.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.42.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.42.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.43.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.43.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.43.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.44.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.44.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.44.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.45.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.45.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.45.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.46.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.46.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.46.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.47.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.47.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.47.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.48.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.48.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.48.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.49.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.49.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.49.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.5.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.5.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.5.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.50.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.50.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.50.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.51.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.51.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.51.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.52.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.52.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.52.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.53.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.53.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.53.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.54.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.54.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.54.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.55.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.55.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.55.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.56.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.56.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.56.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.57.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.57.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.57.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.58.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.58.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.58.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.59.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.59.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.59.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.6.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.6.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.6.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.60.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.60.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.60.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.61.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.61.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.61.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.62.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.62.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.62.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.63.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.63.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.63.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.64.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.64.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.64.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.65.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.65.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.65.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.66.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.66.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.66.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.67.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.67.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.67.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.68.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.68.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.68.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.69.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.69.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.69.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.7.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.7.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.7.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.70.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.70.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.70.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.71.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.71.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.71.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.72.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.72.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.72.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.73.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.73.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.73.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.74.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.74.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.74.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.75.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.75.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.75.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.76.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.76.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.76.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.77.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.77.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.77.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.78.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.78.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.78.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.79.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.79.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.79.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.8.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.8.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.8.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.80.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.80.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.80.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.81.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.81.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.81.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.82.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.82.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.82.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.83.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.83.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.83.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.84.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.84.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.84.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.85.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.85.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.85.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.86.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.86.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.86.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.87.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.87.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.87.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.88.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.88.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.88.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.89.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.89.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.89.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.9.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.9.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.9.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.90.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.90.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.90.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.91.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.91.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.91.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.92.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.92.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.92.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.93.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.93.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.93.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.94.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.94.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.94.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.95.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.95.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.95.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.96.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.96.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.96.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.97.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.97.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.97.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.98.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.98.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.98.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.99.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.99.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.experts.99.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.gate.e_score_correction_bias": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.gate.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.shared_experts.down_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.shared_experts.gate_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.mlp.shared_experts.up_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.post_attention_layernorm.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.k_norm.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.k_proj.bias": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.k_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.o_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.q_norm.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.q_proj.bias": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.q_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.v_proj.bias": "model-00026-of-00092.safetensors",
+ "model.layers.25.self_attn.v_proj.weight": "model-00026-of-00092.safetensors",
+ "model.layers.26.input_layernorm.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.0.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.0.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.0.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.1.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.1.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.1.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.10.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.10.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.10.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.100.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.100.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.100.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.101.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.101.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.101.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.102.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.102.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.102.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.103.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.103.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.103.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.104.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.104.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.104.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.105.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.105.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.105.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.106.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.106.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.106.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.107.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.107.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.107.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.108.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.108.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.108.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.109.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.109.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.109.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.11.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.11.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.11.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.110.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.110.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.110.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.111.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.111.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.111.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.112.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.112.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.112.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.113.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.113.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.113.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.114.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.114.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.114.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.115.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.115.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.115.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.116.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.116.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.116.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.117.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.117.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.117.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.118.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.118.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.118.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.119.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.119.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.119.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.12.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.12.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.12.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.120.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.120.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.120.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.121.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.121.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.121.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.122.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.122.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.122.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.123.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.123.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.123.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.124.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.124.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.124.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.125.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.125.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.125.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.126.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.126.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.126.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.127.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.127.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.127.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.128.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.128.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.128.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.129.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.129.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.129.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.13.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.13.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.13.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.130.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.130.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.130.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.131.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.131.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.131.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.132.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.132.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.132.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.133.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.133.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.133.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.134.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.134.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.134.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.135.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.135.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.135.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.136.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.136.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.136.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.137.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.137.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.137.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.138.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.138.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.138.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.139.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.139.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.139.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.14.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.14.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.14.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.140.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.140.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.140.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.141.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.141.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.141.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.142.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.142.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.142.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.143.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.143.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.143.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.144.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.144.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.144.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.145.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.145.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.145.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.146.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.146.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.146.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.147.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.147.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.147.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.148.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.148.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.148.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.149.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.149.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.149.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.15.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.15.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.15.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.150.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.150.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.150.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.151.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.151.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.151.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.152.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.152.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.152.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.153.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.153.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.153.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.154.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.154.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.154.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.155.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.155.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.155.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.156.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.156.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.156.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.157.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.157.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.157.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.158.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.158.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.158.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.159.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.159.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.159.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.16.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.16.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.16.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.17.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.17.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.17.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.18.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.18.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.18.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.19.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.19.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.19.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.2.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.2.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.2.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.20.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.20.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.20.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.21.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.21.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.21.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.22.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.22.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.22.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.23.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.23.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.23.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.24.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.24.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.24.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.25.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.25.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.25.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.26.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.26.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.26.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.27.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.27.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.27.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.28.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.28.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.28.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.29.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.29.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.29.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.3.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.3.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.3.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.30.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.30.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.30.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.31.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.31.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.31.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.32.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.32.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.32.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.33.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.33.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.33.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.34.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.34.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.34.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.35.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.35.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.35.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.36.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.36.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.36.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.37.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.37.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.37.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.38.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.38.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.38.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.39.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.39.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.39.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.4.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.4.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.4.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.40.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.40.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.40.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.41.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.41.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.41.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.42.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.42.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.42.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.43.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.43.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.43.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.44.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.44.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.44.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.45.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.45.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.45.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.46.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.46.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.46.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.47.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.47.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.47.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.48.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.48.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.48.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.49.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.49.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.49.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.5.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.5.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.5.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.50.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.50.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.50.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.51.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.51.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.51.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.52.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.52.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.52.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.53.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.53.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.53.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.54.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.54.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.54.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.55.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.55.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.55.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.56.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.56.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.56.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.57.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.57.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.57.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.58.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.58.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.58.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.59.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.59.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.59.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.6.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.6.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.6.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.60.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.60.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.60.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.61.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.61.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.61.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.62.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.62.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.62.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.63.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.63.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.63.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.64.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.64.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.64.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.65.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.65.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.65.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.66.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.66.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.66.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.67.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.67.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.67.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.68.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.68.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.68.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.69.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.69.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.69.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.7.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.7.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.7.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.70.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.70.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.70.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.71.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.71.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.71.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.72.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.72.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.72.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.73.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.73.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.73.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.74.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.74.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.74.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.75.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.75.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.75.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.76.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.76.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.76.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.77.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.77.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.77.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.78.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.78.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.78.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.79.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.79.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.79.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.8.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.8.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.8.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.80.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.80.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.80.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.81.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.81.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.81.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.82.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.82.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.82.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.83.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.83.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.83.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.84.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.84.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.84.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.85.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.85.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.85.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.86.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.86.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.86.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.87.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.87.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.87.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.88.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.88.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.88.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.89.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.89.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.89.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.9.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.9.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.9.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.90.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.90.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.90.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.91.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.91.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.91.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.92.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.92.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.92.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.93.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.93.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.93.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.94.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.94.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.94.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.95.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.95.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.95.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.96.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.96.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.96.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.97.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.97.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.97.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.98.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.98.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.98.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.99.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.99.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.experts.99.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.gate.e_score_correction_bias": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.gate.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.shared_experts.down_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.shared_experts.gate_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.mlp.shared_experts.up_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.post_attention_layernorm.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.k_norm.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.k_proj.bias": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.k_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.o_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.q_norm.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.q_proj.bias": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.q_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.v_proj.bias": "model-00027-of-00092.safetensors",
+ "model.layers.26.self_attn.v_proj.weight": "model-00027-of-00092.safetensors",
+ "model.layers.27.input_layernorm.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.0.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.0.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.0.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.1.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.1.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.1.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.10.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.10.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.10.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.100.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.100.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.100.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.101.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.101.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.101.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.102.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.102.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.102.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.103.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.103.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.103.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.104.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.104.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.104.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.105.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.105.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.105.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.106.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.106.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.106.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.107.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.107.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.107.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.108.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.108.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.108.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.109.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.109.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.109.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.11.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.11.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.11.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.110.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.110.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.110.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.111.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.111.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.111.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.112.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.112.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.112.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.113.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.113.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.113.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.114.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.114.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.114.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.115.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.115.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.115.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.116.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.116.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.116.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.117.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.117.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.117.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.118.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.118.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.118.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.119.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.119.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.119.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.12.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.12.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.12.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.120.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.120.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.120.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.121.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.121.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.121.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.122.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.122.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.122.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.123.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.123.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.123.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.124.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.124.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.124.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.125.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.125.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.125.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.126.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.126.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.126.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.127.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.127.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.127.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.128.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.128.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.128.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.129.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.129.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.129.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.13.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.13.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.13.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.130.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.130.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.130.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.131.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.131.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.131.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.132.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.132.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.132.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.133.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.133.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.133.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.134.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.134.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.134.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.135.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.135.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.135.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.136.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.136.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.136.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.137.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.137.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.137.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.138.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.138.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.138.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.139.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.139.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.139.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.14.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.14.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.14.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.140.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.140.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.140.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.141.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.141.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.141.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.142.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.142.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.142.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.143.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.143.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.143.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.144.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.144.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.144.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.145.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.145.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.145.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.146.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.146.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.146.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.147.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.147.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.147.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.148.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.148.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.148.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.149.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.149.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.149.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.15.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.15.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.15.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.150.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.150.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.150.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.151.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.151.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.151.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.152.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.152.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.152.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.153.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.153.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.153.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.154.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.154.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.154.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.155.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.155.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.155.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.156.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.156.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.156.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.157.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.157.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.157.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.158.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.158.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.158.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.159.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.159.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.159.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.16.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.16.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.16.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.17.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.17.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.17.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.18.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.18.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.18.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.19.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.19.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.19.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.2.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.2.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.2.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.20.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.20.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.20.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.21.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.21.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.21.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.22.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.22.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.22.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.23.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.23.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.23.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.24.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.24.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.24.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.25.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.25.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.25.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.26.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.26.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.26.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.27.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.27.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.27.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.28.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.28.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.28.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.29.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.29.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.29.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.3.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.3.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.3.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.30.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.30.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.30.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.31.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.31.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.31.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.32.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.32.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.32.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.33.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.33.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.33.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.34.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.34.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.34.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.35.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.35.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.35.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.36.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.36.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.36.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.37.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.37.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.37.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.38.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.38.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.38.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.39.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.39.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.39.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.4.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.4.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.4.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.40.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.40.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.40.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.41.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.41.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.41.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.42.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.42.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.42.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.43.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.43.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.43.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.44.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.44.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.44.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.45.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.45.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.45.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.46.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.46.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.46.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.47.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.47.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.47.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.48.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.48.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.48.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.49.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.49.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.49.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.5.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.5.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.5.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.50.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.50.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.50.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.51.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.51.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.51.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.52.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.52.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.52.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.53.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.53.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.53.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.54.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.54.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.54.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.55.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.55.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.55.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.56.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.56.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.56.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.57.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.57.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.57.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.58.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.58.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.58.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.59.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.59.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.59.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.6.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.6.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.6.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.60.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.60.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.60.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.61.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.61.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.61.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.62.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.62.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.62.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.63.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.63.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.63.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.64.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.64.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.64.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.65.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.65.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.65.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.66.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.66.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.66.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.67.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.67.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.67.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.68.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.68.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.68.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.69.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.69.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.69.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.7.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.7.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.7.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.70.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.70.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.70.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.71.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.71.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.71.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.72.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.72.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.72.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.73.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.73.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.73.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.74.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.74.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.74.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.75.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.75.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.75.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.76.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.76.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.76.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.77.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.77.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.77.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.78.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.78.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.78.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.79.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.79.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.79.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.8.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.8.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.8.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.80.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.80.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.80.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.81.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.81.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.81.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.82.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.82.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.82.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.83.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.83.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.83.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.84.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.84.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.84.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.85.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.85.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.85.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.86.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.86.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.86.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.87.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.87.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.87.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.88.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.88.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.88.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.89.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.89.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.89.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.9.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.9.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.9.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.90.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.90.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.90.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.91.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.91.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.91.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.92.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.92.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.92.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.93.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.93.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.93.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.94.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.94.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.94.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.95.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.95.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.95.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.96.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.96.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.96.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.97.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.97.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.97.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.98.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.98.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.98.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.99.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.99.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.experts.99.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.gate.e_score_correction_bias": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.gate.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.shared_experts.down_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.shared_experts.gate_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.mlp.shared_experts.up_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.post_attention_layernorm.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.k_norm.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.k_proj.bias": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.k_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.o_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.q_norm.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.q_proj.bias": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.q_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.v_proj.bias": "model-00028-of-00092.safetensors",
+ "model.layers.27.self_attn.v_proj.weight": "model-00028-of-00092.safetensors",
+ "model.layers.28.input_layernorm.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.0.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.0.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.0.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.1.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.1.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.1.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.10.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.10.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.10.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.100.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.100.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.100.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.101.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.101.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.101.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.102.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.102.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.102.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.103.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.103.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.103.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.104.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.104.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.104.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.105.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.105.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.105.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.106.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.106.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.106.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.107.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.107.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.107.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.108.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.108.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.108.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.109.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.109.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.109.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.11.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.11.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.11.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.110.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.110.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.110.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.111.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.111.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.111.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.112.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.112.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.112.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.113.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.113.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.113.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.114.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.114.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.114.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.115.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.115.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.115.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.116.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.116.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.116.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.117.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.117.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.117.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.118.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.118.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.118.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.119.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.119.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.119.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.12.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.12.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.12.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.120.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.120.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.120.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.121.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.121.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.121.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.122.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.122.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.122.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.123.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.123.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.123.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.124.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.124.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.124.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.125.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.125.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.125.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.126.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.126.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.126.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.127.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.127.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.127.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.128.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.128.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.128.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.129.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.129.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.129.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.13.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.13.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.13.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.130.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.130.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.130.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.131.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.131.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.131.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.132.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.132.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.132.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.133.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.133.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.133.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.134.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.134.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.134.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.135.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.135.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.135.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.136.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.136.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.136.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.137.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.137.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.137.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.138.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.138.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.138.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.139.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.139.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.139.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.14.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.14.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.14.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.140.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.140.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.140.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.141.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.141.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.141.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.142.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.142.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.142.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.143.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.143.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.143.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.144.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.144.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.144.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.145.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.145.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.145.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.146.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.146.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.146.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.147.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.147.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.147.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.148.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.148.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.148.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.149.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.149.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.149.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.15.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.15.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.15.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.150.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.150.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.150.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.151.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.151.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.151.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.152.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.152.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.152.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.153.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.153.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.153.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.154.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.154.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.154.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.155.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.155.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.155.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.156.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.156.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.156.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.157.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.157.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.157.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.158.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.158.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.158.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.159.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.159.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.159.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.16.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.16.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.16.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.17.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.17.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.17.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.18.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.18.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.18.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.19.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.19.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.19.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.2.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.2.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.2.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.20.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.20.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.20.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.21.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.21.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.21.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.22.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.22.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.22.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.23.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.23.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.23.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.24.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.24.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.24.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.25.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.25.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.25.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.26.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.26.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.26.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.27.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.27.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.27.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.28.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.28.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.28.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.29.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.29.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.29.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.3.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.3.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.3.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.30.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.30.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.30.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.31.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.31.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.31.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.32.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.32.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.32.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.33.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.33.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.33.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.34.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.34.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.34.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.35.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.35.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.35.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.36.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.36.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.36.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.37.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.37.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.37.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.38.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.38.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.38.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.39.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.39.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.39.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.4.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.4.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.4.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.40.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.40.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.40.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.41.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.41.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.41.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.42.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.42.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.42.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.43.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.43.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.43.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.44.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.44.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.44.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.45.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.45.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.45.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.46.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.46.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.46.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.47.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.47.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.47.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.48.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.48.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.48.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.49.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.49.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.49.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.5.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.5.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.5.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.50.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.50.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.50.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.51.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.51.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.51.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.52.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.52.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.52.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.53.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.53.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.53.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.54.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.54.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.54.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.55.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.55.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.55.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.56.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.56.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.56.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.57.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.57.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.57.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.58.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.58.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.58.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.59.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.59.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.59.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.6.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.6.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.6.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.60.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.60.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.60.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.61.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.61.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.61.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.62.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.62.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.62.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.63.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.63.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.63.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.64.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.64.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.64.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.65.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.65.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.65.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.66.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.66.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.66.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.67.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.67.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.67.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.68.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.68.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.68.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.69.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.69.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.69.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.7.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.7.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.7.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.70.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.70.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.70.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.71.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.71.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.71.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.72.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.72.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.72.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.73.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.73.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.73.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.74.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.74.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.74.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.75.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.75.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.75.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.76.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.76.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.76.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.77.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.77.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.77.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.78.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.78.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.78.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.79.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.79.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.79.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.8.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.8.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.8.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.80.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.80.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.80.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.81.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.81.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.81.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.82.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.82.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.82.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.83.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.83.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.83.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.84.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.84.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.84.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.85.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.85.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.85.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.86.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.86.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.86.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.87.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.87.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.87.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.88.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.88.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.88.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.89.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.89.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.89.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.9.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.9.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.9.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.90.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.90.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.90.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.91.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.91.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.91.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.92.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.92.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.92.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.93.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.93.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.93.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.94.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.94.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.94.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.95.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.95.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.95.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.96.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.96.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.96.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.97.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.97.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.97.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.98.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.98.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.98.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.99.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.99.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.experts.99.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.gate.e_score_correction_bias": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.gate.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.shared_experts.down_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.shared_experts.gate_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.mlp.shared_experts.up_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.post_attention_layernorm.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.k_norm.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.k_proj.bias": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.k_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.o_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.q_norm.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.q_proj.bias": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.q_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.v_proj.bias": "model-00029-of-00092.safetensors",
+ "model.layers.28.self_attn.v_proj.weight": "model-00029-of-00092.safetensors",
+ "model.layers.29.input_layernorm.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.0.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.0.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.0.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.1.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.1.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.1.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.10.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.10.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.10.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.100.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.100.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.100.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.101.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.101.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.101.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.102.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.102.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.102.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.103.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.103.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.103.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.104.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.104.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.104.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.105.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.105.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.105.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.106.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.106.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.106.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.107.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.107.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.107.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.108.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.108.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.108.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.109.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.109.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.109.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.11.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.11.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.11.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.110.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.110.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.110.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.111.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.111.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.111.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.112.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.112.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.112.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.113.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.113.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.113.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.114.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.114.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.114.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.115.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.115.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.115.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.116.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.116.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.116.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.117.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.117.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.117.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.118.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.118.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.118.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.119.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.119.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.119.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.12.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.12.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.12.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.120.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.120.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.120.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.121.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.121.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.121.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.122.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.122.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.122.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.123.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.123.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.123.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.124.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.124.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.124.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.125.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.125.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.125.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.126.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.126.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.126.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.127.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.127.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.127.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.128.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.128.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.128.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.129.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.129.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.129.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.13.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.13.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.13.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.130.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.130.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.130.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.131.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.131.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.131.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.132.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.132.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.132.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.133.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.133.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.133.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.134.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.134.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.134.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.135.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.135.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.135.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.136.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.136.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.136.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.137.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.137.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.137.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.138.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.138.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.138.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.139.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.139.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.139.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.14.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.14.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.14.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.140.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.140.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.140.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.141.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.141.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.141.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.142.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.142.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.142.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.143.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.143.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.143.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.144.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.144.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.144.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.145.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.145.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.145.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.146.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.146.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.146.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.147.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.147.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.147.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.148.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.148.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.148.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.149.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.149.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.149.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.15.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.15.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.15.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.150.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.150.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.150.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.151.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.151.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.151.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.152.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.152.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.152.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.153.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.153.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.153.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.154.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.154.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.154.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.155.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.155.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.155.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.156.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.156.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.156.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.157.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.157.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.157.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.158.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.158.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.158.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.159.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.159.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.159.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.16.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.16.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.16.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.17.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.17.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.17.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.18.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.18.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.18.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.19.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.19.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.19.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.2.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.2.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.2.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.20.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.20.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.20.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.21.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.21.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.21.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.22.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.22.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.22.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.23.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.23.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.23.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.24.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.24.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.24.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.25.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.25.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.25.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.26.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.26.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.26.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.27.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.27.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.27.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.28.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.28.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.28.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.29.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.29.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.29.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.3.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.3.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.3.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.30.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.30.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.30.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.31.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.31.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.31.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.32.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.32.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.32.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.33.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.33.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.33.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.34.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.34.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.34.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.35.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.35.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.35.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.36.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.36.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.36.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.37.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.37.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.37.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.38.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.38.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.38.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.39.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.39.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.39.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.4.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.4.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.4.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.40.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.40.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.40.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.41.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.41.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.41.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.42.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.42.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.42.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.43.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.43.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.43.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.44.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.44.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.44.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.45.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.45.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.45.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.46.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.46.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.46.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.47.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.47.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.47.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.48.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.48.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.48.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.49.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.49.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.49.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.5.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.5.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.5.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.50.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.50.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.50.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.51.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.51.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.51.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.52.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.52.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.52.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.53.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.53.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.53.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.54.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.54.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.54.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.55.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.55.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.55.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.56.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.56.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.56.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.57.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.57.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.57.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.58.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.58.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.58.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.59.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.59.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.59.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.6.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.6.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.6.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.60.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.60.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.60.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.61.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.61.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.61.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.62.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.62.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.62.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.63.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.63.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.63.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.64.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.64.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.64.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.65.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.65.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.65.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.66.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.66.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.66.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.67.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.67.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.67.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.68.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.68.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.68.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.69.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.69.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.69.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.7.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.7.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.7.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.70.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.70.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.70.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.71.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.71.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.71.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.72.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.72.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.72.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.73.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.73.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.73.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.74.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.74.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.74.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.75.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.75.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.75.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.76.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.76.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.76.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.77.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.77.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.77.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.78.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.78.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.78.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.79.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.79.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.79.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.8.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.8.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.8.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.80.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.80.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.80.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.81.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.81.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.81.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.82.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.82.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.82.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.83.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.83.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.83.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.84.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.84.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.84.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.85.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.85.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.85.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.86.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.86.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.86.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.87.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.87.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.87.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.88.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.88.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.88.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.89.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.89.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.89.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.9.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.9.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.9.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.90.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.90.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.90.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.91.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.91.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.91.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.92.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.92.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.92.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.93.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.93.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.93.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.94.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.94.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.94.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.95.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.95.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.95.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.96.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.96.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.96.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.97.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.97.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.97.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.98.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.98.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.98.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.99.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.99.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.experts.99.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.gate.e_score_correction_bias": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.gate.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.shared_experts.down_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.shared_experts.gate_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.mlp.shared_experts.up_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.post_attention_layernorm.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.k_norm.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.k_proj.bias": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.k_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.o_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.q_norm.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.q_proj.bias": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.q_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.v_proj.bias": "model-00030-of-00092.safetensors",
+ "model.layers.29.self_attn.v_proj.weight": "model-00030-of-00092.safetensors",
+ "model.layers.30.input_layernorm.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.0.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.0.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.0.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.1.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.1.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.1.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.10.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.10.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.10.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.100.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.100.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.100.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.101.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.101.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.101.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.102.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.102.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.102.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.103.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.103.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.103.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.104.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.104.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.104.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.105.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.105.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.105.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.106.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.106.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.106.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.107.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.107.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.107.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.108.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.108.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.108.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.109.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.109.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.109.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.11.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.11.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.11.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.110.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.110.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.110.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.111.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.111.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.111.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.112.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.112.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.112.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.113.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.113.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.113.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.114.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.114.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.114.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.115.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.115.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.115.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.116.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.116.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.116.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.117.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.117.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.117.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.118.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.118.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.118.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.119.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.119.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.119.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.12.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.12.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.12.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.120.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.120.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.120.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.121.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.121.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.121.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.122.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.122.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.122.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.123.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.123.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.123.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.124.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.124.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.124.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.125.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.125.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.125.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.126.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.126.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.126.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.127.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.127.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.127.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.128.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.128.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.128.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.129.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.129.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.129.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.13.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.13.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.13.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.130.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.130.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.130.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.131.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.131.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.131.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.132.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.132.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.132.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.133.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.133.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.133.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.134.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.134.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.134.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.135.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.135.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.135.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.136.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.136.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.136.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.137.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.137.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.137.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.138.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.138.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.138.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.139.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.139.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.139.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.14.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.14.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.14.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.140.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.140.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.140.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.141.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.141.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.141.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.142.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.142.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.142.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.143.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.143.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.143.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.144.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.144.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.144.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.145.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.145.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.145.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.146.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.146.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.146.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.147.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.147.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.147.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.148.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.148.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.148.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.149.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.149.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.149.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.15.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.15.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.15.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.150.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.150.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.150.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.151.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.151.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.151.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.152.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.152.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.152.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.153.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.153.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.153.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.154.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.154.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.154.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.155.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.155.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.155.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.156.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.156.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.156.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.157.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.157.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.157.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.158.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.158.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.158.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.159.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.159.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.159.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.16.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.16.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.16.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.17.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.17.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.17.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.18.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.18.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.18.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.19.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.19.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.19.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.2.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.2.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.2.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.20.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.20.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.20.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.21.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.21.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.21.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.22.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.22.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.22.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.23.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.23.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.23.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.24.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.24.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.24.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.25.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.25.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.25.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.26.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.26.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.26.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.27.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.27.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.27.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.28.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.28.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.28.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.29.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.29.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.29.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.3.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.3.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.3.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.30.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.30.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.30.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.31.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.31.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.31.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.32.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.32.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.32.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.33.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.33.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.33.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.34.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.34.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.34.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.35.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.35.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.35.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.36.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.36.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.36.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.37.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.37.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.37.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.38.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.38.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.38.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.39.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.39.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.39.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.4.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.4.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.4.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.40.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.40.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.40.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.41.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.41.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.41.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.42.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.42.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.42.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.43.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.43.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.43.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.44.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.44.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.44.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.45.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.45.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.45.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.46.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.46.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.46.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.47.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.47.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.47.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.48.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.48.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.48.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.49.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.49.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.49.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.5.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.5.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.5.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.50.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.50.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.50.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.51.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.51.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.51.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.52.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.52.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.52.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.53.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.53.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.53.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.54.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.54.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.54.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.55.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.55.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.55.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.56.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.56.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.56.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.57.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.57.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.57.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.58.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.58.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.58.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.59.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.59.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.59.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.6.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.6.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.6.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.60.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.60.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.60.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.61.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.61.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.61.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.62.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.62.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.62.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.63.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.63.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.63.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.64.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.64.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.64.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.65.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.65.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.65.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.66.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.66.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.66.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.67.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.67.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.67.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.68.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.68.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.68.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.69.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.69.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.69.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.7.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.7.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.7.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.70.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.70.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.70.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.71.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.71.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.71.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.72.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.72.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.72.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.73.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.73.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.73.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.74.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.74.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.74.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.75.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.75.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.75.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.76.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.76.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.76.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.77.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.77.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.77.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.78.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.78.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.78.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.79.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.79.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.79.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.8.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.8.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.8.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.80.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.80.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.80.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.81.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.81.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.81.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.82.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.82.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.82.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.83.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.83.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.83.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.84.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.84.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.84.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.85.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.85.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.85.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.86.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.86.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.86.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.87.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.87.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.87.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.88.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.88.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.88.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.89.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.89.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.89.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.9.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.9.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.9.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.90.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.90.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.90.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.91.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.91.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.91.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.92.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.92.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.92.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.93.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.93.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.93.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.94.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.94.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.94.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.95.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.95.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.95.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.96.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.96.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.96.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.97.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.97.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.97.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.98.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.98.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.98.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.99.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.99.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.experts.99.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.gate.e_score_correction_bias": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.gate.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.shared_experts.down_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.shared_experts.gate_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.mlp.shared_experts.up_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.post_attention_layernorm.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.k_norm.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.k_proj.bias": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.k_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.o_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.q_norm.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.q_proj.bias": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.q_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.v_proj.bias": "model-00031-of-00092.safetensors",
+ "model.layers.30.self_attn.v_proj.weight": "model-00031-of-00092.safetensors",
+ "model.layers.31.input_layernorm.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.0.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.0.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.0.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.1.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.1.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.1.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.10.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.10.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.10.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.100.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.100.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.100.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.101.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.101.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.101.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.102.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.102.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.102.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.103.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.103.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.103.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.104.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.104.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.104.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.105.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.105.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.105.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.106.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.106.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.106.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.107.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.107.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.107.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.108.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.108.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.108.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.109.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.109.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.109.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.11.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.11.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.11.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.110.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.110.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.110.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.111.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.111.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.111.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.112.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.112.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.112.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.113.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.113.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.113.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.114.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.114.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.114.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.115.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.115.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.115.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.116.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.116.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.116.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.117.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.117.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.117.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.118.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.118.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.118.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.119.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.119.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.119.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.12.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.12.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.12.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.120.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.120.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.120.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.121.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.121.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.121.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.122.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.122.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.122.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.123.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.123.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.123.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.124.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.124.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.124.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.125.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.125.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.125.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.126.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.126.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.126.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.127.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.127.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.127.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.128.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.128.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.128.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.129.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.129.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.129.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.13.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.13.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.13.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.130.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.130.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.130.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.131.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.131.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.131.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.132.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.132.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.132.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.133.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.133.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.133.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.134.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.134.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.134.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.135.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.135.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.135.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.136.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.136.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.136.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.137.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.137.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.137.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.138.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.138.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.138.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.139.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.139.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.139.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.14.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.14.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.14.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.140.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.140.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.140.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.141.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.141.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.141.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.142.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.142.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.142.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.143.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.143.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.143.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.144.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.144.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.144.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.145.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.145.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.145.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.146.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.146.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.146.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.147.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.147.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.147.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.148.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.148.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.148.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.149.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.149.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.149.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.15.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.15.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.15.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.150.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.150.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.150.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.151.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.151.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.151.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.152.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.152.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.152.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.153.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.153.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.153.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.154.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.154.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.154.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.155.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.155.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.155.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.156.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.156.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.156.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.157.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.157.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.157.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.158.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.158.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.158.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.159.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.159.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.159.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.16.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.16.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.16.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.17.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.17.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.17.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.18.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.18.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.18.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.19.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.19.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.19.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.2.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.2.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.2.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.20.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.20.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.20.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.21.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.21.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.21.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.22.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.22.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.22.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.23.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.23.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.23.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.24.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.24.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.24.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.25.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.25.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.25.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.26.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.26.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.26.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.27.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.27.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.27.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.28.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.28.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.28.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.29.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.29.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.29.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.3.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.3.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.3.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.30.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.30.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.30.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.31.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.31.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.31.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.32.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.32.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.32.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.33.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.33.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.33.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.34.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.34.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.34.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.35.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.35.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.35.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.36.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.36.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.36.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.37.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.37.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.37.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.38.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.38.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.38.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.39.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.39.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.39.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.4.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.4.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.4.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.40.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.40.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.40.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.41.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.41.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.41.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.42.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.42.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.42.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.43.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.43.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.43.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.44.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.44.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.44.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.45.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.45.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.45.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.46.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.46.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.46.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.47.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.47.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.47.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.48.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.48.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.48.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.49.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.49.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.49.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.5.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.5.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.5.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.50.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.50.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.50.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.51.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.51.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.51.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.52.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.52.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.52.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.53.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.53.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.53.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.54.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.54.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.54.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.55.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.55.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.55.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.56.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.56.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.56.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.57.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.57.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.57.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.58.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.58.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.58.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.59.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.59.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.59.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.6.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.6.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.6.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.60.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.60.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.60.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.61.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.61.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.61.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.62.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.62.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.62.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.63.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.63.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.63.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.64.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.64.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.64.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.65.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.65.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.65.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.66.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.66.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.66.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.67.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.67.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.67.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.68.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.68.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.68.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.69.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.69.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.69.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.7.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.7.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.7.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.70.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.70.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.70.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.71.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.71.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.71.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.72.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.72.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.72.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.73.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.73.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.73.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.74.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.74.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.74.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.75.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.75.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.75.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.76.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.76.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.76.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.77.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.77.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.77.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.78.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.78.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.78.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.79.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.79.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.79.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.8.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.8.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.8.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.80.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.80.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.80.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.81.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.81.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.81.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.82.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.82.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.82.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.83.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.83.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.83.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.84.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.84.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.84.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.85.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.85.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.85.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.86.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.86.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.86.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.87.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.87.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.87.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.88.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.88.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.88.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.89.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.89.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.89.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.9.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.9.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.9.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.90.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.90.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.90.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.91.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.91.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.91.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.92.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.92.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.92.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.93.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.93.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.93.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.94.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.94.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.94.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.95.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.95.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.95.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.96.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.96.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.96.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.97.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.97.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.97.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.98.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.98.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.98.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.99.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.99.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.experts.99.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.gate.e_score_correction_bias": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.gate.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.shared_experts.down_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.shared_experts.gate_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.mlp.shared_experts.up_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.post_attention_layernorm.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.k_norm.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.k_proj.bias": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.k_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.o_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.q_norm.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.q_proj.bias": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.q_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.v_proj.bias": "model-00032-of-00092.safetensors",
+ "model.layers.31.self_attn.v_proj.weight": "model-00032-of-00092.safetensors",
+ "model.layers.32.input_layernorm.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.0.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.0.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.0.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.1.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.1.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.1.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.10.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.10.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.10.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.100.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.100.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.100.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.101.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.101.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.101.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.102.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.102.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.102.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.103.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.103.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.103.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.104.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.104.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.104.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.105.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.105.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.105.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.106.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.106.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.106.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.107.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.107.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.107.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.108.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.108.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.108.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.109.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.109.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.109.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.11.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.11.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.11.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.110.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.110.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.110.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.111.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.111.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.111.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.112.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.112.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.112.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.113.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.113.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.113.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.114.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.114.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.114.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.115.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.115.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.115.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.116.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.116.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.116.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.117.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.117.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.117.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.118.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.118.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.118.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.119.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.119.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.119.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.12.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.12.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.12.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.120.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.120.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.120.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.121.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.121.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.121.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.122.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.122.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.122.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.123.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.123.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.123.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.124.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.124.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.124.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.125.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.125.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.125.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.126.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.126.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.126.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.127.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.127.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.127.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.128.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.128.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.128.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.129.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.129.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.129.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.13.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.13.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.13.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.130.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.130.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.130.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.131.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.131.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.131.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.132.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.132.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.132.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.133.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.133.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.133.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.134.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.134.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.134.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.135.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.135.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.135.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.136.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.136.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.136.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.137.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.137.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.137.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.138.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.138.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.138.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.139.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.139.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.139.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.14.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.14.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.14.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.140.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.140.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.140.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.141.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.141.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.141.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.142.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.142.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.142.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.143.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.143.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.143.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.144.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.144.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.144.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.145.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.145.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.145.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.146.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.146.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.146.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.147.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.147.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.147.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.148.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.148.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.148.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.149.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.149.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.149.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.15.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.15.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.15.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.150.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.150.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.150.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.151.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.151.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.151.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.152.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.152.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.152.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.153.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.153.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.153.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.154.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.154.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.154.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.155.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.155.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.155.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.156.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.156.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.156.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.157.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.157.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.157.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.158.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.158.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.158.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.159.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.159.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.159.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.16.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.16.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.16.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.17.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.17.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.17.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.18.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.18.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.18.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.19.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.19.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.19.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.2.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.2.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.2.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.20.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.20.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.20.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.21.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.21.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.21.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.22.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.22.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.22.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.23.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.23.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.23.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.24.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.24.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.24.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.25.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.25.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.25.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.26.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.26.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.26.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.27.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.27.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.27.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.28.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.28.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.28.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.29.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.29.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.29.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.3.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.3.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.3.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.30.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.30.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.30.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.31.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.31.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.31.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.32.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.32.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.32.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.33.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.33.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.33.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.34.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.34.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.34.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.35.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.35.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.35.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.36.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.36.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.36.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.37.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.37.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.37.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.38.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.38.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.38.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.39.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.39.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.39.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.4.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.4.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.4.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.40.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.40.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.40.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.41.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.41.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.41.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.42.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.42.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.42.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.43.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.43.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.43.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.44.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.44.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.44.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.45.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.45.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.45.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.46.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.46.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.46.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.47.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.47.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.47.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.48.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.48.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.48.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.49.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.49.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.49.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.5.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.5.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.5.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.50.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.50.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.50.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.51.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.51.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.51.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.52.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.52.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.52.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.53.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.53.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.53.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.54.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.54.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.54.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.55.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.55.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.55.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.56.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.56.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.56.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.57.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.57.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.57.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.58.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.58.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.58.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.59.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.59.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.59.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.6.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.6.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.6.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.60.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.60.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.60.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.61.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.61.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.61.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.62.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.62.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.62.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.63.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.63.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.63.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.64.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.64.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.64.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.65.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.65.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.65.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.66.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.66.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.66.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.67.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.67.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.67.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.68.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.68.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.68.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.69.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.69.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.69.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.7.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.7.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.7.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.70.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.70.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.70.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.71.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.71.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.71.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.72.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.72.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.72.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.73.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.73.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.73.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.74.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.74.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.74.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.75.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.75.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.75.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.76.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.76.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.76.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.77.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.77.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.77.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.78.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.78.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.78.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.79.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.79.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.79.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.8.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.8.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.8.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.80.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.80.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.80.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.81.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.81.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.81.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.82.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.82.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.82.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.83.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.83.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.83.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.84.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.84.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.84.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.85.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.85.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.85.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.86.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.86.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.86.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.87.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.87.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.87.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.88.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.88.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.88.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.89.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.89.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.89.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.9.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.9.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.9.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.90.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.90.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.90.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.91.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.91.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.91.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.92.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.92.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.92.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.93.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.93.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.93.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.94.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.94.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.94.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.95.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.95.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.95.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.96.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.96.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.96.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.97.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.97.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.97.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.98.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.98.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.98.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.99.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.99.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.experts.99.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.gate.e_score_correction_bias": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.gate.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.shared_experts.down_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.shared_experts.gate_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.mlp.shared_experts.up_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.post_attention_layernorm.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.k_norm.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.k_proj.bias": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.k_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.o_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.q_norm.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.q_proj.bias": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.q_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.v_proj.bias": "model-00033-of-00092.safetensors",
+ "model.layers.32.self_attn.v_proj.weight": "model-00033-of-00092.safetensors",
+ "model.layers.33.input_layernorm.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.0.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.0.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.0.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.1.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.1.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.1.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.10.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.10.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.10.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.100.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.100.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.100.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.101.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.101.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.101.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.102.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.102.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.102.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.103.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.103.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.103.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.104.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.104.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.104.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.105.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.105.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.105.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.106.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.106.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.106.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.107.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.107.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.107.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.108.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.108.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.108.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.109.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.109.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.109.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.11.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.11.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.11.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.110.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.110.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.110.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.111.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.111.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.111.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.112.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.112.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.112.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.113.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.113.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.113.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.114.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.114.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.114.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.115.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.115.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.115.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.116.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.116.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.116.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.117.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.117.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.117.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.118.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.118.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.118.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.119.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.119.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.119.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.12.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.12.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.12.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.120.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.120.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.120.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.121.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.121.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.121.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.122.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.122.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.122.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.123.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.123.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.123.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.124.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.124.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.124.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.125.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.125.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.125.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.126.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.126.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.126.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.127.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.127.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.127.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.128.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.128.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.128.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.129.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.129.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.129.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.13.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.13.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.13.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.130.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.130.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.130.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.131.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.131.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.131.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.132.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.132.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.132.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.133.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.133.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.133.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.134.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.134.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.134.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.135.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.135.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.135.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.136.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.136.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.136.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.137.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.137.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.137.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.138.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.138.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.138.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.139.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.139.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.139.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.14.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.14.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.14.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.140.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.140.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.140.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.141.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.141.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.141.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.142.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.142.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.142.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.143.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.143.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.143.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.144.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.144.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.144.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.145.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.145.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.145.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.146.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.146.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.146.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.147.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.147.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.147.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.148.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.148.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.148.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.149.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.149.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.149.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.15.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.15.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.15.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.150.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.150.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.150.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.151.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.151.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.151.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.152.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.152.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.152.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.153.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.153.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.153.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.154.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.154.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.154.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.155.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.155.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.155.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.156.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.156.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.156.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.157.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.157.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.157.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.158.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.158.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.158.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.159.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.159.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.159.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.16.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.16.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.16.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.17.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.17.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.17.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.18.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.18.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.18.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.19.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.19.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.19.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.2.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.2.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.2.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.20.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.20.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.20.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.21.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.21.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.21.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.22.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.22.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.22.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.23.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.23.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.23.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.24.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.24.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.24.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.25.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.25.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.25.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.26.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.26.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.26.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.27.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.27.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.27.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.28.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.28.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.28.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.29.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.29.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.29.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.3.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.3.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.3.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.30.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.30.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.30.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.31.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.31.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.31.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.32.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.32.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.32.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.33.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.33.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.33.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.34.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.34.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.34.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.35.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.35.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.35.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.36.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.36.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.36.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.37.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.37.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.37.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.38.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.38.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.38.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.39.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.39.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.39.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.4.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.4.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.4.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.40.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.40.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.40.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.41.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.41.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.41.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.42.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.42.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.42.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.43.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.43.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.43.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.44.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.44.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.44.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.45.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.45.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.45.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.46.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.46.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.46.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.47.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.47.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.47.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.48.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.48.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.48.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.49.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.49.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.49.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.5.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.5.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.5.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.50.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.50.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.50.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.51.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.51.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.51.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.52.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.52.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.52.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.53.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.53.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.53.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.54.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.54.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.54.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.55.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.55.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.55.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.56.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.56.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.56.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.57.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.57.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.57.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.58.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.58.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.58.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.59.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.59.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.59.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.6.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.6.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.6.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.60.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.60.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.60.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.61.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.61.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.61.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.62.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.62.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.62.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.63.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.63.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.63.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.64.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.64.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.64.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.65.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.65.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.65.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.66.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.66.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.66.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.67.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.67.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.67.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.68.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.68.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.68.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.69.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.69.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.69.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.7.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.7.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.7.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.70.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.70.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.70.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.71.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.71.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.71.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.72.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.72.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.72.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.73.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.73.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.73.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.74.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.74.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.74.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.75.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.75.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.75.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.76.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.76.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.76.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.77.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.77.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.77.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.78.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.78.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.78.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.79.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.79.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.79.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.8.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.8.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.8.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.80.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.80.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.80.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.81.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.81.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.81.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.82.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.82.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.82.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.83.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.83.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.83.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.84.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.84.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.84.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.85.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.85.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.85.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.86.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.86.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.86.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.87.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.87.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.87.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.88.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.88.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.88.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.89.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.89.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.89.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.9.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.9.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.9.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.90.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.90.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.90.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.91.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.91.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.91.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.92.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.92.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.92.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.93.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.93.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.93.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.94.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.94.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.94.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.95.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.95.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.95.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.96.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.96.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.96.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.97.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.97.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.97.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.98.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.98.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.98.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.99.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.99.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.experts.99.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.gate.e_score_correction_bias": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.gate.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.shared_experts.down_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.shared_experts.gate_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.mlp.shared_experts.up_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.post_attention_layernorm.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.k_norm.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.k_proj.bias": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.k_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.o_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.q_norm.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.q_proj.bias": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.q_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.v_proj.bias": "model-00034-of-00092.safetensors",
+ "model.layers.33.self_attn.v_proj.weight": "model-00034-of-00092.safetensors",
+ "model.layers.34.input_layernorm.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.0.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.0.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.0.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.1.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.1.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.1.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.10.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.10.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.10.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.100.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.100.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.100.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.101.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.101.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.101.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.102.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.102.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.102.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.103.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.103.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.103.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.104.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.104.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.104.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.105.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.105.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.105.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.106.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.106.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.106.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.107.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.107.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.107.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.108.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.108.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.108.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.109.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.109.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.109.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.11.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.11.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.11.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.110.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.110.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.110.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.111.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.111.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.111.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.112.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.112.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.112.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.113.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.113.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.113.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.114.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.114.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.114.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.115.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.115.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.115.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.116.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.116.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.116.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.117.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.117.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.117.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.118.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.118.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.118.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.119.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.119.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.119.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.12.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.12.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.12.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.120.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.120.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.120.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.121.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.121.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.121.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.122.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.122.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.122.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.123.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.123.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.123.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.124.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.124.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.124.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.125.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.125.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.125.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.126.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.126.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.126.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.127.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.127.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.127.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.128.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.128.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.128.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.129.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.129.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.129.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.13.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.13.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.13.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.130.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.130.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.130.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.131.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.131.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.131.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.132.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.132.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.132.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.133.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.133.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.133.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.134.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.134.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.134.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.135.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.135.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.135.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.136.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.136.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.136.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.137.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.137.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.137.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.138.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.138.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.138.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.139.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.139.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.139.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.14.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.14.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.14.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.140.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.140.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.140.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.141.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.141.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.141.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.142.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.142.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.142.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.143.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.143.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.143.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.144.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.144.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.144.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.145.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.145.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.145.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.146.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.146.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.146.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.147.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.147.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.147.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.148.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.148.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.148.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.149.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.149.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.149.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.15.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.15.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.15.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.150.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.150.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.150.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.151.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.151.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.151.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.152.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.152.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.152.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.153.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.153.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.153.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.154.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.154.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.154.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.155.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.155.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.155.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.156.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.156.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.156.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.157.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.157.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.157.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.158.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.158.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.158.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.159.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.159.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.159.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.16.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.16.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.16.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.17.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.17.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.17.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.18.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.18.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.18.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.19.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.19.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.19.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.2.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.2.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.2.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.20.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.20.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.20.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.21.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.21.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.21.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.22.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.22.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.22.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.23.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.23.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.23.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.24.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.24.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.24.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.25.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.25.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.25.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.26.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.26.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.26.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.27.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.27.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.27.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.28.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.28.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.28.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.29.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.29.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.29.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.3.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.3.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.3.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.30.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.30.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.30.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.31.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.31.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.31.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.32.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.32.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.32.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.33.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.33.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.33.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.34.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.34.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.34.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.35.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.35.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.35.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.36.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.36.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.36.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.37.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.37.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.37.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.38.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.38.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.38.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.39.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.39.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.39.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.4.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.4.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.4.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.40.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.40.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.40.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.41.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.41.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.41.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.42.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.42.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.42.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.43.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.43.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.43.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.44.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.44.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.44.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.45.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.45.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.45.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.46.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.46.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.46.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.47.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.47.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.47.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.48.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.48.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.48.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.49.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.49.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.49.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.5.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.5.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.5.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.50.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.50.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.50.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.51.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.51.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.51.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.52.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.52.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.52.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.53.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.53.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.53.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.54.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.54.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.54.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.55.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.55.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.55.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.56.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.56.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.56.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.57.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.57.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.57.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.58.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.58.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.58.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.59.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.59.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.59.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.6.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.6.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.6.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.60.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.60.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.60.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.61.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.61.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.61.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.62.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.62.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.62.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.63.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.63.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.63.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.64.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.64.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.64.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.65.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.65.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.65.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.66.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.66.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.66.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.67.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.67.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.67.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.68.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.68.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.68.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.69.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.69.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.69.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.7.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.7.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.7.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.70.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.70.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.70.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.71.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.71.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.71.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.72.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.72.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.72.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.73.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.73.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.73.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.74.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.74.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.74.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.75.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.75.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.75.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.76.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.76.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.76.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.77.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.77.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.77.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.78.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.78.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.78.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.79.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.79.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.79.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.8.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.8.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.8.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.80.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.80.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.80.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.81.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.81.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.81.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.82.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.82.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.82.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.83.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.83.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.83.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.84.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.84.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.84.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.85.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.85.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.85.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.86.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.86.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.86.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.87.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.87.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.87.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.88.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.88.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.88.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.89.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.89.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.89.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.9.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.9.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.9.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.90.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.90.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.90.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.91.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.91.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.91.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.92.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.92.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.92.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.93.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.93.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.93.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.94.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.94.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.94.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.95.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.95.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.95.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.96.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.96.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.96.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.97.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.97.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.97.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.98.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.98.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.98.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.99.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.99.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.experts.99.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.gate.e_score_correction_bias": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.gate.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.shared_experts.down_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.shared_experts.gate_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.mlp.shared_experts.up_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.post_attention_layernorm.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.k_norm.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.k_proj.bias": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.k_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.o_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.q_norm.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.q_proj.bias": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.q_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.v_proj.bias": "model-00035-of-00092.safetensors",
+ "model.layers.34.self_attn.v_proj.weight": "model-00035-of-00092.safetensors",
+ "model.layers.35.input_layernorm.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.0.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.0.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.0.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.1.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.1.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.1.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.10.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.10.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.10.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.100.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.100.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.100.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.101.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.101.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.101.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.102.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.102.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.102.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.103.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.103.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.103.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.104.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.104.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.104.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.105.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.105.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.105.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.106.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.106.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.106.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.107.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.107.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.107.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.108.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.108.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.108.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.109.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.109.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.109.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.11.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.11.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.11.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.110.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.110.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.110.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.111.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.111.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.111.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.112.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.112.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.112.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.113.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.113.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.113.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.114.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.114.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.114.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.115.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.115.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.115.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.116.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.116.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.116.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.117.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.117.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.117.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.118.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.118.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.118.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.119.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.119.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.119.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.12.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.12.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.12.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.120.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.120.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.120.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.121.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.121.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.121.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.122.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.122.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.122.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.123.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.123.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.123.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.124.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.124.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.124.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.125.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.125.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.125.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.126.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.126.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.126.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.127.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.127.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.127.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.128.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.128.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.128.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.129.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.129.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.129.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.13.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.13.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.13.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.130.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.130.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.130.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.131.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.131.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.131.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.132.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.132.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.132.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.133.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.133.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.133.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.134.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.134.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.134.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.135.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.135.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.135.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.136.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.136.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.136.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.137.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.137.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.137.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.138.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.138.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.138.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.139.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.139.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.139.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.14.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.14.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.14.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.140.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.140.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.140.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.141.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.141.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.141.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.142.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.142.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.142.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.143.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.143.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.143.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.144.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.144.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.144.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.145.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.145.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.145.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.146.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.146.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.146.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.147.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.147.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.147.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.148.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.148.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.148.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.149.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.149.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.149.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.15.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.15.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.15.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.150.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.150.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.150.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.151.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.151.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.151.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.152.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.152.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.152.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.153.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.153.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.153.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.154.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.154.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.154.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.155.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.155.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.155.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.156.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.156.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.156.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.157.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.157.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.157.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.158.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.158.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.158.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.159.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.159.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.159.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.16.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.16.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.16.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.17.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.17.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.17.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.18.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.18.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.18.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.19.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.19.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.19.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.2.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.2.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.2.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.20.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.20.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.20.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.21.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.21.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.21.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.22.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.22.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.22.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.23.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.23.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.23.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.24.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.24.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.24.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.25.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.25.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.25.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.26.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.26.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.26.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.27.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.27.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.27.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.28.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.28.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.28.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.29.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.29.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.29.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.3.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.3.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.3.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.30.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.30.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.30.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.31.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.31.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.31.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.32.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.32.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.32.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.33.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.33.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.33.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.34.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.34.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.34.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.35.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.35.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.35.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.36.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.36.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.36.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.37.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.37.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.37.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.38.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.38.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.38.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.39.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.39.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.39.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.4.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.4.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.4.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.40.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.40.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.40.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.41.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.41.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.41.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.42.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.42.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.42.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.43.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.43.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.43.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.44.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.44.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.44.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.45.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.45.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.45.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.46.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.46.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.46.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.47.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.47.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.47.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.48.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.48.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.48.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.49.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.49.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.49.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.5.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.5.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.5.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.50.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.50.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.50.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.51.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.51.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.51.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.52.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.52.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.52.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.53.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.53.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.53.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.54.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.54.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.54.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.55.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.55.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.55.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.56.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.56.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.56.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.57.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.57.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.57.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.58.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.58.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.58.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.59.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.59.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.59.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.6.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.6.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.6.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.60.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.60.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.60.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.61.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.61.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.61.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.62.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.62.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.62.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.63.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.63.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.63.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.64.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.64.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.64.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.65.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.65.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.65.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.66.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.66.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.66.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.67.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.67.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.67.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.68.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.68.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.68.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.69.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.69.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.69.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.7.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.7.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.7.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.70.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.70.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.70.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.71.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.71.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.71.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.72.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.72.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.72.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.73.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.73.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.73.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.74.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.74.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.74.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.75.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.75.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.75.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.76.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.76.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.76.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.77.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.77.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.77.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.78.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.78.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.78.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.79.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.79.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.79.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.8.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.8.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.8.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.80.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.80.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.80.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.81.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.81.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.81.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.82.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.82.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.82.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.83.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.83.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.83.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.84.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.84.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.84.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.85.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.85.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.85.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.86.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.86.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.86.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.87.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.87.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.87.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.88.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.88.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.88.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.89.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.89.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.89.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.9.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.9.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.9.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.90.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.90.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.90.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.91.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.91.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.91.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.92.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.92.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.92.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.93.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.93.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.93.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.94.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.94.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.94.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.95.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.95.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.95.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.96.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.96.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.96.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.97.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.97.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.97.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.98.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.98.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.98.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.99.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.99.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.experts.99.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.gate.e_score_correction_bias": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.gate.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.shared_experts.down_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.shared_experts.gate_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.mlp.shared_experts.up_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.post_attention_layernorm.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.k_norm.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.k_proj.bias": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.k_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.o_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.q_norm.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.q_proj.bias": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.q_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.v_proj.bias": "model-00036-of-00092.safetensors",
+ "model.layers.35.self_attn.v_proj.weight": "model-00036-of-00092.safetensors",
+ "model.layers.36.input_layernorm.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.0.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.0.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.0.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.1.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.1.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.1.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.10.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.10.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.10.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.100.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.100.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.100.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.101.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.101.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.101.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.102.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.102.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.102.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.103.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.103.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.103.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.104.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.104.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.104.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.105.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.105.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.105.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.106.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.106.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.106.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.107.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.107.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.107.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.108.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.108.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.108.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.109.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.109.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.109.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.11.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.11.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.11.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.110.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.110.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.110.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.111.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.111.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.111.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.112.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.112.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.112.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.113.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.113.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.113.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.114.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.114.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.114.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.115.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.115.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.115.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.116.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.116.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.116.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.117.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.117.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.117.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.118.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.118.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.118.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.119.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.119.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.119.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.12.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.12.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.12.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.120.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.120.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.120.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.121.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.121.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.121.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.122.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.122.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.122.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.123.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.123.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.123.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.124.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.124.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.124.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.125.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.125.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.125.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.126.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.126.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.126.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.127.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.127.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.127.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.128.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.128.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.128.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.129.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.129.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.129.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.13.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.13.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.13.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.130.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.130.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.130.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.131.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.131.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.131.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.132.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.132.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.132.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.133.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.133.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.133.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.134.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.134.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.134.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.135.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.135.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.135.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.136.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.136.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.136.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.137.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.137.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.137.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.138.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.138.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.138.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.139.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.139.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.139.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.14.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.14.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.14.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.140.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.140.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.140.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.141.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.141.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.141.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.142.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.142.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.142.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.143.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.143.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.143.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.144.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.144.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.144.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.145.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.145.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.145.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.146.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.146.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.146.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.147.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.147.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.147.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.148.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.148.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.148.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.149.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.149.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.149.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.15.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.15.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.15.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.150.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.150.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.150.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.151.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.151.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.151.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.152.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.152.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.152.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.153.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.153.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.153.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.154.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.154.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.154.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.155.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.155.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.155.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.156.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.156.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.156.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.157.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.157.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.157.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.158.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.158.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.158.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.159.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.159.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.159.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.16.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.16.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.16.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.17.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.17.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.17.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.18.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.18.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.18.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.19.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.19.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.19.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.2.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.2.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.2.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.20.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.20.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.20.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.21.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.21.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.21.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.22.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.22.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.22.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.23.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.23.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.23.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.24.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.24.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.24.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.25.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.25.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.25.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.26.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.26.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.26.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.27.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.27.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.27.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.28.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.28.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.28.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.29.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.29.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.29.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.3.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.3.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.3.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.30.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.30.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.30.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.31.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.31.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.31.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.32.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.32.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.32.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.33.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.33.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.33.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.34.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.34.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.34.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.35.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.35.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.35.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.36.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.36.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.36.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.37.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.37.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.37.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.38.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.38.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.38.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.39.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.39.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.39.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.4.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.4.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.4.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.40.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.40.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.40.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.41.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.41.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.41.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.42.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.42.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.42.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.43.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.43.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.43.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.44.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.44.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.44.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.45.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.45.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.45.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.46.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.46.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.46.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.47.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.47.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.47.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.48.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.48.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.48.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.49.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.49.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.49.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.5.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.5.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.5.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.50.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.50.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.50.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.51.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.51.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.51.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.52.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.52.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.52.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.53.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.53.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.53.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.54.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.54.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.54.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.55.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.55.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.55.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.56.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.56.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.56.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.57.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.57.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.57.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.58.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.58.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.58.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.59.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.59.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.59.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.6.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.6.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.6.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.60.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.60.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.60.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.61.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.61.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.61.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.62.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.62.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.62.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.63.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.63.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.63.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.64.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.64.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.64.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.65.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.65.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.65.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.66.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.66.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.66.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.67.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.67.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.67.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.68.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.68.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.68.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.69.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.69.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.69.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.7.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.7.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.7.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.70.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.70.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.70.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.71.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.71.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.71.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.72.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.72.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.72.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.73.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.73.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.73.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.74.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.74.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.74.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.75.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.75.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.75.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.76.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.76.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.76.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.77.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.77.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.77.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.78.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.78.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.78.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.79.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.79.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.79.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.8.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.8.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.8.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.80.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.80.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.80.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.81.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.81.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.81.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.82.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.82.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.82.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.83.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.83.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.83.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.84.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.84.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.84.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.85.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.85.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.85.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.86.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.86.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.86.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.87.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.87.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.87.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.88.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.88.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.88.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.89.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.89.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.89.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.9.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.9.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.9.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.90.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.90.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.90.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.91.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.91.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.91.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.92.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.92.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.92.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.93.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.93.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.93.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.94.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.94.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.94.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.95.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.95.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.95.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.96.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.96.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.96.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.97.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.97.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.97.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.98.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.98.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.98.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.99.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.99.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.experts.99.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.gate.e_score_correction_bias": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.gate.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.shared_experts.down_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.shared_experts.gate_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.mlp.shared_experts.up_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.post_attention_layernorm.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.k_norm.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.k_proj.bias": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.k_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.o_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.q_norm.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.q_proj.bias": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.q_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.v_proj.bias": "model-00037-of-00092.safetensors",
+ "model.layers.36.self_attn.v_proj.weight": "model-00037-of-00092.safetensors",
+ "model.layers.37.input_layernorm.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.0.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.0.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.0.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.1.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.1.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.1.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.10.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.10.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.10.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.100.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.100.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.100.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.101.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.101.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.101.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.102.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.102.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.102.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.103.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.103.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.103.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.104.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.104.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.104.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.105.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.105.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.105.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.106.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.106.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.106.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.107.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.107.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.107.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.108.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.108.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.108.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.109.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.109.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.109.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.11.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.11.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.11.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.110.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.110.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.110.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.111.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.111.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.111.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.112.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.112.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.112.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.113.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.113.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.113.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.114.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.114.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.114.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.115.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.115.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.115.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.116.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.116.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.116.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.117.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.117.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.117.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.118.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.118.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.118.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.119.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.119.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.119.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.12.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.12.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.12.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.120.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.120.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.120.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.121.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.121.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.121.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.122.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.122.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.122.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.123.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.123.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.123.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.124.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.124.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.124.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.125.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.125.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.125.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.126.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.126.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.126.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.127.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.127.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.127.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.128.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.128.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.128.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.129.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.129.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.129.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.13.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.13.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.13.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.130.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.130.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.130.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.131.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.131.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.131.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.132.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.132.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.132.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.133.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.133.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.133.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.134.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.134.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.134.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.135.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.135.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.135.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.136.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.136.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.136.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.137.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.137.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.137.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.138.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.138.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.138.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.139.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.139.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.139.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.14.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.14.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.14.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.140.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.140.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.140.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.141.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.141.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.141.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.142.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.142.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.142.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.143.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.143.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.143.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.144.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.144.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.144.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.145.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.145.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.145.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.146.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.146.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.146.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.147.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.147.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.147.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.148.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.148.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.148.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.149.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.149.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.149.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.15.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.15.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.15.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.150.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.150.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.150.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.151.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.151.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.151.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.152.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.152.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.152.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.153.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.153.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.153.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.154.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.154.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.154.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.155.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.155.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.155.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.156.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.156.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.156.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.157.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.157.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.157.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.158.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.158.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.158.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.159.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.159.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.159.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.16.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.16.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.16.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.17.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.17.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.17.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.18.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.18.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.18.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.19.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.19.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.19.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.2.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.2.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.2.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.20.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.20.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.20.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.21.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.21.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.21.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.22.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.22.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.22.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.23.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.23.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.23.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.24.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.24.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.24.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.25.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.25.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.25.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.26.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.26.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.26.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.27.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.27.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.27.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.28.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.28.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.28.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.29.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.29.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.29.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.3.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.3.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.3.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.30.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.30.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.30.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.31.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.31.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.31.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.32.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.32.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.32.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.33.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.33.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.33.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.34.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.34.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.34.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.35.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.35.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.35.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.36.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.36.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.36.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.37.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.37.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.37.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.38.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.38.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.38.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.39.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.39.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.39.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.4.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.4.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.4.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.40.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.40.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.40.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.41.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.41.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.41.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.42.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.42.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.42.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.43.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.43.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.43.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.44.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.44.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.44.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.45.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.45.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.45.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.46.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.46.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.46.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.47.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.47.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.47.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.48.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.48.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.48.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.49.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.49.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.49.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.5.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.5.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.5.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.50.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.50.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.50.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.51.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.51.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.51.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.52.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.52.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.52.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.53.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.53.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.53.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.54.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.54.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.54.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.55.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.55.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.55.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.56.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.56.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.56.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.57.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.57.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.57.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.58.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.58.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.58.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.59.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.59.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.59.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.6.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.6.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.6.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.60.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.60.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.60.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.61.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.61.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.61.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.62.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.62.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.62.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.63.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.63.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.63.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.64.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.64.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.64.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.65.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.65.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.65.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.66.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.66.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.66.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.67.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.67.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.67.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.68.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.68.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.68.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.69.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.69.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.69.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.7.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.7.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.7.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.70.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.70.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.70.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.71.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.71.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.71.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.72.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.72.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.72.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.73.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.73.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.73.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.74.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.74.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.74.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.75.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.75.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.75.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.76.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.76.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.76.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.77.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.77.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.77.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.78.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.78.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.78.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.79.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.79.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.79.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.8.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.8.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.8.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.80.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.80.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.80.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.81.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.81.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.81.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.82.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.82.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.82.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.83.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.83.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.83.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.84.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.84.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.84.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.85.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.85.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.85.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.86.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.86.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.86.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.87.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.87.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.87.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.88.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.88.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.88.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.89.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.89.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.89.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.9.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.9.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.9.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.90.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.90.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.90.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.91.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.91.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.91.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.92.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.92.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.92.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.93.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.93.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.93.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.94.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.94.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.94.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.95.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.95.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.95.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.96.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.96.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.96.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.97.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.97.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.97.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.98.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.98.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.98.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.99.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.99.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.experts.99.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.gate.e_score_correction_bias": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.gate.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.shared_experts.down_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.shared_experts.gate_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.mlp.shared_experts.up_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.post_attention_layernorm.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.k_norm.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.k_proj.bias": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.k_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.o_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.q_norm.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.q_proj.bias": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.q_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.v_proj.bias": "model-00038-of-00092.safetensors",
+ "model.layers.37.self_attn.v_proj.weight": "model-00038-of-00092.safetensors",
+ "model.layers.38.input_layernorm.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.0.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.0.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.0.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.1.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.1.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.1.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.10.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.10.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.10.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.100.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.100.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.100.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.101.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.101.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.101.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.102.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.102.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.102.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.103.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.103.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.103.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.104.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.104.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.104.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.105.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.105.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.105.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.106.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.106.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.106.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.107.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.107.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.107.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.108.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.108.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.108.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.109.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.109.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.109.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.11.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.11.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.11.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.110.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.110.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.110.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.111.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.111.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.111.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.112.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.112.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.112.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.113.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.113.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.113.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.114.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.114.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.114.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.115.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.115.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.115.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.116.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.116.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.116.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.117.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.117.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.117.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.118.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.118.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.118.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.119.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.119.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.119.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.12.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.12.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.12.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.120.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.120.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.120.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.121.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.121.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.121.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.122.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.122.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.122.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.123.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.123.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.123.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.124.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.124.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.124.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.125.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.125.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.125.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.126.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.126.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.126.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.127.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.127.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.127.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.128.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.128.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.128.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.129.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.129.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.129.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.13.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.13.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.13.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.130.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.130.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.130.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.131.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.131.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.131.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.132.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.132.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.132.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.133.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.133.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.133.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.134.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.134.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.134.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.135.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.135.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.135.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.136.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.136.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.136.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.137.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.137.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.137.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.138.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.138.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.138.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.139.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.139.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.139.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.14.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.14.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.14.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.140.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.140.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.140.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.141.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.141.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.141.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.142.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.142.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.142.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.143.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.143.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.143.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.144.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.144.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.144.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.145.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.145.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.145.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.146.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.146.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.146.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.147.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.147.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.147.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.148.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.148.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.148.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.149.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.149.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.149.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.15.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.15.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.15.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.150.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.150.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.150.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.151.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.151.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.151.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.152.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.152.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.152.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.153.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.153.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.153.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.154.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.154.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.154.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.155.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.155.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.155.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.156.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.156.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.156.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.157.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.157.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.157.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.158.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.158.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.158.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.159.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.159.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.159.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.16.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.16.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.16.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.17.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.17.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.17.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.18.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.18.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.18.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.19.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.19.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.19.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.2.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.2.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.2.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.20.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.20.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.20.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.21.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.21.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.21.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.22.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.22.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.22.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.23.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.23.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.23.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.24.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.24.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.24.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.25.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.25.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.25.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.26.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.26.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.26.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.27.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.27.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.27.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.28.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.28.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.28.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.29.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.29.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.29.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.3.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.3.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.3.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.30.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.30.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.30.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.31.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.31.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.31.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.32.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.32.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.32.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.33.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.33.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.33.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.34.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.34.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.34.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.35.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.35.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.35.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.36.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.36.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.36.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.37.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.37.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.37.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.38.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.38.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.38.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.39.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.39.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.39.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.4.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.4.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.4.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.40.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.40.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.40.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.41.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.41.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.41.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.42.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.42.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.42.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.43.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.43.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.43.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.44.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.44.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.44.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.45.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.45.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.45.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.46.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.46.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.46.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.47.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.47.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.47.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.48.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.48.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.48.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.49.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.49.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.49.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.5.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.5.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.5.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.50.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.50.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.50.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.51.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.51.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.51.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.52.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.52.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.52.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.53.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.53.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.53.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.54.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.54.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.54.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.55.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.55.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.55.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.56.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.56.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.56.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.57.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.57.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.57.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.58.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.58.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.58.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.59.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.59.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.59.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.6.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.6.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.6.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.60.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.60.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.60.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.61.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.61.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.61.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.62.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.62.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.62.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.63.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.63.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.63.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.64.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.64.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.64.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.65.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.65.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.65.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.66.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.66.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.66.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.67.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.67.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.67.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.68.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.68.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.68.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.69.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.69.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.69.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.7.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.7.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.7.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.70.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.70.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.70.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.71.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.71.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.71.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.72.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.72.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.72.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.73.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.73.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.73.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.74.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.74.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.74.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.75.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.75.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.75.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.76.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.76.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.76.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.77.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.77.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.77.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.78.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.78.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.78.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.79.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.79.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.79.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.8.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.8.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.8.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.80.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.80.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.80.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.81.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.81.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.81.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.82.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.82.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.82.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.83.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.83.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.83.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.84.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.84.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.84.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.85.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.85.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.85.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.86.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.86.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.86.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.87.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.87.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.87.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.88.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.88.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.88.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.89.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.89.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.89.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.9.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.9.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.9.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.90.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.90.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.90.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.91.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.91.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.91.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.92.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.92.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.92.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.93.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.93.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.93.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.94.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.94.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.94.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.95.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.95.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.95.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.96.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.96.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.96.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.97.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.97.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.97.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.98.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.98.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.98.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.99.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.99.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.experts.99.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.gate.e_score_correction_bias": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.gate.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.shared_experts.down_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.shared_experts.gate_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.mlp.shared_experts.up_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.post_attention_layernorm.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.k_norm.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.k_proj.bias": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.k_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.o_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.q_norm.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.q_proj.bias": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.q_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.v_proj.bias": "model-00039-of-00092.safetensors",
+ "model.layers.38.self_attn.v_proj.weight": "model-00039-of-00092.safetensors",
+ "model.layers.39.input_layernorm.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.0.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.0.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.0.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.1.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.1.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.1.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.10.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.10.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.10.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.100.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.100.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.100.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.101.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.101.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.101.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.102.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.102.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.102.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.103.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.103.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.103.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.104.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.104.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.104.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.105.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.105.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.105.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.106.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.106.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.106.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.107.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.107.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.107.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.108.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.108.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.108.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.109.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.109.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.109.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.11.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.11.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.11.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.110.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.110.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.110.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.111.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.111.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.111.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.112.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.112.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.112.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.113.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.113.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.113.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.114.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.114.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.114.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.115.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.115.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.115.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.116.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.116.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.116.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.117.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.117.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.117.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.118.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.118.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.118.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.119.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.119.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.119.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.12.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.12.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.12.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.120.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.120.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.120.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.121.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.121.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.121.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.122.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.122.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.122.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.123.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.123.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.123.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.124.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.124.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.124.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.125.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.125.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.125.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.126.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.126.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.126.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.127.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.127.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.127.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.128.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.128.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.128.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.129.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.129.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.129.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.13.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.13.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.13.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.130.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.130.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.130.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.131.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.131.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.131.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.132.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.132.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.132.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.133.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.133.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.133.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.134.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.134.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.134.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.135.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.135.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.135.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.136.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.136.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.136.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.137.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.137.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.137.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.138.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.138.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.138.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.139.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.139.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.139.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.14.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.14.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.14.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.140.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.140.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.140.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.141.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.141.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.141.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.142.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.142.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.142.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.143.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.143.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.143.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.144.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.144.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.144.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.145.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.145.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.145.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.146.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.146.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.146.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.147.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.147.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.147.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.148.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.148.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.148.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.149.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.149.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.149.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.15.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.15.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.15.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.150.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.150.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.150.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.151.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.151.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.151.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.152.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.152.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.152.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.153.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.153.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.153.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.154.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.154.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.154.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.155.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.155.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.155.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.156.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.156.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.156.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.157.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.157.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.157.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.158.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.158.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.158.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.159.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.159.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.159.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.16.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.16.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.16.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.17.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.17.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.17.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.18.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.18.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.18.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.19.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.19.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.19.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.2.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.2.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.2.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.20.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.20.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.20.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.21.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.21.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.21.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.22.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.22.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.22.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.23.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.23.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.23.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.24.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.24.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.24.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.25.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.25.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.25.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.26.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.26.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.26.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.27.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.27.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.27.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.28.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.28.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.28.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.29.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.29.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.29.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.3.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.3.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.3.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.30.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.30.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.30.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.31.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.31.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.31.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.32.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.32.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.32.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.33.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.33.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.33.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.34.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.34.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.34.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.35.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.35.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.35.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.36.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.36.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.36.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.37.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.37.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.37.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.38.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.38.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.38.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.39.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.39.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.39.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.4.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.4.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.4.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.40.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.40.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.40.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.41.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.41.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.41.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.42.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.42.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.42.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.43.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.43.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.43.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.44.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.44.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.44.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.45.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.45.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.45.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.46.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.46.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.46.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.47.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.47.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.47.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.48.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.48.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.48.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.49.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.49.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.49.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.5.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.5.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.5.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.50.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.50.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.50.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.51.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.51.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.51.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.52.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.52.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.52.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.53.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.53.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.53.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.54.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.54.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.54.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.55.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.55.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.55.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.56.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.56.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.56.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.57.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.57.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.57.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.58.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.58.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.58.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.59.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.59.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.59.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.6.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.6.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.6.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.60.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.60.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.60.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.61.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.61.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.61.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.62.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.62.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.62.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.63.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.63.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.63.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.64.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.64.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.64.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.65.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.65.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.65.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.66.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.66.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.66.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.67.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.67.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.67.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.68.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.68.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.68.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.69.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.69.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.69.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.7.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.7.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.7.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.70.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.70.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.70.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.71.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.71.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.71.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.72.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.72.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.72.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.73.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.73.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.73.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.74.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.74.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.74.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.75.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.75.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.75.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.76.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.76.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.76.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.77.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.77.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.77.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.78.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.78.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.78.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.79.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.79.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.79.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.8.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.8.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.8.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.80.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.80.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.80.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.81.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.81.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.81.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.82.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.82.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.82.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.83.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.83.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.83.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.84.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.84.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.84.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.85.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.85.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.85.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.86.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.86.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.86.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.87.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.87.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.87.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.88.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.88.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.88.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.89.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.89.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.89.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.9.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.9.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.9.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.90.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.90.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.90.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.91.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.91.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.91.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.92.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.92.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.92.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.93.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.93.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.93.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.94.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.94.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.94.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.95.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.95.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.95.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.96.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.96.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.96.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.97.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.97.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.97.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.98.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.98.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.98.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.99.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.99.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.experts.99.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.gate.e_score_correction_bias": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.gate.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.shared_experts.down_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.shared_experts.gate_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.mlp.shared_experts.up_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.post_attention_layernorm.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.k_norm.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.k_proj.bias": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.k_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.o_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.q_norm.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.q_proj.bias": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.q_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.v_proj.bias": "model-00040-of-00092.safetensors",
+ "model.layers.39.self_attn.v_proj.weight": "model-00040-of-00092.safetensors",
+ "model.layers.40.input_layernorm.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.0.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.0.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.0.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.1.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.1.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.1.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.10.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.10.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.10.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.100.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.100.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.100.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.101.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.101.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.101.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.102.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.102.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.102.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.103.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.103.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.103.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.104.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.104.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.104.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.105.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.105.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.105.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.106.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.106.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.106.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.107.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.107.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.107.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.108.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.108.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.108.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.109.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.109.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.109.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.11.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.11.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.11.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.110.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.110.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.110.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.111.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.111.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.111.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.112.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.112.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.112.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.113.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.113.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.113.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.114.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.114.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.114.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.115.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.115.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.115.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.116.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.116.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.116.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.117.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.117.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.117.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.118.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.118.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.118.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.119.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.119.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.119.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.12.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.12.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.12.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.120.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.120.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.120.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.121.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.121.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.121.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.122.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.122.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.122.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.123.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.123.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.123.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.124.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.124.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.124.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.125.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.125.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.125.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.126.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.126.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.126.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.127.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.127.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.127.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.128.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.128.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.128.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.129.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.129.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.129.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.13.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.13.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.13.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.130.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.130.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.130.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.131.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.131.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.131.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.132.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.132.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.132.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.133.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.133.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.133.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.134.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.134.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.134.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.135.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.135.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.135.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.136.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.136.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.136.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.137.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.137.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.137.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.138.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.138.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.138.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.139.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.139.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.139.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.14.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.14.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.14.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.140.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.140.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.140.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.141.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.141.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.141.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.142.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.142.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.142.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.143.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.143.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.143.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.144.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.144.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.144.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.145.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.145.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.145.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.146.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.146.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.146.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.147.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.147.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.147.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.148.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.148.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.148.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.149.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.149.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.149.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.15.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.15.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.15.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.150.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.150.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.150.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.151.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.151.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.151.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.152.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.152.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.152.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.153.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.153.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.153.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.154.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.154.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.154.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.155.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.155.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.155.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.156.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.156.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.156.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.157.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.157.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.157.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.158.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.158.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.158.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.159.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.159.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.159.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.16.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.16.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.16.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.17.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.17.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.17.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.18.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.18.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.18.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.19.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.19.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.19.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.2.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.2.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.2.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.20.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.20.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.20.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.21.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.21.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.21.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.22.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.22.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.22.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.23.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.23.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.23.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.24.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.24.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.24.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.25.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.25.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.25.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.26.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.26.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.26.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.27.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.27.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.27.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.28.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.28.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.28.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.29.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.29.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.29.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.3.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.3.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.3.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.30.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.30.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.30.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.31.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.31.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.31.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.32.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.32.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.32.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.33.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.33.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.33.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.34.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.34.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.34.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.35.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.35.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.35.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.36.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.36.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.36.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.37.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.37.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.37.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.38.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.38.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.38.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.39.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.39.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.39.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.4.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.4.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.4.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.40.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.40.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.40.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.41.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.41.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.41.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.42.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.42.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.42.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.43.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.43.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.43.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.44.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.44.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.44.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.45.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.45.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.45.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.46.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.46.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.46.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.47.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.47.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.47.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.48.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.48.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.48.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.49.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.49.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.49.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.5.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.5.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.5.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.50.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.50.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.50.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.51.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.51.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.51.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.52.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.52.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.52.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.53.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.53.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.53.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.54.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.54.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.54.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.55.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.55.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.55.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.56.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.56.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.56.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.57.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.57.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.57.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.58.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.58.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.58.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.59.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.59.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.59.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.6.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.6.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.6.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.60.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.60.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.60.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.61.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.61.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.61.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.62.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.62.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.62.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.63.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.63.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.63.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.64.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.64.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.64.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.65.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.65.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.65.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.66.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.66.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.66.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.67.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.67.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.67.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.68.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.68.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.68.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.69.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.69.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.69.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.7.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.7.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.7.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.70.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.70.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.70.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.71.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.71.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.71.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.72.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.72.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.72.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.73.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.73.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.73.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.74.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.74.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.74.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.75.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.75.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.75.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.76.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.76.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.76.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.77.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.77.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.77.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.78.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.78.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.78.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.79.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.79.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.79.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.8.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.8.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.8.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.80.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.80.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.80.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.81.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.81.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.81.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.82.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.82.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.82.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.83.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.83.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.83.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.84.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.84.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.84.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.85.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.85.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.85.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.86.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.86.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.86.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.87.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.87.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.87.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.88.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.88.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.88.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.89.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.89.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.89.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.9.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.9.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.9.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.90.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.90.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.90.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.91.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.91.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.91.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.92.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.92.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.92.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.93.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.93.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.93.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.94.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.94.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.94.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.95.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.95.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.95.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.96.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.96.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.96.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.97.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.97.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.97.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.98.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.98.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.98.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.99.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.99.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.experts.99.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.gate.e_score_correction_bias": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.gate.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.shared_experts.down_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.shared_experts.gate_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.mlp.shared_experts.up_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.post_attention_layernorm.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.k_norm.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.k_proj.bias": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.k_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.o_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.q_norm.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.q_proj.bias": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.q_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.v_proj.bias": "model-00041-of-00092.safetensors",
+ "model.layers.40.self_attn.v_proj.weight": "model-00041-of-00092.safetensors",
+ "model.layers.41.input_layernorm.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.0.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.0.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.0.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.1.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.1.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.1.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.10.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.10.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.10.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.100.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.100.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.100.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.101.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.101.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.101.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.102.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.102.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.102.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.103.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.103.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.103.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.104.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.104.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.104.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.105.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.105.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.105.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.106.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.106.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.106.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.107.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.107.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.107.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.108.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.108.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.108.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.109.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.109.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.109.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.11.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.11.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.11.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.110.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.110.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.110.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.111.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.111.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.111.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.112.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.112.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.112.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.113.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.113.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.113.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.114.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.114.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.114.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.115.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.115.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.115.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.116.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.116.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.116.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.117.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.117.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.117.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.118.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.118.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.118.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.119.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.119.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.119.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.12.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.12.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.12.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.120.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.120.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.120.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.121.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.121.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.121.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.122.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.122.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.122.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.123.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.123.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.123.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.124.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.124.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.124.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.125.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.125.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.125.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.126.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.126.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.126.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.127.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.127.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.127.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.128.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.128.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.128.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.129.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.129.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.129.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.13.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.13.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.13.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.130.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.130.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.130.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.131.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.131.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.131.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.132.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.132.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.132.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.133.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.133.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.133.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.134.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.134.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.134.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.135.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.135.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.135.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.136.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.136.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.136.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.137.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.137.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.137.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.138.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.138.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.138.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.139.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.139.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.139.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.14.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.14.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.14.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.140.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.140.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.140.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.141.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.141.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.141.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.142.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.142.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.142.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.143.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.143.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.143.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.144.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.144.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.144.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.145.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.145.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.145.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.146.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.146.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.146.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.147.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.147.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.147.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.148.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.148.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.148.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.149.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.149.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.149.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.15.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.15.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.15.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.150.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.150.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.150.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.151.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.151.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.151.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.152.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.152.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.152.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.153.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.153.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.153.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.154.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.154.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.154.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.155.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.155.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.155.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.156.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.156.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.156.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.157.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.157.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.157.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.158.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.158.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.158.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.159.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.159.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.159.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.16.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.16.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.16.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.17.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.17.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.17.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.18.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.18.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.18.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.19.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.19.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.19.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.2.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.2.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.2.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.20.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.20.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.20.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.21.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.21.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.21.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.22.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.22.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.22.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.23.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.23.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.23.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.24.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.24.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.24.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.25.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.25.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.25.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.26.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.26.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.26.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.27.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.27.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.27.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.28.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.28.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.28.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.29.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.29.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.29.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.3.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.3.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.3.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.30.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.30.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.30.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.31.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.31.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.31.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.32.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.32.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.32.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.33.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.33.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.33.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.34.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.34.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.34.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.35.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.35.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.35.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.36.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.36.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.36.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.37.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.37.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.37.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.38.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.38.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.38.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.39.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.39.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.39.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.4.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.4.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.4.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.40.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.40.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.40.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.41.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.41.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.41.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.42.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.42.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.42.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.43.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.43.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.43.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.44.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.44.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.44.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.45.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.45.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.45.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.46.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.46.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.46.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.47.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.47.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.47.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.48.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.48.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.48.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.49.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.49.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.49.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.5.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.5.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.5.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.50.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.50.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.50.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.51.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.51.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.51.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.52.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.52.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.52.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.53.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.53.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.53.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.54.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.54.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.54.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.55.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.55.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.55.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.56.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.56.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.56.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.57.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.57.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.57.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.58.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.58.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.58.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.59.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.59.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.59.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.6.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.6.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.6.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.60.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.60.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.60.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.61.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.61.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.61.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.62.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.62.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.62.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.63.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.63.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.63.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.64.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.64.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.64.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.65.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.65.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.65.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.66.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.66.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.66.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.67.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.67.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.67.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.68.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.68.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.68.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.69.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.69.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.69.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.7.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.7.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.7.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.70.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.70.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.70.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.71.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.71.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.71.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.72.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.72.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.72.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.73.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.73.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.73.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.74.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.74.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.74.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.75.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.75.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.75.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.76.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.76.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.76.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.77.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.77.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.77.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.78.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.78.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.78.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.79.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.79.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.79.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.8.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.8.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.8.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.80.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.80.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.80.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.81.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.81.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.81.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.82.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.82.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.82.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.83.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.83.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.83.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.84.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.84.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.84.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.85.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.85.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.85.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.86.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.86.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.86.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.87.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.87.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.87.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.88.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.88.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.88.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.89.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.89.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.89.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.9.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.9.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.9.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.90.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.90.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.90.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.91.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.91.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.91.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.92.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.92.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.92.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.93.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.93.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.93.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.94.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.94.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.94.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.95.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.95.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.95.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.96.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.96.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.96.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.97.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.97.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.97.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.98.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.98.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.98.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.99.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.99.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.experts.99.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.gate.e_score_correction_bias": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.gate.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.shared_experts.down_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.shared_experts.gate_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.mlp.shared_experts.up_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.post_attention_layernorm.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.k_norm.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.k_proj.bias": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.k_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.o_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.q_norm.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.q_proj.bias": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.q_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.v_proj.bias": "model-00042-of-00092.safetensors",
+ "model.layers.41.self_attn.v_proj.weight": "model-00042-of-00092.safetensors",
+ "model.layers.42.input_layernorm.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.0.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.0.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.0.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.1.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.1.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.1.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.10.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.10.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.10.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.100.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.100.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.100.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.101.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.101.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.101.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.102.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.102.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.102.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.103.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.103.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.103.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.104.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.104.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.104.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.105.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.105.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.105.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.106.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.106.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.106.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.107.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.107.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.107.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.108.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.108.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.108.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.109.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.109.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.109.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.11.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.11.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.11.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.110.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.110.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.110.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.111.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.111.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.111.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.112.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.112.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.112.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.113.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.113.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.113.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.114.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.114.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.114.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.115.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.115.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.115.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.116.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.116.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.116.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.117.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.117.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.117.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.118.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.118.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.118.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.119.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.119.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.119.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.12.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.12.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.12.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.120.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.120.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.120.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.121.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.121.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.121.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.122.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.122.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.122.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.123.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.123.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.123.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.124.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.124.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.124.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.125.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.125.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.125.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.126.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.126.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.126.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.127.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.127.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.127.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.128.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.128.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.128.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.129.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.129.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.129.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.13.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.13.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.13.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.130.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.130.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.130.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.131.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.131.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.131.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.132.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.132.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.132.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.133.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.133.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.133.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.134.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.134.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.134.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.135.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.135.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.135.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.136.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.136.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.136.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.137.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.137.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.137.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.138.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.138.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.138.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.139.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.139.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.139.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.14.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.14.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.14.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.140.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.140.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.140.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.141.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.141.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.141.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.142.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.142.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.142.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.143.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.143.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.143.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.144.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.144.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.144.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.145.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.145.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.145.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.146.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.146.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.146.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.147.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.147.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.147.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.148.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.148.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.148.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.149.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.149.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.149.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.15.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.15.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.15.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.150.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.150.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.150.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.151.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.151.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.151.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.152.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.152.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.152.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.153.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.153.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.153.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.154.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.154.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.154.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.155.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.155.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.155.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.156.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.156.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.156.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.157.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.157.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.157.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.158.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.158.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.158.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.159.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.159.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.159.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.16.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.16.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.16.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.17.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.17.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.17.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.18.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.18.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.18.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.19.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.19.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.19.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.2.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.2.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.2.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.20.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.20.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.20.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.21.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.21.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.21.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.22.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.22.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.22.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.23.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.23.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.23.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.24.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.24.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.24.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.25.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.25.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.25.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.26.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.26.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.26.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.27.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.27.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.27.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.28.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.28.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.28.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.29.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.29.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.29.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.3.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.3.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.3.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.30.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.30.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.30.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.31.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.31.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.31.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.32.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.32.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.32.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.33.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.33.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.33.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.34.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.34.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.34.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.35.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.35.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.35.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.36.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.36.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.36.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.37.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.37.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.37.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.38.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.38.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.38.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.39.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.39.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.39.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.4.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.4.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.4.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.40.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.40.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.40.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.41.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.41.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.41.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.42.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.42.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.42.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.43.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.43.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.43.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.44.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.44.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.44.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.45.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.45.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.45.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.46.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.46.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.46.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.47.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.47.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.47.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.48.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.48.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.48.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.49.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.49.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.49.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.5.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.5.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.5.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.50.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.50.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.50.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.51.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.51.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.51.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.52.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.52.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.52.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.53.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.53.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.53.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.54.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.54.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.54.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.55.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.55.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.55.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.56.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.56.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.56.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.57.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.57.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.57.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.58.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.58.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.58.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.59.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.59.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.59.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.6.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.6.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.6.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.60.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.60.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.60.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.61.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.61.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.61.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.62.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.62.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.62.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.63.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.63.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.63.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.64.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.64.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.64.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.65.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.65.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.65.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.66.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.66.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.66.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.67.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.67.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.67.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.68.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.68.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.68.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.69.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.69.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.69.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.7.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.7.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.7.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.70.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.70.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.70.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.71.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.71.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.71.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.72.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.72.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.72.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.73.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.73.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.73.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.74.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.74.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.74.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.75.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.75.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.75.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.76.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.76.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.76.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.77.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.77.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.77.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.78.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.78.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.78.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.79.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.79.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.79.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.8.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.8.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.8.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.80.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.80.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.80.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.81.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.81.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.81.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.82.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.82.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.82.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.83.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.83.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.83.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.84.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.84.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.84.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.85.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.85.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.85.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.86.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.86.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.86.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.87.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.87.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.87.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.88.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.88.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.88.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.89.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.89.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.89.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.9.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.9.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.9.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.90.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.90.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.90.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.91.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.91.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.91.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.92.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.92.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.92.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.93.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.93.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.93.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.94.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.94.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.94.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.95.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.95.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.95.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.96.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.96.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.96.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.97.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.97.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.97.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.98.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.98.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.98.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.99.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.99.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.experts.99.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.gate.e_score_correction_bias": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.gate.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.shared_experts.down_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.shared_experts.gate_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.mlp.shared_experts.up_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.post_attention_layernorm.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.k_norm.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.k_proj.bias": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.k_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.o_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.q_norm.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.q_proj.bias": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.q_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.v_proj.bias": "model-00043-of-00092.safetensors",
+ "model.layers.42.self_attn.v_proj.weight": "model-00043-of-00092.safetensors",
+ "model.layers.43.input_layernorm.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.0.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.0.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.0.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.1.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.1.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.1.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.10.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.10.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.10.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.100.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.100.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.100.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.101.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.101.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.101.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.102.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.102.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.102.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.103.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.103.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.103.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.104.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.104.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.104.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.105.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.105.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.105.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.106.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.106.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.106.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.107.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.107.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.107.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.108.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.108.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.108.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.109.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.109.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.109.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.11.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.11.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.11.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.110.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.110.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.110.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.111.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.111.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.111.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.112.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.112.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.112.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.113.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.113.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.113.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.114.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.114.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.114.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.115.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.115.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.115.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.116.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.116.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.116.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.117.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.117.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.117.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.118.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.118.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.118.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.119.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.119.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.119.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.12.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.12.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.12.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.120.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.120.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.120.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.121.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.121.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.121.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.122.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.122.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.122.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.123.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.123.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.123.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.124.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.124.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.124.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.125.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.125.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.125.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.126.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.126.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.126.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.127.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.127.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.127.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.128.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.128.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.128.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.129.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.129.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.129.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.13.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.13.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.13.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.130.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.130.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.130.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.131.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.131.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.131.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.132.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.132.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.132.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.133.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.133.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.133.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.134.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.134.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.134.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.135.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.135.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.135.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.136.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.136.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.136.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.137.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.137.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.137.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.138.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.138.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.138.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.139.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.139.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.139.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.14.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.14.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.14.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.140.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.140.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.140.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.141.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.141.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.141.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.142.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.142.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.142.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.143.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.143.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.143.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.144.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.144.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.144.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.145.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.145.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.145.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.146.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.146.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.146.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.147.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.147.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.147.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.148.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.148.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.148.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.149.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.149.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.149.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.15.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.15.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.15.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.150.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.150.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.150.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.151.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.151.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.151.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.152.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.152.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.152.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.153.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.153.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.153.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.154.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.154.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.154.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.155.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.155.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.155.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.156.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.156.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.156.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.157.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.157.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.157.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.158.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.158.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.158.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.159.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.159.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.159.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.16.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.16.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.16.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.17.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.17.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.17.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.18.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.18.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.18.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.19.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.19.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.19.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.2.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.2.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.2.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.20.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.20.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.20.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.21.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.21.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.21.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.22.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.22.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.22.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.23.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.23.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.23.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.24.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.24.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.24.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.25.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.25.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.25.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.26.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.26.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.26.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.27.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.27.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.27.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.28.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.28.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.28.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.29.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.29.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.29.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.3.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.3.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.3.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.30.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.30.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.30.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.31.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.31.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.31.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.32.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.32.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.32.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.33.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.33.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.33.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.34.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.34.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.34.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.35.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.35.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.35.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.36.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.36.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.36.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.37.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.37.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.37.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.38.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.38.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.38.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.39.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.39.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.39.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.4.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.4.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.4.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.40.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.40.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.40.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.41.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.41.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.41.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.42.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.42.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.42.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.43.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.43.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.43.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.44.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.44.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.44.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.45.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.45.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.45.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.46.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.46.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.46.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.47.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.47.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.47.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.48.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.48.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.48.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.49.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.49.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.49.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.5.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.5.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.5.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.50.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.50.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.50.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.51.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.51.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.51.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.52.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.52.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.52.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.53.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.53.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.53.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.54.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.54.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.54.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.55.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.55.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.55.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.56.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.56.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.56.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.57.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.57.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.57.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.58.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.58.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.58.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.59.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.59.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.59.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.6.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.6.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.6.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.60.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.60.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.60.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.61.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.61.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.61.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.62.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.62.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.62.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.63.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.63.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.63.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.64.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.64.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.64.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.65.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.65.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.65.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.66.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.66.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.66.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.67.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.67.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.67.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.68.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.68.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.68.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.69.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.69.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.69.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.7.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.7.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.7.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.70.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.70.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.70.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.71.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.71.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.71.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.72.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.72.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.72.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.73.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.73.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.73.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.74.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.74.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.74.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.75.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.75.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.75.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.76.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.76.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.76.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.77.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.77.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.77.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.78.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.78.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.78.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.79.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.79.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.79.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.8.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.8.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.8.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.80.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.80.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.80.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.81.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.81.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.81.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.82.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.82.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.82.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.83.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.83.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.83.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.84.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.84.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.84.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.85.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.85.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.85.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.86.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.86.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.86.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.87.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.87.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.87.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.88.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.88.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.88.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.89.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.89.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.89.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.9.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.9.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.9.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.90.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.90.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.90.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.91.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.91.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.91.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.92.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.92.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.92.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.93.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.93.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.93.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.94.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.94.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.94.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.95.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.95.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.95.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.96.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.96.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.96.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.97.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.97.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.97.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.98.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.98.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.98.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.99.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.99.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.experts.99.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.gate.e_score_correction_bias": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.gate.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.shared_experts.down_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.shared_experts.gate_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.mlp.shared_experts.up_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.post_attention_layernorm.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.k_norm.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.k_proj.bias": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.k_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.o_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.q_norm.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.q_proj.bias": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.q_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.v_proj.bias": "model-00044-of-00092.safetensors",
+ "model.layers.43.self_attn.v_proj.weight": "model-00044-of-00092.safetensors",
+ "model.layers.44.input_layernorm.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.0.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.0.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.0.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.1.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.1.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.1.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.10.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.10.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.10.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.100.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.100.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.100.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.101.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.101.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.101.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.102.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.102.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.102.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.103.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.103.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.103.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.104.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.104.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.104.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.105.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.105.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.105.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.106.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.106.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.106.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.107.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.107.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.107.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.108.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.108.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.108.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.109.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.109.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.109.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.11.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.11.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.11.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.110.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.110.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.110.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.111.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.111.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.111.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.112.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.112.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.112.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.113.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.113.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.113.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.114.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.114.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.114.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.115.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.115.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.115.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.116.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.116.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.116.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.117.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.117.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.117.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.118.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.118.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.118.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.119.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.119.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.119.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.12.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.12.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.12.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.120.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.120.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.120.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.121.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.121.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.121.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.122.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.122.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.122.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.123.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.123.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.123.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.124.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.124.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.124.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.125.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.125.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.125.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.126.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.126.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.126.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.127.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.127.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.127.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.128.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.128.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.128.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.129.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.129.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.129.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.13.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.13.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.13.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.130.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.130.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.130.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.131.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.131.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.131.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.132.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.132.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.132.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.133.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.133.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.133.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.134.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.134.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.134.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.135.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.135.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.135.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.136.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.136.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.136.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.137.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.137.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.137.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.138.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.138.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.138.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.139.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.139.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.139.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.14.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.14.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.14.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.140.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.140.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.140.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.141.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.141.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.141.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.142.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.142.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.142.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.143.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.143.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.143.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.144.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.144.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.144.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.145.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.145.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.145.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.146.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.146.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.146.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.147.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.147.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.147.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.148.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.148.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.148.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.149.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.149.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.149.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.15.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.15.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.15.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.150.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.150.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.150.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.151.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.151.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.151.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.152.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.152.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.152.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.153.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.153.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.153.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.154.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.154.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.154.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.155.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.155.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.155.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.156.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.156.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.156.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.157.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.157.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.157.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.158.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.158.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.158.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.159.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.159.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.159.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.16.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.16.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.16.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.17.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.17.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.17.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.18.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.18.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.18.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.19.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.19.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.19.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.2.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.2.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.2.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.20.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.20.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.20.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.21.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.21.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.21.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.22.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.22.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.22.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.23.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.23.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.23.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.24.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.24.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.24.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.25.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.25.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.25.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.26.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.26.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.26.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.27.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.27.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.27.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.28.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.28.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.28.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.29.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.29.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.29.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.3.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.3.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.3.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.30.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.30.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.30.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.31.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.31.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.31.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.32.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.32.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.32.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.33.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.33.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.33.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.34.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.34.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.34.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.35.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.35.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.35.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.36.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.36.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.36.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.37.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.37.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.37.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.38.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.38.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.38.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.39.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.39.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.39.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.4.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.4.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.4.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.40.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.40.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.40.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.41.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.41.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.41.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.42.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.42.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.42.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.43.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.43.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.43.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.44.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.44.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.44.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.45.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.45.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.45.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.46.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.46.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.46.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.47.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.47.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.47.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.48.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.48.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.48.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.49.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.49.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.49.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.5.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.5.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.5.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.50.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.50.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.50.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.51.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.51.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.51.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.52.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.52.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.52.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.53.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.53.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.53.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.54.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.54.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.54.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.55.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.55.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.55.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.56.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.56.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.56.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.57.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.57.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.57.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.58.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.58.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.58.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.59.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.59.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.59.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.6.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.6.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.6.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.60.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.60.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.60.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.61.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.61.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.61.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.62.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.62.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.62.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.63.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.63.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.63.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.64.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.64.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.64.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.65.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.65.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.65.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.66.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.66.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.66.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.67.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.67.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.67.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.68.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.68.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.68.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.69.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.69.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.69.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.7.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.7.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.7.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.70.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.70.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.70.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.71.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.71.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.71.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.72.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.72.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.72.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.73.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.73.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.73.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.74.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.74.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.74.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.75.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.75.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.75.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.76.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.76.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.76.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.77.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.77.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.77.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.78.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.78.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.78.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.79.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.79.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.79.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.8.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.8.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.8.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.80.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.80.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.80.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.81.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.81.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.81.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.82.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.82.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.82.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.83.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.83.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.83.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.84.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.84.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.84.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.85.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.85.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.85.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.86.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.86.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.86.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.87.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.87.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.87.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.88.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.88.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.88.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.89.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.89.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.89.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.9.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.9.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.9.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.90.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.90.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.90.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.91.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.91.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.91.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.92.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.92.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.92.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.93.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.93.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.93.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.94.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.94.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.94.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.95.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.95.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.95.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.96.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.96.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.96.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.97.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.97.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.97.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.98.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.98.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.98.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.99.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.99.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.experts.99.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.gate.e_score_correction_bias": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.gate.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.shared_experts.down_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.shared_experts.gate_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.mlp.shared_experts.up_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.post_attention_layernorm.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.k_norm.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.k_proj.bias": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.k_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.o_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.q_norm.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.q_proj.bias": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.q_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.v_proj.bias": "model-00045-of-00092.safetensors",
+ "model.layers.44.self_attn.v_proj.weight": "model-00045-of-00092.safetensors",
+ "model.layers.45.input_layernorm.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.0.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.0.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.0.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.1.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.1.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.1.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.10.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.10.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.10.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.100.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.100.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.100.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.101.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.101.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.101.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.102.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.102.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.102.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.103.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.103.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.103.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.104.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.104.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.104.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.105.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.105.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.105.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.106.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.106.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.106.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.107.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.107.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.107.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.108.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.108.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.108.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.109.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.109.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.109.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.11.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.11.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.11.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.110.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.110.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.110.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.111.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.111.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.111.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.112.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.112.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.112.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.113.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.113.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.113.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.114.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.114.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.114.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.115.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.115.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.115.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.116.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.116.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.116.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.117.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.117.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.117.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.118.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.118.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.118.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.119.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.119.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.119.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.12.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.12.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.12.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.120.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.120.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.120.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.121.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.121.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.121.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.122.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.122.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.122.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.123.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.123.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.123.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.124.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.124.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.124.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.125.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.125.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.125.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.126.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.126.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.126.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.127.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.127.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.127.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.128.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.128.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.128.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.129.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.129.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.129.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.13.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.13.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.13.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.130.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.130.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.130.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.131.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.131.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.131.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.132.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.132.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.132.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.133.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.133.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.133.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.134.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.134.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.134.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.135.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.135.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.135.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.136.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.136.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.136.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.137.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.137.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.137.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.138.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.138.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.138.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.139.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.139.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.139.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.14.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.14.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.14.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.140.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.140.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.140.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.141.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.141.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.141.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.142.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.142.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.142.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.143.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.143.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.143.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.144.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.144.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.144.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.145.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.145.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.145.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.146.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.146.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.146.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.147.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.147.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.147.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.148.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.148.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.148.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.149.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.149.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.149.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.15.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.15.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.15.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.150.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.150.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.150.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.151.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.151.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.151.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.152.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.152.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.152.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.153.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.153.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.153.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.154.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.154.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.154.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.155.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.155.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.155.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.156.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.156.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.156.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.157.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.157.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.157.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.158.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.158.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.158.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.159.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.159.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.159.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.16.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.16.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.16.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.17.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.17.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.17.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.18.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.18.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.18.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.19.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.19.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.19.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.2.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.2.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.2.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.20.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.20.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.20.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.21.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.21.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.21.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.22.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.22.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.22.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.23.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.23.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.23.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.24.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.24.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.24.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.25.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.25.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.25.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.26.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.26.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.26.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.27.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.27.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.27.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.28.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.28.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.28.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.29.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.29.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.29.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.3.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.3.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.3.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.30.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.30.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.30.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.31.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.31.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.31.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.32.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.32.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.32.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.33.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.33.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.33.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.34.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.34.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.34.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.35.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.35.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.35.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.36.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.36.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.36.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.37.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.37.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.37.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.38.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.38.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.38.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.39.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.39.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.39.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.4.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.4.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.4.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.40.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.40.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.40.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.41.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.41.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.41.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.42.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.42.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.42.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.43.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.43.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.43.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.44.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.44.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.44.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.45.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.45.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.45.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.46.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.46.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.46.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.47.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.47.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.47.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.48.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.48.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.48.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.49.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.49.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.49.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.5.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.5.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.5.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.50.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.50.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.50.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.51.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.51.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.51.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.52.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.52.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.52.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.53.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.53.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.53.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.54.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.54.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.54.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.55.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.55.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.55.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.56.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.56.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.56.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.57.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.57.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.57.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.58.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.58.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.58.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.59.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.59.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.59.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.6.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.6.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.6.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.60.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.60.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.60.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.61.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.61.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.61.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.62.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.62.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.62.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.63.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.63.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.63.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.64.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.64.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.64.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.65.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.65.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.65.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.66.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.66.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.66.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.67.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.67.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.67.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.68.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.68.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.68.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.69.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.69.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.69.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.7.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.7.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.7.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.70.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.70.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.70.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.71.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.71.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.71.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.72.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.72.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.72.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.73.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.73.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.73.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.74.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.74.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.74.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.75.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.75.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.75.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.76.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.76.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.76.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.77.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.77.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.77.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.78.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.78.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.78.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.79.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.79.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.79.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.8.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.8.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.8.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.80.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.80.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.80.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.81.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.81.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.81.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.82.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.82.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.82.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.83.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.83.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.83.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.84.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.84.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.84.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.85.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.85.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.85.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.86.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.86.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.86.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.87.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.87.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.87.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.88.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.88.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.88.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.89.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.89.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.89.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.9.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.9.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.9.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.90.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.90.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.90.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.91.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.91.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.91.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.92.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.92.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.92.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.93.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.93.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.93.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.94.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.94.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.94.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.95.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.95.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.95.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.96.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.96.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.96.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.97.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.97.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.97.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.98.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.98.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.98.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.99.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.99.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.experts.99.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.gate.e_score_correction_bias": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.gate.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.shared_experts.down_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.shared_experts.gate_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.mlp.shared_experts.up_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.post_attention_layernorm.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.k_norm.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.k_proj.bias": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.k_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.o_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.q_norm.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.q_proj.bias": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.q_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.v_proj.bias": "model-00046-of-00092.safetensors",
+ "model.layers.45.self_attn.v_proj.weight": "model-00046-of-00092.safetensors",
+ "model.layers.46.input_layernorm.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.0.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.0.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.0.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.1.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.1.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.1.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.10.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.10.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.10.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.100.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.100.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.100.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.101.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.101.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.101.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.102.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.102.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.102.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.103.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.103.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.103.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.104.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.104.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.104.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.105.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.105.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.105.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.106.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.106.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.106.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.107.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.107.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.107.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.108.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.108.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.108.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.109.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.109.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.109.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.11.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.11.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.11.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.110.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.110.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.110.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.111.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.111.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.111.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.112.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.112.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.112.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.113.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.113.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.113.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.114.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.114.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.114.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.115.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.115.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.115.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.116.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.116.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.116.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.117.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.117.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.117.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.118.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.118.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.118.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.119.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.119.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.119.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.12.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.12.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.12.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.120.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.120.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.120.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.121.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.121.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.121.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.122.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.122.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.122.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.123.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.123.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.123.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.124.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.124.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.124.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.125.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.125.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.125.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.126.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.126.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.126.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.127.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.127.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.127.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.128.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.128.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.128.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.129.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.129.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.129.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.13.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.13.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.13.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.130.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.130.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.130.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.131.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.131.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.131.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.132.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.132.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.132.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.133.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.133.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.133.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.134.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.134.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.134.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.135.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.135.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.135.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.136.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.136.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.136.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.137.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.137.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.137.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.138.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.138.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.138.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.139.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.139.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.139.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.14.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.14.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.14.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.140.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.140.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.140.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.141.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.141.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.141.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.142.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.142.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.142.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.143.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.143.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.143.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.144.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.144.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.144.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.145.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.145.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.145.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.146.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.146.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.146.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.147.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.147.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.147.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.148.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.148.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.148.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.149.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.149.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.149.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.15.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.15.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.15.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.150.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.150.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.150.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.151.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.151.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.151.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.152.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.152.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.152.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.153.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.153.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.153.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.154.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.154.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.154.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.155.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.155.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.155.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.156.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.156.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.156.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.157.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.157.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.157.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.158.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.158.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.158.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.159.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.159.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.159.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.16.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.16.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.16.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.17.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.17.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.17.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.18.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.18.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.18.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.19.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.19.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.19.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.2.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.2.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.2.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.20.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.20.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.20.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.21.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.21.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.21.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.22.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.22.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.22.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.23.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.23.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.23.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.24.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.24.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.24.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.25.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.25.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.25.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.26.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.26.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.26.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.27.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.27.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.27.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.28.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.28.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.28.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.29.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.29.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.29.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.3.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.3.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.3.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.30.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.30.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.30.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.31.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.31.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.31.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.32.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.32.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.32.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.33.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.33.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.33.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.34.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.34.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.34.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.35.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.35.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.35.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.36.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.36.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.36.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.37.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.37.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.37.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.38.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.38.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.38.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.39.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.39.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.39.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.4.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.4.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.4.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.40.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.40.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.40.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.41.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.41.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.41.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.42.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.42.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.42.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.43.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.43.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.43.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.44.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.44.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.44.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.45.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.45.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.45.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.46.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.46.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.46.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.47.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.47.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.47.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.48.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.48.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.48.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.49.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.49.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.49.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.5.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.5.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.5.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.50.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.50.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.50.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.51.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.51.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.51.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.52.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.52.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.52.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.53.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.53.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.53.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.54.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.54.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.54.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.55.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.55.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.55.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.56.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.56.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.56.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.57.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.57.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.57.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.58.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.58.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.58.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.59.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.59.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.59.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.6.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.6.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.6.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.60.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.60.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.60.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.61.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.61.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.61.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.62.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.62.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.62.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.63.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.63.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.63.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.64.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.64.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.64.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.65.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.65.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.65.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.66.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.66.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.66.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.67.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.67.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.67.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.68.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.68.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.68.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.69.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.69.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.69.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.7.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.7.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.7.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.70.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.70.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.70.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.71.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.71.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.71.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.72.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.72.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.72.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.73.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.73.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.73.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.74.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.74.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.74.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.75.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.75.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.75.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.76.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.76.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.76.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.77.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.77.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.77.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.78.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.78.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.78.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.79.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.79.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.79.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.8.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.8.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.8.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.80.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.80.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.80.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.81.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.81.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.81.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.82.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.82.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.82.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.83.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.83.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.83.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.84.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.84.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.84.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.85.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.85.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.85.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.86.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.86.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.86.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.87.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.87.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.87.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.88.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.88.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.88.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.89.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.89.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.89.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.9.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.9.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.9.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.90.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.90.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.90.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.91.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.91.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.91.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.92.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.92.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.92.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.93.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.93.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.93.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.94.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.94.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.94.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.95.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.95.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.95.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.96.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.96.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.96.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.97.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.97.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.97.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.98.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.98.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.98.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.99.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.99.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.experts.99.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.gate.e_score_correction_bias": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.gate.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.shared_experts.down_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.shared_experts.gate_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.mlp.shared_experts.up_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.post_attention_layernorm.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.k_norm.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.k_proj.bias": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.k_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.o_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.q_norm.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.q_proj.bias": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.q_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.v_proj.bias": "model-00047-of-00092.safetensors",
+ "model.layers.46.self_attn.v_proj.weight": "model-00047-of-00092.safetensors",
+ "model.layers.47.input_layernorm.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.0.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.0.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.0.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.1.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.1.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.1.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.10.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.10.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.10.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.100.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.100.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.100.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.101.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.101.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.101.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.102.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.102.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.102.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.103.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.103.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.103.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.104.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.104.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.104.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.105.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.105.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.105.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.106.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.106.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.106.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.107.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.107.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.107.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.108.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.108.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.108.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.109.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.109.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.109.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.11.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.11.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.11.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.110.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.110.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.110.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.111.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.111.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.111.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.112.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.112.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.112.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.113.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.113.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.113.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.114.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.114.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.114.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.115.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.115.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.115.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.116.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.116.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.116.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.117.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.117.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.117.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.118.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.118.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.118.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.119.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.119.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.119.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.12.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.12.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.12.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.120.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.120.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.120.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.121.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.121.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.121.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.122.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.122.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.122.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.123.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.123.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.123.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.124.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.124.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.124.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.125.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.125.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.125.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.126.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.126.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.126.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.127.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.127.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.127.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.128.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.128.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.128.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.129.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.129.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.129.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.13.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.13.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.13.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.130.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.130.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.130.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.131.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.131.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.131.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.132.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.132.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.132.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.133.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.133.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.133.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.134.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.134.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.134.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.135.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.135.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.135.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.136.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.136.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.136.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.137.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.137.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.137.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.138.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.138.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.138.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.139.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.139.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.139.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.14.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.14.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.14.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.140.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.140.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.140.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.141.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.141.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.141.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.142.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.142.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.142.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.143.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.143.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.143.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.144.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.144.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.144.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.145.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.145.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.145.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.146.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.146.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.146.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.147.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.147.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.147.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.148.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.148.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.148.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.149.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.149.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.149.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.15.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.15.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.15.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.150.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.150.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.150.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.151.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.151.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.151.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.152.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.152.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.152.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.153.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.153.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.153.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.154.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.154.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.154.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.155.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.155.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.155.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.156.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.156.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.156.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.157.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.157.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.157.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.158.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.158.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.158.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.159.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.159.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.159.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.16.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.16.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.16.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.17.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.17.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.17.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.18.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.18.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.18.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.19.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.19.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.19.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.2.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.2.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.2.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.20.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.20.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.20.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.21.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.21.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.21.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.22.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.22.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.22.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.23.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.23.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.23.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.24.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.24.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.24.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.25.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.25.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.25.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.26.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.26.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.26.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.27.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.27.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.27.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.28.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.28.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.28.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.29.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.29.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.29.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.3.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.3.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.3.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.30.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.30.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.30.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.31.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.31.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.31.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.32.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.32.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.32.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.33.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.33.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.33.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.34.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.34.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.34.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.35.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.35.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.35.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.36.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.36.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.36.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.37.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.37.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.37.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.38.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.38.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.38.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.39.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.39.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.39.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.4.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.4.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.4.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.40.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.40.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.40.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.41.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.41.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.41.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.42.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.42.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.42.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.43.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.43.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.43.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.44.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.44.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.44.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.45.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.45.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.45.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.46.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.46.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.46.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.47.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.47.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.47.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.48.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.48.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.48.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.49.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.49.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.49.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.5.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.5.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.5.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.50.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.50.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.50.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.51.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.51.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.51.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.52.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.52.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.52.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.53.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.53.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.53.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.54.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.54.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.54.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.55.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.55.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.55.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.56.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.56.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.56.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.57.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.57.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.57.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.58.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.58.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.58.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.59.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.59.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.59.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.6.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.6.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.6.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.60.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.60.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.60.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.61.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.61.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.61.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.62.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.62.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.62.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.63.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.63.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.63.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.64.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.64.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.64.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.65.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.65.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.65.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.66.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.66.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.66.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.67.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.67.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.67.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.68.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.68.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.68.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.69.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.69.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.69.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.7.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.7.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.7.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.70.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.70.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.70.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.71.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.71.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.71.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.72.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.72.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.72.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.73.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.73.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.73.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.74.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.74.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.74.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.75.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.75.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.75.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.76.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.76.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.76.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.77.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.77.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.77.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.78.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.78.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.78.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.79.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.79.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.79.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.8.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.8.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.8.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.80.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.80.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.80.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.81.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.81.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.81.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.82.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.82.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.82.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.83.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.83.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.83.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.84.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.84.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.84.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.85.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.85.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.85.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.86.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.86.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.86.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.87.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.87.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.87.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.88.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.88.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.88.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.89.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.89.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.89.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.9.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.9.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.9.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.90.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.90.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.90.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.91.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.91.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.91.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.92.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.92.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.92.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.93.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.93.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.93.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.94.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.94.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.94.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.95.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.95.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.95.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.96.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.96.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.96.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.97.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.97.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.97.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.98.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.98.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.98.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.99.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.99.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.experts.99.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.gate.e_score_correction_bias": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.gate.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.shared_experts.down_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.shared_experts.gate_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.mlp.shared_experts.up_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.post_attention_layernorm.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.k_norm.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.k_proj.bias": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.k_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.o_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.q_norm.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.q_proj.bias": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.q_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.v_proj.bias": "model-00048-of-00092.safetensors",
+ "model.layers.47.self_attn.v_proj.weight": "model-00048-of-00092.safetensors",
+ "model.layers.48.input_layernorm.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.0.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.0.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.0.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.1.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.1.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.1.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.10.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.10.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.10.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.100.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.100.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.100.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.101.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.101.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.101.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.102.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.102.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.102.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.103.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.103.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.103.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.104.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.104.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.104.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.105.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.105.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.105.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.106.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.106.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.106.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.107.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.107.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.107.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.108.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.108.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.108.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.109.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.109.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.109.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.11.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.11.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.11.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.110.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.110.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.110.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.111.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.111.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.111.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.112.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.112.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.112.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.113.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.113.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.113.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.114.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.114.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.114.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.115.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.115.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.115.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.116.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.116.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.116.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.117.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.117.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.117.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.118.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.118.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.118.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.119.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.119.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.119.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.12.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.12.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.12.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.120.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.120.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.120.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.121.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.121.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.121.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.122.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.122.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.122.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.123.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.123.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.123.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.124.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.124.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.124.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.125.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.125.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.125.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.126.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.126.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.126.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.127.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.127.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.127.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.128.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.128.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.128.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.129.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.129.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.129.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.13.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.13.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.13.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.130.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.130.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.130.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.131.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.131.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.131.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.132.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.132.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.132.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.133.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.133.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.133.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.134.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.134.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.134.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.135.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.135.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.135.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.136.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.136.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.136.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.137.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.137.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.137.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.138.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.138.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.138.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.139.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.139.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.139.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.14.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.14.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.14.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.140.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.140.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.140.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.141.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.141.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.141.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.142.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.142.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.142.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.143.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.143.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.143.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.144.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.144.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.144.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.145.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.145.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.145.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.146.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.146.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.146.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.147.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.147.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.147.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.148.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.148.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.148.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.149.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.149.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.149.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.15.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.15.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.15.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.150.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.150.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.150.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.151.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.151.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.151.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.152.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.152.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.152.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.153.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.153.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.153.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.154.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.154.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.154.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.155.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.155.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.155.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.156.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.156.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.156.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.157.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.157.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.157.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.158.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.158.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.158.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.159.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.159.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.159.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.16.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.16.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.16.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.17.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.17.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.17.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.18.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.18.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.18.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.19.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.19.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.19.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.2.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.2.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.2.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.20.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.20.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.20.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.21.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.21.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.21.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.22.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.22.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.22.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.23.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.23.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.23.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.24.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.24.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.24.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.25.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.25.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.25.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.26.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.26.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.26.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.27.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.27.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.27.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.28.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.28.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.28.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.29.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.29.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.29.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.3.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.3.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.3.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.30.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.30.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.30.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.31.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.31.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.31.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.32.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.32.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.32.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.33.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.33.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.33.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.34.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.34.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.34.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.35.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.35.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.35.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.36.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.36.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.36.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.37.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.37.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.37.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.38.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.38.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.38.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.39.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.39.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.39.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.4.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.4.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.4.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.40.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.40.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.40.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.41.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.41.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.41.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.42.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.42.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.42.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.43.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.43.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.43.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.44.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.44.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.44.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.45.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.45.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.45.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.46.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.46.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.46.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.47.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.47.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.47.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.48.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.48.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.48.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.49.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.49.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.49.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.5.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.5.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.5.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.50.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.50.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.50.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.51.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.51.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.51.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.52.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.52.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.52.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.53.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.53.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.53.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.54.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.54.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.54.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.55.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.55.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.55.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.56.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.56.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.56.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.57.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.57.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.57.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.58.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.58.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.58.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.59.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.59.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.59.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.6.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.6.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.6.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.60.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.60.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.60.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.61.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.61.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.61.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.62.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.62.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.62.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.63.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.63.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.63.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.64.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.64.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.64.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.65.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.65.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.65.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.66.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.66.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.66.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.67.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.67.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.67.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.68.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.68.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.68.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.69.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.69.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.69.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.7.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.7.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.7.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.70.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.70.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.70.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.71.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.71.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.71.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.72.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.72.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.72.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.73.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.73.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.73.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.74.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.74.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.74.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.75.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.75.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.75.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.76.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.76.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.76.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.77.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.77.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.77.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.78.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.78.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.78.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.79.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.79.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.79.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.8.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.8.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.8.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.80.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.80.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.80.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.81.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.81.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.81.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.82.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.82.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.82.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.83.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.83.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.83.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.84.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.84.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.84.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.85.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.85.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.85.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.86.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.86.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.86.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.87.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.87.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.87.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.88.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.88.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.88.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.89.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.89.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.89.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.9.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.9.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.9.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.90.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.90.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.90.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.91.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.91.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.91.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.92.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.92.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.92.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.93.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.93.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.93.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.94.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.94.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.94.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.95.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.95.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.95.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.96.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.96.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.96.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.97.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.97.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.97.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.98.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.98.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.98.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.99.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.99.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.experts.99.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.gate.e_score_correction_bias": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.gate.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.shared_experts.down_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.shared_experts.gate_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.mlp.shared_experts.up_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.post_attention_layernorm.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.k_norm.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.k_proj.bias": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.k_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.o_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.q_norm.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.q_proj.bias": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.q_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.v_proj.bias": "model-00049-of-00092.safetensors",
+ "model.layers.48.self_attn.v_proj.weight": "model-00049-of-00092.safetensors",
+ "model.layers.49.input_layernorm.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.0.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.0.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.0.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.1.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.1.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.1.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.10.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.10.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.10.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.100.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.100.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.100.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.101.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.101.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.101.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.102.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.102.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.102.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.103.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.103.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.103.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.104.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.104.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.104.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.105.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.105.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.105.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.106.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.106.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.106.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.107.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.107.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.107.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.108.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.108.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.108.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.109.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.109.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.109.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.11.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.11.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.11.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.110.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.110.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.110.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.111.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.111.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.111.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.112.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.112.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.112.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.113.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.113.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.113.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.114.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.114.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.114.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.115.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.115.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.115.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.116.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.116.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.116.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.117.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.117.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.117.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.118.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.118.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.118.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.119.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.119.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.119.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.12.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.12.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.12.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.120.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.120.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.120.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.121.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.121.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.121.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.122.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.122.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.122.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.123.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.123.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.123.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.124.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.124.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.124.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.125.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.125.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.125.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.126.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.126.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.126.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.127.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.127.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.127.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.128.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.128.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.128.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.129.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.129.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.129.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.13.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.13.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.13.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.130.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.130.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.130.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.131.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.131.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.131.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.132.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.132.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.132.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.133.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.133.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.133.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.134.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.134.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.134.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.135.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.135.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.135.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.136.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.136.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.136.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.137.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.137.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.137.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.138.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.138.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.138.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.139.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.139.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.139.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.14.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.14.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.14.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.140.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.140.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.140.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.141.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.141.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.141.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.142.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.142.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.142.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.143.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.143.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.143.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.144.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.144.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.144.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.145.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.145.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.145.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.146.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.146.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.146.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.147.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.147.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.147.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.148.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.148.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.148.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.149.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.149.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.149.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.15.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.15.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.15.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.150.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.150.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.150.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.151.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.151.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.151.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.152.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.152.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.152.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.153.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.153.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.153.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.154.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.154.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.154.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.155.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.155.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.155.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.156.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.156.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.156.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.157.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.157.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.157.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.158.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.158.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.158.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.159.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.159.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.159.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.16.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.16.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.16.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.17.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.17.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.17.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.18.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.18.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.18.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.19.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.19.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.19.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.2.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.2.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.2.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.20.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.20.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.20.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.21.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.21.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.21.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.22.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.22.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.22.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.23.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.23.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.23.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.24.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.24.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.24.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.25.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.25.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.25.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.26.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.26.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.26.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.27.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.27.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.27.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.28.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.28.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.28.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.29.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.29.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.29.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.3.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.3.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.3.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.30.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.30.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.30.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.31.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.31.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.31.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.32.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.32.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.32.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.33.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.33.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.33.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.34.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.34.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.34.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.35.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.35.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.35.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.36.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.36.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.36.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.37.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.37.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.37.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.38.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.38.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.38.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.39.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.39.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.39.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.4.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.4.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.4.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.40.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.40.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.40.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.41.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.41.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.41.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.42.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.42.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.42.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.43.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.43.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.43.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.44.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.44.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.44.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.45.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.45.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.45.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.46.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.46.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.46.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.47.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.47.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.47.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.48.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.48.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.48.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.49.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.49.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.49.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.5.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.5.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.5.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.50.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.50.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.50.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.51.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.51.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.51.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.52.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.52.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.52.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.53.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.53.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.53.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.54.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.54.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.54.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.55.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.55.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.55.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.56.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.56.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.56.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.57.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.57.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.57.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.58.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.58.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.58.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.59.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.59.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.59.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.6.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.6.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.6.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.60.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.60.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.60.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.61.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.61.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.61.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.62.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.62.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.62.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.63.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.63.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.63.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.64.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.64.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.64.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.65.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.65.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.65.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.66.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.66.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.66.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.67.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.67.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.67.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.68.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.68.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.68.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.69.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.69.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.69.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.7.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.7.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.7.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.70.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.70.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.70.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.71.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.71.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.71.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.72.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.72.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.72.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.73.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.73.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.73.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.74.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.74.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.74.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.75.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.75.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.75.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.76.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.76.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.76.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.77.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.77.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.77.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.78.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.78.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.78.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.79.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.79.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.79.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.8.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.8.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.8.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.80.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.80.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.80.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.81.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.81.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.81.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.82.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.82.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.82.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.83.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.83.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.83.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.84.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.84.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.84.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.85.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.85.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.85.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.86.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.86.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.86.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.87.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.87.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.87.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.88.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.88.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.88.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.89.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.89.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.89.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.9.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.9.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.9.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.90.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.90.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.90.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.91.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.91.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.91.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.92.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.92.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.92.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.93.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.93.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.93.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.94.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.94.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.94.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.95.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.95.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.95.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.96.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.96.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.96.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.97.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.97.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.97.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.98.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.98.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.98.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.99.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.99.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.experts.99.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.gate.e_score_correction_bias": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.gate.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.shared_experts.down_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.shared_experts.gate_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.mlp.shared_experts.up_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.post_attention_layernorm.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.k_norm.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.k_proj.bias": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.k_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.o_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.q_norm.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.q_proj.bias": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.q_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.v_proj.bias": "model-00050-of-00092.safetensors",
+ "model.layers.49.self_attn.v_proj.weight": "model-00050-of-00092.safetensors",
+ "model.layers.50.input_layernorm.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.0.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.0.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.0.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.1.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.1.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.1.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.10.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.10.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.10.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.100.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.100.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.100.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.101.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.101.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.101.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.102.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.102.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.102.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.103.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.103.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.103.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.104.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.104.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.104.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.105.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.105.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.105.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.106.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.106.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.106.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.107.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.107.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.107.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.108.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.108.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.108.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.109.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.109.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.109.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.11.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.11.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.11.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.110.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.110.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.110.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.111.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.111.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.111.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.112.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.112.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.112.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.113.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.113.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.113.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.114.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.114.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.114.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.115.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.115.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.115.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.116.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.116.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.116.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.117.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.117.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.117.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.118.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.118.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.118.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.119.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.119.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.119.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.12.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.12.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.12.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.120.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.120.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.120.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.121.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.121.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.121.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.122.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.122.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.122.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.123.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.123.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.123.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.124.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.124.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.124.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.125.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.125.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.125.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.126.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.126.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.126.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.127.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.127.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.127.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.128.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.128.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.128.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.129.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.129.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.129.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.13.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.13.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.13.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.130.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.130.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.130.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.131.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.131.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.131.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.132.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.132.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.132.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.133.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.133.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.133.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.134.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.134.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.134.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.135.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.135.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.135.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.136.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.136.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.136.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.137.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.137.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.137.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.138.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.138.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.138.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.139.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.139.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.139.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.14.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.14.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.14.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.140.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.140.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.140.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.141.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.141.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.141.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.142.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.142.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.142.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.143.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.143.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.143.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.144.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.144.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.144.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.145.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.145.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.145.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.146.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.146.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.146.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.147.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.147.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.147.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.148.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.148.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.148.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.149.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.149.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.149.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.15.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.15.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.15.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.150.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.150.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.150.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.151.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.151.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.151.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.152.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.152.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.152.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.153.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.153.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.153.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.154.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.154.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.154.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.155.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.155.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.155.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.156.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.156.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.156.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.157.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.157.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.157.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.158.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.158.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.158.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.159.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.159.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.159.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.16.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.16.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.16.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.17.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.17.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.17.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.18.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.18.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.18.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.19.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.19.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.19.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.2.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.2.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.2.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.20.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.20.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.20.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.21.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.21.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.21.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.22.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.22.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.22.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.23.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.23.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.23.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.24.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.24.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.24.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.25.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.25.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.25.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.26.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.26.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.26.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.27.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.27.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.27.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.28.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.28.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.28.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.29.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.29.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.29.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.3.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.3.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.3.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.30.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.30.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.30.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.31.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.31.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.31.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.32.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.32.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.32.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.33.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.33.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.33.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.34.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.34.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.34.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.35.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.35.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.35.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.36.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.36.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.36.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.37.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.37.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.37.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.38.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.38.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.38.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.39.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.39.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.39.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.4.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.4.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.4.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.40.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.40.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.40.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.41.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.41.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.41.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.42.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.42.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.42.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.43.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.43.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.43.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.44.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.44.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.44.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.45.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.45.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.45.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.46.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.46.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.46.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.47.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.47.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.47.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.48.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.48.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.48.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.49.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.49.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.49.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.5.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.5.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.5.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.50.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.50.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.50.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.51.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.51.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.51.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.52.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.52.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.52.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.53.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.53.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.53.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.54.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.54.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.54.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.55.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.55.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.55.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.56.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.56.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.56.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.57.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.57.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.57.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.58.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.58.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.58.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.59.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.59.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.59.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.6.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.6.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.6.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.60.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.60.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.60.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.61.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.61.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.61.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.62.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.62.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.62.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.63.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.63.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.63.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.64.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.64.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.64.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.65.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.65.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.65.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.66.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.66.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.66.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.67.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.67.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.67.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.68.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.68.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.68.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.69.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.69.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.69.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.7.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.7.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.7.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.70.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.70.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.70.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.71.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.71.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.71.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.72.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.72.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.72.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.73.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.73.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.73.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.74.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.74.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.74.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.75.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.75.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.75.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.76.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.76.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.76.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.77.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.77.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.77.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.78.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.78.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.78.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.79.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.79.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.79.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.8.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.8.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.8.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.80.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.80.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.80.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.81.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.81.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.81.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.82.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.82.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.82.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.83.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.83.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.83.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.84.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.84.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.84.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.85.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.85.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.85.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.86.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.86.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.86.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.87.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.87.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.87.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.88.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.88.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.88.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.89.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.89.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.89.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.9.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.9.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.9.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.90.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.90.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.90.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.91.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.91.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.91.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.92.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.92.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.92.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.93.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.93.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.93.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.94.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.94.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.94.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.95.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.95.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.95.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.96.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.96.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.96.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.97.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.97.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.97.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.98.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.98.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.98.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.99.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.99.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.experts.99.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.gate.e_score_correction_bias": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.gate.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.shared_experts.down_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.shared_experts.gate_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.mlp.shared_experts.up_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.post_attention_layernorm.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.k_norm.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.k_proj.bias": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.k_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.o_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.q_norm.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.q_proj.bias": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.q_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.v_proj.bias": "model-00051-of-00092.safetensors",
+ "model.layers.50.self_attn.v_proj.weight": "model-00051-of-00092.safetensors",
+ "model.layers.51.input_layernorm.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.0.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.0.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.0.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.1.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.1.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.1.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.10.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.10.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.10.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.100.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.100.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.100.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.101.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.101.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.101.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.102.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.102.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.102.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.103.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.103.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.103.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.104.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.104.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.104.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.105.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.105.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.105.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.106.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.106.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.106.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.107.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.107.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.107.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.108.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.108.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.108.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.109.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.109.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.109.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.11.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.11.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.11.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.110.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.110.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.110.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.111.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.111.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.111.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.112.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.112.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.112.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.113.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.113.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.113.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.114.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.114.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.114.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.115.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.115.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.115.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.116.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.116.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.116.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.117.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.117.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.117.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.118.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.118.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.118.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.119.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.119.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.119.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.12.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.12.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.12.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.120.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.120.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.120.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.121.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.121.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.121.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.122.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.122.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.122.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.123.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.123.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.123.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.124.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.124.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.124.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.125.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.125.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.125.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.126.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.126.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.126.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.127.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.127.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.127.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.128.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.128.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.128.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.129.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.129.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.129.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.13.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.13.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.13.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.130.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.130.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.130.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.131.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.131.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.131.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.132.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.132.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.132.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.133.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.133.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.133.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.134.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.134.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.134.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.135.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.135.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.135.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.136.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.136.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.136.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.137.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.137.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.137.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.138.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.138.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.138.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.139.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.139.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.139.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.14.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.14.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.14.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.140.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.140.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.140.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.141.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.141.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.141.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.142.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.142.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.142.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.143.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.143.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.143.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.144.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.144.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.144.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.145.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.145.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.145.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.146.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.146.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.146.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.147.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.147.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.147.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.148.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.148.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.148.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.149.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.149.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.149.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.15.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.15.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.15.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.150.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.150.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.150.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.151.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.151.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.151.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.152.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.152.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.152.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.153.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.153.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.153.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.154.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.154.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.154.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.155.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.155.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.155.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.156.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.156.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.156.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.157.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.157.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.157.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.158.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.158.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.158.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.159.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.159.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.159.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.16.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.16.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.16.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.17.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.17.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.17.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.18.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.18.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.18.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.19.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.19.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.19.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.2.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.2.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.2.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.20.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.20.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.20.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.21.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.21.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.21.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.22.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.22.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.22.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.23.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.23.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.23.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.24.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.24.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.24.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.25.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.25.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.25.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.26.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.26.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.26.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.27.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.27.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.27.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.28.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.28.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.28.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.29.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.29.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.29.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.3.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.3.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.3.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.30.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.30.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.30.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.31.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.31.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.31.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.32.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.32.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.32.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.33.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.33.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.33.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.34.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.34.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.34.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.35.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.35.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.35.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.36.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.36.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.36.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.37.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.37.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.37.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.38.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.38.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.38.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.39.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.39.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.39.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.4.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.4.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.4.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.40.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.40.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.40.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.41.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.41.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.41.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.42.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.42.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.42.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.43.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.43.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.43.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.44.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.44.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.44.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.45.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.45.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.45.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.46.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.46.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.46.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.47.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.47.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.47.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.48.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.48.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.48.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.49.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.49.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.49.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.5.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.5.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.5.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.50.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.50.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.50.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.51.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.51.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.51.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.52.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.52.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.52.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.53.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.53.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.53.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.54.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.54.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.54.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.55.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.55.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.55.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.56.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.56.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.56.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.57.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.57.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.57.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.58.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.58.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.58.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.59.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.59.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.59.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.6.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.6.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.6.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.60.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.60.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.60.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.61.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.61.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.61.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.62.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.62.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.62.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.63.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.63.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.63.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.64.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.64.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.64.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.65.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.65.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.65.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.66.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.66.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.66.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.67.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.67.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.67.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.68.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.68.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.68.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.69.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.69.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.69.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.7.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.7.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.7.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.70.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.70.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.70.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.71.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.71.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.71.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.72.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.72.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.72.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.73.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.73.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.73.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.74.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.74.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.74.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.75.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.75.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.75.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.76.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.76.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.76.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.77.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.77.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.77.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.78.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.78.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.78.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.79.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.79.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.79.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.8.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.8.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.8.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.80.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.80.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.80.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.81.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.81.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.81.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.82.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.82.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.82.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.83.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.83.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.83.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.84.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.84.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.84.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.85.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.85.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.85.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.86.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.86.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.86.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.87.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.87.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.87.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.88.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.88.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.88.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.89.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.89.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.89.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.9.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.9.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.9.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.90.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.90.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.90.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.91.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.91.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.91.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.92.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.92.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.92.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.93.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.93.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.93.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.94.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.94.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.94.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.95.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.95.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.95.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.96.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.96.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.96.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.97.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.97.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.97.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.98.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.98.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.98.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.99.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.99.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.experts.99.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.gate.e_score_correction_bias": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.gate.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.shared_experts.down_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.shared_experts.gate_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.mlp.shared_experts.up_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.post_attention_layernorm.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.k_norm.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.k_proj.bias": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.k_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.o_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.q_norm.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.q_proj.bias": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.q_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.v_proj.bias": "model-00052-of-00092.safetensors",
+ "model.layers.51.self_attn.v_proj.weight": "model-00052-of-00092.safetensors",
+ "model.layers.52.input_layernorm.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.0.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.0.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.0.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.1.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.1.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.1.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.10.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.10.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.10.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.100.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.100.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.100.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.101.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.101.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.101.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.102.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.102.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.102.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.103.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.103.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.103.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.104.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.104.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.104.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.105.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.105.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.105.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.106.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.106.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.106.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.107.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.107.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.107.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.108.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.108.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.108.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.109.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.109.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.109.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.11.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.11.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.11.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.110.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.110.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.110.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.111.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.111.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.111.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.112.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.112.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.112.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.113.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.113.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.113.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.114.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.114.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.114.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.115.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.115.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.115.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.116.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.116.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.116.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.117.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.117.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.117.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.118.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.118.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.118.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.119.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.119.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.119.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.12.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.12.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.12.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.120.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.120.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.120.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.121.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.121.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.121.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.122.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.122.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.122.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.123.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.123.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.123.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.124.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.124.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.124.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.125.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.125.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.125.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.126.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.126.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.126.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.127.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.127.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.127.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.128.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.128.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.128.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.129.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.129.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.129.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.13.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.13.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.13.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.130.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.130.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.130.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.131.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.131.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.131.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.132.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.132.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.132.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.133.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.133.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.133.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.134.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.134.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.134.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.135.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.135.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.135.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.136.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.136.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.136.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.137.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.137.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.137.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.138.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.138.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.138.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.139.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.139.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.139.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.14.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.14.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.14.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.140.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.140.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.140.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.141.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.141.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.141.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.142.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.142.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.142.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.143.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.143.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.143.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.144.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.144.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.144.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.145.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.145.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.145.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.146.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.146.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.146.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.147.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.147.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.147.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.148.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.148.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.148.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.149.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.149.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.149.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.15.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.15.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.15.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.150.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.150.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.150.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.151.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.151.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.151.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.152.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.152.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.152.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.153.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.153.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.153.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.154.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.154.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.154.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.155.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.155.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.155.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.156.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.156.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.156.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.157.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.157.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.157.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.158.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.158.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.158.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.159.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.159.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.159.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.16.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.16.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.16.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.17.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.17.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.17.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.18.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.18.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.18.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.19.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.19.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.19.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.2.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.2.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.2.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.20.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.20.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.20.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.21.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.21.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.21.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.22.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.22.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.22.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.23.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.23.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.23.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.24.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.24.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.24.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.25.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.25.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.25.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.26.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.26.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.26.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.27.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.27.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.27.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.28.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.28.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.28.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.29.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.29.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.29.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.3.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.3.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.3.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.30.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.30.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.30.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.31.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.31.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.31.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.32.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.32.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.32.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.33.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.33.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.33.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.34.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.34.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.34.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.35.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.35.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.35.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.36.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.36.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.36.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.37.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.37.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.37.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.38.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.38.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.38.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.39.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.39.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.39.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.4.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.4.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.4.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.40.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.40.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.40.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.41.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.41.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.41.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.42.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.42.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.42.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.43.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.43.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.43.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.44.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.44.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.44.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.45.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.45.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.45.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.46.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.46.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.46.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.47.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.47.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.47.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.48.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.48.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.48.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.49.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.49.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.49.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.5.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.5.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.5.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.50.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.50.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.50.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.51.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.51.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.51.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.52.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.52.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.52.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.53.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.53.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.53.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.54.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.54.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.54.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.55.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.55.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.55.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.56.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.56.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.56.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.57.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.57.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.57.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.58.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.58.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.58.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.59.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.59.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.59.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.6.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.6.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.6.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.60.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.60.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.60.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.61.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.61.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.61.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.62.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.62.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.62.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.63.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.63.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.63.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.64.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.64.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.64.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.65.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.65.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.65.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.66.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.66.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.66.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.67.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.67.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.67.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.68.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.68.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.68.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.69.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.69.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.69.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.7.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.7.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.7.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.70.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.70.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.70.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.71.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.71.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.71.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.72.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.72.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.72.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.73.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.73.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.73.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.74.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.74.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.74.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.75.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.75.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.75.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.76.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.76.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.76.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.77.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.77.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.77.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.78.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.78.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.78.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.79.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.79.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.79.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.8.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.8.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.8.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.80.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.80.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.80.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.81.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.81.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.81.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.82.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.82.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.82.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.83.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.83.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.83.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.84.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.84.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.84.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.85.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.85.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.85.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.86.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.86.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.86.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.87.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.87.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.87.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.88.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.88.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.88.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.89.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.89.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.89.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.9.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.9.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.9.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.90.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.90.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.90.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.91.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.91.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.91.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.92.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.92.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.92.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.93.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.93.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.93.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.94.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.94.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.94.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.95.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.95.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.95.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.96.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.96.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.96.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.97.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.97.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.97.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.98.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.98.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.98.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.99.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.99.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.experts.99.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.gate.e_score_correction_bias": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.gate.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.shared_experts.down_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.shared_experts.gate_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.mlp.shared_experts.up_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.post_attention_layernorm.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.k_norm.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.k_proj.bias": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.k_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.o_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.q_norm.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.q_proj.bias": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.q_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.v_proj.bias": "model-00053-of-00092.safetensors",
+ "model.layers.52.self_attn.v_proj.weight": "model-00053-of-00092.safetensors",
+ "model.layers.53.input_layernorm.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.0.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.0.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.0.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.1.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.1.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.1.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.10.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.10.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.10.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.100.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.100.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.100.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.101.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.101.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.101.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.102.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.102.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.102.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.103.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.103.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.103.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.104.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.104.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.104.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.105.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.105.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.105.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.106.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.106.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.106.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.107.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.107.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.107.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.108.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.108.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.108.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.109.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.109.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.109.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.11.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.11.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.11.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.110.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.110.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.110.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.111.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.111.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.111.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.112.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.112.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.112.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.113.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.113.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.113.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.114.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.114.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.114.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.115.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.115.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.115.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.116.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.116.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.116.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.117.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.117.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.117.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.118.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.118.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.118.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.119.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.119.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.119.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.12.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.12.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.12.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.120.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.120.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.120.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.121.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.121.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.121.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.122.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.122.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.122.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.123.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.123.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.123.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.124.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.124.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.124.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.125.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.125.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.125.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.126.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.126.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.126.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.127.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.127.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.127.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.128.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.128.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.128.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.129.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.129.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.129.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.13.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.13.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.13.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.130.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.130.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.130.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.131.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.131.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.131.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.132.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.132.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.132.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.133.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.133.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.133.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.134.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.134.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.134.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.135.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.135.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.135.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.136.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.136.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.136.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.137.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.137.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.137.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.138.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.138.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.138.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.139.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.139.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.139.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.14.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.14.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.14.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.140.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.140.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.140.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.141.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.141.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.141.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.142.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.142.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.142.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.143.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.143.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.143.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.144.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.144.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.144.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.145.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.145.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.145.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.146.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.146.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.146.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.147.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.147.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.147.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.148.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.148.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.148.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.149.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.149.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.149.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.15.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.15.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.15.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.150.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.150.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.150.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.151.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.151.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.151.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.152.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.152.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.152.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.153.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.153.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.153.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.154.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.154.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.154.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.155.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.155.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.155.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.156.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.156.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.156.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.157.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.157.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.157.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.158.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.158.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.158.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.159.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.159.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.159.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.16.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.16.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.16.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.17.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.17.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.17.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.18.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.18.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.18.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.19.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.19.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.19.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.2.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.2.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.2.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.20.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.20.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.20.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.21.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.21.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.21.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.22.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.22.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.22.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.23.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.23.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.23.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.24.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.24.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.24.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.25.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.25.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.25.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.26.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.26.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.26.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.27.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.27.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.27.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.28.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.28.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.28.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.29.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.29.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.29.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.3.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.3.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.3.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.30.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.30.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.30.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.31.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.31.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.31.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.32.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.32.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.32.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.33.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.33.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.33.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.34.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.34.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.34.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.35.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.35.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.35.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.36.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.36.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.36.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.37.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.37.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.37.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.38.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.38.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.38.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.39.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.39.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.39.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.4.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.4.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.4.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.40.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.40.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.40.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.41.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.41.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.41.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.42.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.42.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.42.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.43.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.43.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.43.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.44.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.44.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.44.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.45.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.45.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.45.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.46.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.46.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.46.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.47.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.47.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.47.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.48.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.48.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.48.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.49.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.49.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.49.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.5.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.5.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.5.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.50.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.50.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.50.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.51.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.51.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.51.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.52.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.52.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.52.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.53.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.53.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.53.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.54.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.54.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.54.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.55.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.55.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.55.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.56.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.56.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.56.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.57.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.57.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.57.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.58.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.58.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.58.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.59.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.59.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.59.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.6.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.6.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.6.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.60.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.60.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.60.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.61.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.61.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.61.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.62.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.62.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.62.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.63.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.63.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.63.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.64.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.64.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.64.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.65.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.65.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.65.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.66.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.66.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.66.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.67.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.67.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.67.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.68.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.68.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.68.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.69.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.69.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.69.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.7.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.7.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.7.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.70.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.70.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.70.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.71.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.71.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.71.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.72.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.72.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.72.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.73.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.73.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.73.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.74.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.74.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.74.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.75.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.75.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.75.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.76.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.76.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.76.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.77.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.77.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.77.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.78.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.78.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.78.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.79.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.79.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.79.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.8.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.8.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.8.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.80.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.80.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.80.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.81.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.81.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.81.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.82.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.82.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.82.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.83.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.83.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.83.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.84.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.84.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.84.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.85.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.85.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.85.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.86.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.86.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.86.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.87.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.87.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.87.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.88.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.88.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.88.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.89.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.89.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.89.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.9.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.9.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.9.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.90.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.90.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.90.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.91.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.91.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.91.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.92.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.92.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.92.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.93.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.93.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.93.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.94.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.94.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.94.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.95.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.95.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.95.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.96.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.96.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.96.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.97.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.97.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.97.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.98.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.98.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.98.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.99.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.99.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.experts.99.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.gate.e_score_correction_bias": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.gate.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.shared_experts.down_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.shared_experts.gate_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.mlp.shared_experts.up_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.post_attention_layernorm.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.k_norm.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.k_proj.bias": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.k_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.o_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.q_norm.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.q_proj.bias": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.q_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.v_proj.bias": "model-00054-of-00092.safetensors",
+ "model.layers.53.self_attn.v_proj.weight": "model-00054-of-00092.safetensors",
+ "model.layers.54.input_layernorm.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.0.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.0.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.0.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.1.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.1.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.1.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.10.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.10.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.10.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.100.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.100.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.100.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.101.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.101.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.101.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.102.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.102.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.102.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.103.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.103.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.103.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.104.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.104.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.104.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.105.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.105.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.105.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.106.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.106.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.106.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.107.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.107.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.107.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.108.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.108.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.108.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.109.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.109.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.109.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.11.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.11.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.11.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.110.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.110.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.110.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.111.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.111.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.111.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.112.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.112.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.112.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.113.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.113.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.113.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.114.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.114.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.114.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.115.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.115.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.115.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.116.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.116.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.116.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.117.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.117.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.117.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.118.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.118.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.118.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.119.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.119.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.119.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.12.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.12.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.12.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.120.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.120.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.120.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.121.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.121.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.121.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.122.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.122.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.122.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.123.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.123.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.123.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.124.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.124.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.124.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.125.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.125.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.125.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.126.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.126.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.126.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.127.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.127.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.127.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.128.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.128.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.128.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.129.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.129.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.129.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.13.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.13.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.13.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.130.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.130.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.130.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.131.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.131.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.131.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.132.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.132.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.132.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.133.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.133.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.133.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.134.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.134.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.134.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.135.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.135.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.135.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.136.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.136.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.136.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.137.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.137.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.137.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.138.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.138.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.138.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.139.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.139.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.139.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.14.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.14.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.14.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.140.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.140.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.140.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.141.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.141.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.141.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.142.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.142.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.142.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.143.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.143.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.143.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.144.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.144.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.144.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.145.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.145.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.145.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.146.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.146.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.146.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.147.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.147.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.147.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.148.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.148.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.148.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.149.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.149.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.149.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.15.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.15.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.15.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.150.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.150.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.150.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.151.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.151.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.151.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.152.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.152.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.152.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.153.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.153.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.153.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.154.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.154.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.154.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.155.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.155.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.155.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.156.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.156.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.156.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.157.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.157.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.157.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.158.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.158.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.158.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.159.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.159.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.159.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.16.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.16.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.16.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.17.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.17.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.17.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.18.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.18.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.18.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.19.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.19.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.19.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.2.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.2.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.2.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.20.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.20.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.20.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.21.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.21.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.21.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.22.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.22.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.22.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.23.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.23.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.23.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.24.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.24.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.24.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.25.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.25.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.25.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.26.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.26.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.26.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.27.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.27.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.27.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.28.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.28.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.28.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.29.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.29.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.29.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.3.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.3.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.3.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.30.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.30.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.30.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.31.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.31.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.31.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.32.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.32.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.32.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.33.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.33.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.33.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.34.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.34.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.34.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.35.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.35.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.35.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.36.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.36.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.36.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.37.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.37.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.37.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.38.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.38.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.38.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.39.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.39.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.39.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.4.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.4.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.4.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.40.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.40.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.40.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.41.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.41.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.41.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.42.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.42.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.42.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.43.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.43.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.43.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.44.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.44.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.44.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.45.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.45.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.45.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.46.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.46.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.46.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.47.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.47.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.47.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.48.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.48.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.48.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.49.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.49.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.49.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.5.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.5.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.5.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.50.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.50.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.50.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.51.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.51.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.51.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.52.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.52.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.52.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.53.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.53.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.53.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.54.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.54.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.54.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.55.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.55.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.55.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.56.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.56.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.56.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.57.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.57.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.57.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.58.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.58.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.58.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.59.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.59.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.59.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.6.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.6.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.6.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.60.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.60.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.60.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.61.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.61.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.61.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.62.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.62.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.62.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.63.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.63.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.63.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.64.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.64.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.64.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.65.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.65.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.65.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.66.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.66.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.66.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.67.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.67.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.67.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.68.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.68.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.68.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.69.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.69.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.69.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.7.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.7.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.7.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.70.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.70.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.70.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.71.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.71.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.71.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.72.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.72.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.72.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.73.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.73.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.73.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.74.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.74.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.74.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.75.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.75.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.75.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.76.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.76.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.76.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.77.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.77.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.77.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.78.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.78.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.78.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.79.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.79.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.79.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.8.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.8.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.8.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.80.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.80.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.80.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.81.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.81.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.81.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.82.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.82.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.82.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.83.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.83.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.83.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.84.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.84.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.84.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.85.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.85.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.85.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.86.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.86.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.86.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.87.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.87.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.87.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.88.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.88.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.88.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.89.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.89.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.89.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.9.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.9.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.9.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.90.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.90.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.90.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.91.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.91.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.91.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.92.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.92.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.92.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.93.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.93.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.93.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.94.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.94.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.94.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.95.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.95.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.95.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.96.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.96.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.96.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.97.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.97.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.97.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.98.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.98.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.98.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.99.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.99.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.experts.99.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.gate.e_score_correction_bias": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.gate.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.shared_experts.down_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.shared_experts.gate_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.mlp.shared_experts.up_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.post_attention_layernorm.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.k_norm.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.k_proj.bias": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.k_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.o_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.q_norm.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.q_proj.bias": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.q_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.v_proj.bias": "model-00055-of-00092.safetensors",
+ "model.layers.54.self_attn.v_proj.weight": "model-00055-of-00092.safetensors",
+ "model.layers.55.input_layernorm.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.0.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.0.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.0.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.1.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.1.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.1.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.10.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.10.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.10.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.100.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.100.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.100.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.101.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.101.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.101.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.102.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.102.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.102.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.103.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.103.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.103.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.104.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.104.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.104.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.105.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.105.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.105.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.106.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.106.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.106.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.107.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.107.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.107.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.108.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.108.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.108.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.109.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.109.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.109.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.11.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.11.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.11.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.110.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.110.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.110.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.111.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.111.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.111.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.112.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.112.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.112.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.113.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.113.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.113.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.114.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.114.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.114.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.115.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.115.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.115.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.116.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.116.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.116.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.117.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.117.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.117.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.118.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.118.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.118.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.119.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.119.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.119.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.12.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.12.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.12.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.120.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.120.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.120.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.121.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.121.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.121.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.122.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.122.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.122.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.123.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.123.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.123.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.124.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.124.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.124.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.125.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.125.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.125.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.126.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.126.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.126.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.127.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.127.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.127.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.128.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.128.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.128.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.129.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.129.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.129.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.13.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.13.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.13.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.130.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.130.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.130.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.131.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.131.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.131.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.132.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.132.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.132.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.133.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.133.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.133.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.134.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.134.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.134.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.135.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.135.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.135.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.136.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.136.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.136.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.137.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.137.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.137.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.138.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.138.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.138.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.139.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.139.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.139.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.14.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.14.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.14.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.140.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.140.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.140.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.141.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.141.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.141.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.142.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.142.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.142.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.143.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.143.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.143.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.144.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.144.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.144.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.145.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.145.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.145.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.146.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.146.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.146.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.147.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.147.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.147.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.148.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.148.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.148.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.149.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.149.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.149.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.15.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.15.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.15.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.150.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.150.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.150.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.151.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.151.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.151.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.152.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.152.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.152.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.153.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.153.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.153.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.154.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.154.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.154.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.155.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.155.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.155.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.156.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.156.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.156.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.157.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.157.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.157.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.158.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.158.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.158.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.159.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.159.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.159.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.16.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.16.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.16.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.17.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.17.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.17.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.18.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.18.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.18.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.19.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.19.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.19.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.2.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.2.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.2.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.20.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.20.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.20.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.21.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.21.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.21.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.22.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.22.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.22.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.23.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.23.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.23.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.24.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.24.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.24.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.25.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.25.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.25.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.26.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.26.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.26.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.27.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.27.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.27.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.28.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.28.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.28.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.29.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.29.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.29.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.3.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.3.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.3.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.30.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.30.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.30.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.31.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.31.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.31.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.32.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.32.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.32.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.33.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.33.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.33.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.34.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.34.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.34.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.35.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.35.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.35.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.36.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.36.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.36.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.37.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.37.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.37.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.38.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.38.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.38.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.39.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.39.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.39.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.4.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.4.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.4.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.40.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.40.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.40.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.41.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.41.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.41.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.42.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.42.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.42.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.43.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.43.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.43.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.44.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.44.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.44.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.45.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.45.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.45.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.46.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.46.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.46.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.47.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.47.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.47.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.48.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.48.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.48.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.49.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.49.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.49.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.5.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.5.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.5.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.50.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.50.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.50.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.51.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.51.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.51.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.52.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.52.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.52.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.53.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.53.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.53.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.54.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.54.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.54.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.55.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.55.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.55.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.56.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.56.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.56.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.57.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.57.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.57.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.58.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.58.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.58.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.59.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.59.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.59.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.6.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.6.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.6.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.60.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.60.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.60.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.61.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.61.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.61.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.62.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.62.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.62.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.63.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.63.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.63.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.64.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.64.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.64.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.65.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.65.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.65.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.66.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.66.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.66.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.67.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.67.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.67.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.68.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.68.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.68.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.69.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.69.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.69.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.7.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.7.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.7.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.70.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.70.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.70.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.71.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.71.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.71.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.72.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.72.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.72.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.73.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.73.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.73.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.74.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.74.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.74.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.75.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.75.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.75.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.76.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.76.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.76.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.77.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.77.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.77.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.78.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.78.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.78.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.79.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.79.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.79.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.8.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.8.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.8.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.80.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.80.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.80.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.81.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.81.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.81.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.82.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.82.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.82.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.83.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.83.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.83.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.84.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.84.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.84.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.85.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.85.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.85.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.86.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.86.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.86.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.87.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.87.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.87.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.88.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.88.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.88.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.89.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.89.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.89.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.9.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.9.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.9.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.90.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.90.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.90.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.91.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.91.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.91.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.92.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.92.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.92.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.93.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.93.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.93.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.94.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.94.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.94.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.95.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.95.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.95.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.96.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.96.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.96.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.97.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.97.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.97.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.98.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.98.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.98.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.99.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.99.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.experts.99.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.gate.e_score_correction_bias": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.gate.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.shared_experts.down_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.shared_experts.gate_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.mlp.shared_experts.up_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.post_attention_layernorm.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.k_norm.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.k_proj.bias": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.k_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.o_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.q_norm.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.q_proj.bias": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.q_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.v_proj.bias": "model-00056-of-00092.safetensors",
+ "model.layers.55.self_attn.v_proj.weight": "model-00056-of-00092.safetensors",
+ "model.layers.56.input_layernorm.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.0.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.0.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.0.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.1.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.1.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.1.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.10.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.10.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.10.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.100.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.100.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.100.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.101.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.101.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.101.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.102.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.102.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.102.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.103.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.103.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.103.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.104.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.104.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.104.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.105.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.105.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.105.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.106.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.106.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.106.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.107.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.107.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.107.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.108.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.108.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.108.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.109.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.109.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.109.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.11.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.11.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.11.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.110.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.110.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.110.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.111.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.111.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.111.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.112.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.112.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.112.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.113.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.113.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.113.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.114.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.114.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.114.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.115.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.115.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.115.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.116.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.116.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.116.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.117.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.117.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.117.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.118.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.118.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.118.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.119.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.119.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.119.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.12.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.12.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.12.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.120.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.120.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.120.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.121.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.121.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.121.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.122.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.122.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.122.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.123.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.123.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.123.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.124.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.124.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.124.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.125.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.125.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.125.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.126.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.126.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.126.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.127.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.127.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.127.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.128.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.128.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.128.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.129.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.129.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.129.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.13.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.13.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.13.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.130.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.130.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.130.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.131.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.131.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.131.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.132.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.132.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.132.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.133.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.133.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.133.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.134.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.134.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.134.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.135.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.135.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.135.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.136.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.136.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.136.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.137.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.137.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.137.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.138.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.138.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.138.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.139.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.139.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.139.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.14.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.14.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.14.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.140.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.140.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.140.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.141.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.141.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.141.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.142.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.142.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.142.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.143.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.143.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.143.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.144.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.144.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.144.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.145.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.145.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.145.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.146.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.146.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.146.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.147.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.147.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.147.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.148.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.148.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.148.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.149.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.149.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.149.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.15.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.15.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.15.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.150.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.150.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.150.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.151.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.151.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.151.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.152.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.152.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.152.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.153.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.153.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.153.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.154.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.154.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.154.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.155.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.155.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.155.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.156.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.156.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.156.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.157.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.157.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.157.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.158.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.158.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.158.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.159.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.159.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.159.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.16.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.16.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.16.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.17.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.17.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.17.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.18.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.18.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.18.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.19.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.19.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.19.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.2.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.2.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.2.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.20.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.20.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.20.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.21.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.21.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.21.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.22.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.22.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.22.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.23.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.23.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.23.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.24.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.24.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.24.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.25.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.25.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.25.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.26.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.26.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.26.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.27.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.27.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.27.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.28.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.28.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.28.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.29.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.29.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.29.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.3.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.3.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.3.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.30.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.30.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.30.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.31.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.31.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.31.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.32.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.32.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.32.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.33.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.33.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.33.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.34.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.34.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.34.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.35.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.35.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.35.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.36.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.36.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.36.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.37.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.37.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.37.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.38.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.38.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.38.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.39.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.39.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.39.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.4.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.4.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.4.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.40.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.40.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.40.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.41.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.41.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.41.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.42.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.42.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.42.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.43.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.43.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.43.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.44.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.44.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.44.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.45.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.45.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.45.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.46.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.46.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.46.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.47.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.47.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.47.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.48.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.48.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.48.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.49.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.49.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.49.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.5.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.5.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.5.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.50.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.50.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.50.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.51.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.51.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.51.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.52.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.52.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.52.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.53.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.53.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.53.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.54.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.54.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.54.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.55.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.55.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.55.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.56.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.56.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.56.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.57.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.57.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.57.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.58.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.58.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.58.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.59.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.59.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.59.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.6.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.6.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.6.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.60.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.60.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.60.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.61.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.61.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.61.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.62.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.62.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.62.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.63.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.63.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.63.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.64.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.64.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.64.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.65.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.65.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.65.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.66.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.66.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.66.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.67.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.67.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.67.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.68.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.68.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.68.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.69.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.69.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.69.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.7.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.7.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.7.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.70.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.70.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.70.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.71.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.71.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.71.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.72.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.72.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.72.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.73.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.73.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.73.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.74.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.74.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.74.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.75.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.75.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.75.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.76.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.76.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.76.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.77.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.77.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.77.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.78.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.78.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.78.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.79.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.79.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.79.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.8.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.8.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.8.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.80.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.80.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.80.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.81.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.81.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.81.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.82.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.82.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.82.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.83.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.83.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.83.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.84.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.84.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.84.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.85.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.85.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.85.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.86.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.86.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.86.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.87.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.87.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.87.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.88.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.88.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.88.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.89.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.89.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.89.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.9.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.9.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.9.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.90.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.90.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.90.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.91.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.91.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.91.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.92.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.92.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.92.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.93.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.93.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.93.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.94.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.94.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.94.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.95.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.95.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.95.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.96.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.96.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.96.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.97.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.97.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.97.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.98.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.98.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.98.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.99.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.99.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.experts.99.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.gate.e_score_correction_bias": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.gate.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.shared_experts.down_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.shared_experts.gate_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.mlp.shared_experts.up_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.post_attention_layernorm.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.k_norm.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.k_proj.bias": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.k_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.o_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.q_norm.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.q_proj.bias": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.q_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.v_proj.bias": "model-00057-of-00092.safetensors",
+ "model.layers.56.self_attn.v_proj.weight": "model-00057-of-00092.safetensors",
+ "model.layers.57.input_layernorm.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.0.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.0.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.0.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.1.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.1.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.1.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.10.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.10.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.10.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.100.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.100.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.100.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.101.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.101.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.101.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.102.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.102.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.102.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.103.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.103.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.103.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.104.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.104.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.104.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.105.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.105.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.105.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.106.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.106.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.106.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.107.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.107.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.107.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.108.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.108.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.108.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.109.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.109.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.109.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.11.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.11.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.11.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.110.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.110.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.110.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.111.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.111.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.111.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.112.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.112.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.112.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.113.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.113.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.113.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.114.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.114.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.114.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.115.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.115.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.115.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.116.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.116.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.116.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.117.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.117.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.117.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.118.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.118.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.118.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.119.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.119.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.119.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.12.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.12.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.12.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.120.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.120.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.120.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.121.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.121.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.121.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.122.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.122.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.122.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.123.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.123.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.123.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.124.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.124.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.124.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.125.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.125.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.125.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.126.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.126.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.126.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.127.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.127.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.127.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.128.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.128.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.128.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.129.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.129.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.129.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.13.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.13.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.13.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.130.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.130.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.130.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.131.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.131.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.131.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.132.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.132.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.132.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.133.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.133.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.133.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.134.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.134.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.134.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.135.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.135.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.135.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.136.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.136.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.136.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.137.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.137.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.137.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.138.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.138.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.138.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.139.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.139.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.139.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.14.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.14.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.14.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.140.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.140.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.140.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.141.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.141.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.141.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.142.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.142.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.142.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.143.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.143.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.143.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.144.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.144.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.144.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.145.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.145.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.145.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.146.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.146.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.146.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.147.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.147.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.147.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.148.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.148.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.148.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.149.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.149.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.149.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.15.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.15.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.15.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.150.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.150.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.150.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.151.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.151.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.151.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.152.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.152.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.152.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.153.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.153.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.153.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.154.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.154.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.154.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.155.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.155.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.155.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.156.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.156.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.156.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.157.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.157.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.157.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.158.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.158.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.158.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.159.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.159.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.159.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.16.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.16.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.16.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.17.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.17.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.17.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.18.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.18.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.18.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.19.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.19.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.19.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.2.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.2.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.2.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.20.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.20.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.20.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.21.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.21.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.21.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.22.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.22.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.22.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.23.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.23.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.23.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.24.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.24.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.24.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.25.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.25.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.25.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.26.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.26.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.26.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.27.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.27.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.27.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.28.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.28.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.28.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.29.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.29.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.29.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.3.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.3.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.3.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.30.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.30.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.30.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.31.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.31.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.31.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.32.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.32.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.32.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.33.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.33.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.33.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.34.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.34.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.34.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.35.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.35.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.35.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.36.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.36.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.36.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.37.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.37.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.37.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.38.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.38.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.38.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.39.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.39.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.39.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.4.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.4.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.4.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.40.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.40.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.40.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.41.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.41.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.41.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.42.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.42.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.42.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.43.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.43.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.43.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.44.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.44.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.44.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.45.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.45.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.45.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.46.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.46.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.46.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.47.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.47.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.47.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.48.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.48.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.48.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.49.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.49.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.49.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.5.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.5.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.5.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.50.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.50.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.50.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.51.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.51.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.51.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.52.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.52.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.52.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.53.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.53.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.53.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.54.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.54.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.54.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.55.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.55.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.55.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.56.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.56.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.56.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.57.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.57.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.57.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.58.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.58.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.58.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.59.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.59.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.59.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.6.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.6.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.6.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.60.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.60.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.60.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.61.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.61.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.61.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.62.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.62.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.62.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.63.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.63.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.63.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.64.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.64.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.64.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.65.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.65.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.65.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.66.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.66.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.66.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.67.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.67.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.67.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.68.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.68.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.68.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.69.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.69.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.69.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.7.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.7.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.7.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.70.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.70.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.70.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.71.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.71.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.71.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.72.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.72.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.72.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.73.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.73.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.73.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.74.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.74.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.74.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.75.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.75.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.75.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.76.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.76.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.76.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.77.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.77.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.77.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.78.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.78.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.78.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.79.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.79.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.79.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.8.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.8.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.8.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.80.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.80.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.80.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.81.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.81.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.81.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.82.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.82.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.82.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.83.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.83.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.83.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.84.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.84.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.84.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.85.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.85.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.85.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.86.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.86.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.86.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.87.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.87.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.87.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.88.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.88.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.88.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.89.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.89.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.89.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.9.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.9.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.9.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.90.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.90.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.90.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.91.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.91.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.91.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.92.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.92.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.92.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.93.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.93.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.93.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.94.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.94.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.94.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.95.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.95.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.95.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.96.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.96.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.96.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.97.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.97.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.97.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.98.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.98.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.98.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.99.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.99.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.experts.99.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.gate.e_score_correction_bias": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.gate.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.shared_experts.down_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.shared_experts.gate_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.mlp.shared_experts.up_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.post_attention_layernorm.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.k_norm.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.k_proj.bias": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.k_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.o_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.q_norm.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.q_proj.bias": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.q_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.v_proj.bias": "model-00058-of-00092.safetensors",
+ "model.layers.57.self_attn.v_proj.weight": "model-00058-of-00092.safetensors",
+ "model.layers.58.input_layernorm.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.0.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.0.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.0.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.1.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.1.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.1.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.10.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.10.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.10.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.100.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.100.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.100.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.101.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.101.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.101.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.102.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.102.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.102.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.103.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.103.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.103.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.104.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.104.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.104.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.105.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.105.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.105.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.106.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.106.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.106.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.107.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.107.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.107.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.108.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.108.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.108.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.109.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.109.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.109.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.11.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.11.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.11.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.110.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.110.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.110.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.111.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.111.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.111.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.112.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.112.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.112.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.113.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.113.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.113.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.114.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.114.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.114.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.115.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.115.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.115.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.116.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.116.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.116.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.117.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.117.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.117.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.118.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.118.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.118.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.119.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.119.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.119.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.12.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.12.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.12.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.120.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.120.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.120.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.121.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.121.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.121.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.122.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.122.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.122.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.123.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.123.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.123.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.124.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.124.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.124.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.125.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.125.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.125.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.126.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.126.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.126.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.127.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.127.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.127.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.128.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.128.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.128.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.129.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.129.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.129.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.13.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.13.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.13.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.130.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.130.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.130.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.131.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.131.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.131.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.132.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.132.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.132.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.133.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.133.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.133.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.134.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.134.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.134.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.135.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.135.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.135.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.136.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.136.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.136.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.137.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.137.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.137.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.138.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.138.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.138.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.139.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.139.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.139.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.14.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.14.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.14.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.140.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.140.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.140.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.141.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.141.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.141.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.142.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.142.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.142.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.143.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.143.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.143.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.144.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.144.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.144.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.145.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.145.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.145.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.146.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.146.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.146.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.147.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.147.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.147.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.148.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.148.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.148.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.149.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.149.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.149.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.15.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.15.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.15.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.150.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.150.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.150.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.151.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.151.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.151.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.152.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.152.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.152.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.153.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.153.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.153.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.154.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.154.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.154.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.155.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.155.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.155.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.156.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.156.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.156.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.157.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.157.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.157.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.158.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.158.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.158.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.159.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.159.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.159.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.16.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.16.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.16.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.17.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.17.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.17.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.18.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.18.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.18.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.19.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.19.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.19.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.2.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.2.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.2.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.20.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.20.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.20.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.21.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.21.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.21.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.22.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.22.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.22.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.23.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.23.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.23.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.24.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.24.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.24.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.25.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.25.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.25.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.26.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.26.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.26.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.27.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.27.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.27.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.28.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.28.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.28.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.29.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.29.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.29.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.3.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.3.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.3.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.30.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.30.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.30.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.31.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.31.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.31.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.32.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.32.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.32.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.33.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.33.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.33.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.34.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.34.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.34.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.35.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.35.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.35.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.36.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.36.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.36.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.37.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.37.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.37.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.38.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.38.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.38.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.39.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.39.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.39.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.4.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.4.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.4.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.40.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.40.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.40.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.41.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.41.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.41.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.42.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.42.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.42.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.43.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.43.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.43.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.44.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.44.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.44.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.45.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.45.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.45.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.46.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.46.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.46.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.47.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.47.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.47.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.48.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.48.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.48.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.49.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.49.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.49.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.5.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.5.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.5.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.50.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.50.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.50.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.51.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.51.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.51.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.52.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.52.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.52.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.53.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.53.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.53.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.54.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.54.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.54.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.55.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.55.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.55.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.56.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.56.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.56.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.57.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.57.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.57.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.58.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.58.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.58.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.59.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.59.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.59.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.6.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.6.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.6.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.60.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.60.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.60.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.61.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.61.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.61.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.62.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.62.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.62.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.63.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.63.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.63.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.64.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.64.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.64.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.65.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.65.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.65.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.66.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.66.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.66.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.67.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.67.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.67.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.68.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.68.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.68.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.69.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.69.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.69.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.7.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.7.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.7.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.70.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.70.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.70.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.71.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.71.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.71.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.72.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.72.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.72.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.73.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.73.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.73.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.74.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.74.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.74.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.75.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.75.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.75.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.76.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.76.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.76.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.77.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.77.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.77.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.78.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.78.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.78.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.79.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.79.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.79.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.8.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.8.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.8.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.80.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.80.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.80.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.81.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.81.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.81.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.82.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.82.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.82.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.83.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.83.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.83.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.84.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.84.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.84.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.85.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.85.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.85.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.86.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.86.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.86.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.87.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.87.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.87.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.88.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.88.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.88.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.89.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.89.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.89.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.9.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.9.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.9.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.90.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.90.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.90.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.91.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.91.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.91.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.92.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.92.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.92.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.93.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.93.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.93.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.94.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.94.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.94.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.95.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.95.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.95.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.96.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.96.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.96.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.97.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.97.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.97.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.98.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.98.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.98.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.99.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.99.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.experts.99.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.gate.e_score_correction_bias": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.gate.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.shared_experts.down_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.shared_experts.gate_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.mlp.shared_experts.up_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.post_attention_layernorm.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.k_norm.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.k_proj.bias": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.k_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.o_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.q_norm.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.q_proj.bias": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.q_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.v_proj.bias": "model-00059-of-00092.safetensors",
+ "model.layers.58.self_attn.v_proj.weight": "model-00059-of-00092.safetensors",
+ "model.layers.59.input_layernorm.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.0.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.0.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.0.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.1.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.1.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.1.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.10.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.10.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.10.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.100.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.100.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.100.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.101.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.101.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.101.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.102.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.102.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.102.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.103.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.103.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.103.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.104.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.104.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.104.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.105.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.105.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.105.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.106.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.106.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.106.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.107.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.107.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.107.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.108.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.108.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.108.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.109.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.109.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.109.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.11.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.11.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.11.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.110.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.110.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.110.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.111.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.111.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.111.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.112.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.112.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.112.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.113.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.113.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.113.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.114.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.114.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.114.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.115.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.115.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.115.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.116.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.116.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.116.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.117.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.117.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.117.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.118.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.118.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.118.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.119.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.119.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.119.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.12.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.12.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.12.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.120.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.120.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.120.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.121.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.121.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.121.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.122.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.122.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.122.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.123.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.123.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.123.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.124.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.124.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.124.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.125.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.125.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.125.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.126.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.126.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.126.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.127.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.127.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.127.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.128.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.128.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.128.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.129.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.129.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.129.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.13.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.13.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.13.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.130.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.130.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.130.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.131.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.131.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.131.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.132.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.132.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.132.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.133.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.133.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.133.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.134.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.134.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.134.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.135.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.135.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.135.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.136.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.136.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.136.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.137.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.137.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.137.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.138.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.138.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.138.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.139.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.139.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.139.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.14.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.14.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.14.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.140.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.140.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.140.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.141.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.141.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.141.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.142.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.142.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.142.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.143.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.143.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.143.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.144.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.144.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.144.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.145.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.145.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.145.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.146.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.146.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.146.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.147.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.147.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.147.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.148.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.148.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.148.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.149.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.149.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.149.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.15.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.15.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.15.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.150.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.150.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.150.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.151.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.151.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.151.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.152.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.152.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.152.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.153.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.153.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.153.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.154.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.154.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.154.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.155.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.155.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.155.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.156.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.156.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.156.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.157.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.157.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.157.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.158.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.158.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.158.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.159.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.159.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.159.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.16.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.16.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.16.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.17.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.17.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.17.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.18.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.18.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.18.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.19.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.19.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.19.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.2.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.2.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.2.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.20.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.20.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.20.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.21.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.21.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.21.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.22.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.22.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.22.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.23.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.23.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.23.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.24.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.24.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.24.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.25.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.25.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.25.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.26.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.26.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.26.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.27.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.27.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.27.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.28.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.28.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.28.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.29.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.29.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.29.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.3.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.3.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.3.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.30.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.30.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.30.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.31.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.31.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.31.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.32.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.32.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.32.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.33.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.33.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.33.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.34.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.34.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.34.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.35.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.35.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.35.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.36.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.36.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.36.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.37.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.37.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.37.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.38.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.38.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.38.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.39.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.39.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.39.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.4.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.4.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.4.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.40.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.40.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.40.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.41.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.41.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.41.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.42.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.42.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.42.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.43.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.43.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.43.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.44.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.44.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.44.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.45.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.45.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.45.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.46.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.46.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.46.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.47.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.47.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.47.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.48.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.48.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.48.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.49.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.49.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.49.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.5.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.5.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.5.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.50.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.50.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.50.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.51.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.51.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.51.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.52.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.52.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.52.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.53.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.53.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.53.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.54.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.54.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.54.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.55.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.55.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.55.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.56.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.56.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.56.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.57.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.57.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.57.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.58.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.58.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.58.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.59.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.59.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.59.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.6.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.6.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.6.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.60.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.60.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.60.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.61.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.61.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.61.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.62.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.62.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.62.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.63.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.63.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.63.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.64.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.64.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.64.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.65.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.65.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.65.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.66.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.66.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.66.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.67.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.67.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.67.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.68.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.68.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.68.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.69.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.69.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.69.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.7.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.7.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.7.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.70.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.70.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.70.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.71.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.71.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.71.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.72.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.72.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.72.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.73.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.73.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.73.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.74.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.74.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.74.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.75.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.75.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.75.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.76.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.76.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.76.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.77.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.77.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.77.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.78.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.78.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.78.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.79.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.79.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.79.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.8.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.8.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.8.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.80.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.80.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.80.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.81.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.81.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.81.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.82.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.82.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.82.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.83.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.83.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.83.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.84.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.84.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.84.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.85.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.85.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.85.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.86.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.86.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.86.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.87.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.87.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.87.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.88.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.88.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.88.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.89.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.89.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.89.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.9.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.9.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.9.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.90.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.90.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.90.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.91.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.91.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.91.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.92.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.92.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.92.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.93.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.93.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.93.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.94.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.94.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.94.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.95.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.95.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.95.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.96.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.96.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.96.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.97.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.97.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.97.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.98.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.98.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.98.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.99.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.99.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.experts.99.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.gate.e_score_correction_bias": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.gate.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.shared_experts.down_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.shared_experts.gate_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.mlp.shared_experts.up_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.post_attention_layernorm.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.k_norm.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.k_proj.bias": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.k_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.o_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.q_norm.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.q_proj.bias": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.q_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.v_proj.bias": "model-00060-of-00092.safetensors",
+ "model.layers.59.self_attn.v_proj.weight": "model-00060-of-00092.safetensors",
+ "model.layers.60.input_layernorm.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.0.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.0.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.0.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.1.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.1.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.1.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.10.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.10.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.10.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.100.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.100.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.100.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.101.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.101.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.101.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.102.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.102.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.102.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.103.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.103.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.103.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.104.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.104.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.104.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.105.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.105.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.105.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.106.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.106.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.106.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.107.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.107.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.107.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.108.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.108.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.108.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.109.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.109.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.109.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.11.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.11.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.11.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.110.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.110.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.110.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.111.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.111.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.111.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.112.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.112.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.112.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.113.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.113.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.113.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.114.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.114.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.114.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.115.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.115.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.115.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.116.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.116.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.116.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.117.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.117.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.117.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.118.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.118.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.118.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.119.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.119.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.119.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.12.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.12.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.12.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.120.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.120.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.120.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.121.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.121.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.121.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.122.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.122.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.122.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.123.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.123.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.123.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.124.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.124.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.124.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.125.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.125.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.125.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.126.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.126.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.126.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.127.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.127.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.127.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.128.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.128.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.128.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.129.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.129.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.129.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.13.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.13.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.13.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.130.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.130.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.130.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.131.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.131.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.131.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.132.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.132.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.132.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.133.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.133.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.133.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.134.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.134.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.134.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.135.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.135.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.135.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.136.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.136.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.136.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.137.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.137.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.137.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.138.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.138.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.138.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.139.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.139.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.139.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.14.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.14.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.14.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.140.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.140.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.140.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.141.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.141.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.141.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.142.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.142.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.142.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.143.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.143.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.143.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.144.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.144.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.144.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.145.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.145.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.145.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.146.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.146.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.146.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.147.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.147.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.147.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.148.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.148.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.148.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.149.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.149.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.149.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.15.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.15.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.15.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.150.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.150.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.150.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.151.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.151.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.151.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.152.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.152.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.152.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.153.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.153.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.153.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.154.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.154.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.154.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.155.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.155.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.155.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.156.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.156.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.156.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.157.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.157.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.157.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.158.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.158.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.158.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.159.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.159.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.159.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.16.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.16.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.16.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.17.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.17.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.17.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.18.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.18.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.18.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.19.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.19.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.19.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.2.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.2.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.2.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.20.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.20.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.20.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.21.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.21.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.21.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.22.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.22.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.22.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.23.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.23.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.23.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.24.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.24.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.24.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.25.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.25.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.25.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.26.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.26.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.26.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.27.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.27.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.27.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.28.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.28.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.28.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.29.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.29.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.29.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.3.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.3.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.3.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.30.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.30.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.30.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.31.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.31.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.31.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.32.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.32.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.32.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.33.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.33.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.33.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.34.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.34.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.34.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.35.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.35.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.35.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.36.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.36.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.36.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.37.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.37.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.37.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.38.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.38.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.38.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.39.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.39.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.39.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.4.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.4.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.4.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.40.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.40.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.40.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.41.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.41.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.41.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.42.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.42.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.42.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.43.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.43.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.43.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.44.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.44.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.44.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.45.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.45.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.45.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.46.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.46.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.46.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.47.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.47.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.47.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.48.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.48.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.48.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.49.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.49.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.49.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.5.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.5.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.5.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.50.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.50.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.50.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.51.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.51.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.51.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.52.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.52.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.52.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.53.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.53.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.53.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.54.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.54.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.54.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.55.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.55.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.55.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.56.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.56.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.56.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.57.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.57.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.57.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.58.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.58.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.58.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.59.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.59.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.59.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.6.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.6.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.6.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.60.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.60.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.60.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.61.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.61.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.61.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.62.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.62.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.62.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.63.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.63.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.63.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.64.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.64.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.64.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.65.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.65.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.65.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.66.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.66.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.66.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.67.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.67.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.67.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.68.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.68.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.68.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.69.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.69.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.69.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.7.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.7.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.7.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.70.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.70.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.70.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.71.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.71.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.71.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.72.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.72.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.72.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.73.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.73.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.73.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.74.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.74.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.74.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.75.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.75.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.75.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.76.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.76.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.76.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.77.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.77.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.77.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.78.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.78.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.78.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.79.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.79.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.79.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.8.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.8.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.8.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.80.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.80.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.80.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.81.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.81.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.81.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.82.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.82.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.82.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.83.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.83.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.83.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.84.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.84.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.84.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.85.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.85.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.85.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.86.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.86.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.86.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.87.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.87.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.87.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.88.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.88.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.88.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.89.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.89.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.89.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.9.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.9.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.9.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.90.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.90.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.90.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.91.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.91.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.91.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.92.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.92.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.92.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.93.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.93.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.93.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.94.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.94.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.94.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.95.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.95.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.95.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.96.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.96.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.96.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.97.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.97.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.97.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.98.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.98.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.98.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.99.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.99.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.experts.99.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.gate.e_score_correction_bias": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.gate.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.shared_experts.down_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.shared_experts.gate_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.mlp.shared_experts.up_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.post_attention_layernorm.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.k_norm.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.k_proj.bias": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.k_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.o_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.q_norm.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.q_proj.bias": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.q_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.v_proj.bias": "model-00061-of-00092.safetensors",
+ "model.layers.60.self_attn.v_proj.weight": "model-00061-of-00092.safetensors",
+ "model.layers.61.input_layernorm.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.0.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.0.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.0.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.1.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.1.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.1.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.10.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.10.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.10.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.100.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.100.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.100.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.101.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.101.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.101.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.102.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.102.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.102.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.103.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.103.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.103.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.104.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.104.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.104.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.105.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.105.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.105.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.106.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.106.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.106.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.107.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.107.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.107.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.108.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.108.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.108.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.109.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.109.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.109.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.11.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.11.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.11.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.110.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.110.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.110.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.111.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.111.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.111.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.112.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.112.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.112.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.113.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.113.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.113.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.114.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.114.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.114.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.115.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.115.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.115.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.116.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.116.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.116.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.117.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.117.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.117.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.118.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.118.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.118.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.119.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.119.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.119.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.12.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.12.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.12.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.120.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.120.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.120.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.121.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.121.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.121.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.122.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.122.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.122.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.123.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.123.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.123.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.124.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.124.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.124.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.125.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.125.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.125.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.126.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.126.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.126.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.127.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.127.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.127.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.128.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.128.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.128.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.129.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.129.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.129.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.13.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.13.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.13.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.130.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.130.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.130.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.131.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.131.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.131.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.132.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.132.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.132.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.133.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.133.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.133.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.134.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.134.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.134.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.135.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.135.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.135.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.136.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.136.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.136.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.137.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.137.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.137.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.138.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.138.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.138.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.139.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.139.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.139.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.14.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.14.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.14.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.140.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.140.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.140.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.141.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.141.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.141.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.142.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.142.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.142.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.143.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.143.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.143.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.144.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.144.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.144.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.145.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.145.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.145.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.146.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.146.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.146.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.147.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.147.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.147.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.148.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.148.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.148.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.149.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.149.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.149.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.15.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.15.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.15.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.150.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.150.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.150.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.151.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.151.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.151.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.152.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.152.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.152.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.153.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.153.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.153.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.154.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.154.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.154.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.155.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.155.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.155.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.156.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.156.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.156.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.157.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.157.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.157.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.158.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.158.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.158.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.159.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.159.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.159.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.16.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.16.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.16.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.17.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.17.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.17.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.18.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.18.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.18.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.19.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.19.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.19.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.2.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.2.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.2.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.20.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.20.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.20.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.21.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.21.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.21.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.22.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.22.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.22.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.23.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.23.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.23.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.24.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.24.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.24.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.25.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.25.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.25.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.26.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.26.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.26.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.27.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.27.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.27.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.28.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.28.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.28.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.29.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.29.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.29.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.3.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.3.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.3.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.30.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.30.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.30.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.31.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.31.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.31.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.32.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.32.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.32.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.33.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.33.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.33.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.34.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.34.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.34.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.35.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.35.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.35.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.36.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.36.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.36.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.37.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.37.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.37.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.38.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.38.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.38.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.39.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.39.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.39.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.4.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.4.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.4.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.40.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.40.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.40.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.41.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.41.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.41.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.42.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.42.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.42.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.43.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.43.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.43.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.44.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.44.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.44.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.45.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.45.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.45.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.46.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.46.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.46.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.47.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.47.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.47.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.48.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.48.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.48.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.49.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.49.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.49.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.5.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.5.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.5.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.50.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.50.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.50.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.51.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.51.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.51.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.52.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.52.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.52.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.53.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.53.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.53.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.54.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.54.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.54.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.55.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.55.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.55.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.56.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.56.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.56.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.57.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.57.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.57.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.58.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.58.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.58.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.59.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.59.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.59.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.6.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.6.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.6.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.60.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.60.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.60.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.61.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.61.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.61.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.62.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.62.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.62.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.63.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.63.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.63.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.64.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.64.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.64.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.65.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.65.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.65.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.66.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.66.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.66.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.67.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.67.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.67.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.68.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.68.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.68.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.69.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.69.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.69.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.7.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.7.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.7.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.70.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.70.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.70.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.71.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.71.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.71.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.72.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.72.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.72.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.73.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.73.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.73.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.74.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.74.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.74.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.75.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.75.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.75.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.76.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.76.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.76.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.77.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.77.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.77.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.78.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.78.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.78.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.79.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.79.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.79.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.8.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.8.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.8.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.80.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.80.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.80.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.81.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.81.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.81.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.82.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.82.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.82.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.83.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.83.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.83.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.84.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.84.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.84.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.85.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.85.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.85.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.86.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.86.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.86.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.87.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.87.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.87.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.88.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.88.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.88.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.89.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.89.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.89.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.9.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.9.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.9.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.90.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.90.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.90.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.91.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.91.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.91.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.92.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.92.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.92.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.93.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.93.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.93.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.94.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.94.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.94.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.95.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.95.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.95.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.96.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.96.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.96.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.97.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.97.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.97.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.98.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.98.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.98.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.99.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.99.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.experts.99.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.gate.e_score_correction_bias": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.gate.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.shared_experts.down_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.shared_experts.gate_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.mlp.shared_experts.up_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.post_attention_layernorm.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.k_norm.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.k_proj.bias": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.k_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.o_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.q_norm.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.q_proj.bias": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.q_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.v_proj.bias": "model-00062-of-00092.safetensors",
+ "model.layers.61.self_attn.v_proj.weight": "model-00062-of-00092.safetensors",
+ "model.layers.62.input_layernorm.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.0.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.0.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.0.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.1.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.1.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.1.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.10.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.10.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.10.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.100.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.100.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.100.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.101.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.101.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.101.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.102.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.102.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.102.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.103.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.103.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.103.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.104.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.104.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.104.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.105.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.105.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.105.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.106.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.106.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.106.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.107.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.107.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.107.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.108.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.108.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.108.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.109.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.109.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.109.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.11.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.11.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.11.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.110.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.110.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.110.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.111.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.111.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.111.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.112.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.112.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.112.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.113.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.113.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.113.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.114.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.114.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.114.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.115.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.115.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.115.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.116.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.116.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.116.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.117.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.117.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.117.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.118.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.118.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.118.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.119.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.119.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.119.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.12.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.12.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.12.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.120.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.120.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.120.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.121.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.121.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.121.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.122.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.122.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.122.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.123.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.123.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.123.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.124.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.124.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.124.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.125.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.125.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.125.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.126.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.126.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.126.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.127.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.127.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.127.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.128.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.128.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.128.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.129.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.129.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.129.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.13.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.13.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.13.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.130.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.130.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.130.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.131.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.131.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.131.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.132.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.132.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.132.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.133.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.133.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.133.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.134.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.134.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.134.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.135.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.135.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.135.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.136.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.136.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.136.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.137.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.137.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.137.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.138.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.138.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.138.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.139.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.139.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.139.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.14.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.14.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.14.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.140.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.140.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.140.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.141.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.141.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.141.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.142.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.142.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.142.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.143.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.143.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.143.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.144.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.144.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.144.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.145.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.145.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.145.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.146.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.146.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.146.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.147.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.147.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.147.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.148.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.148.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.148.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.149.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.149.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.149.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.15.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.15.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.15.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.150.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.150.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.150.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.151.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.151.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.151.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.152.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.152.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.152.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.153.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.153.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.153.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.154.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.154.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.154.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.155.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.155.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.155.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.156.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.156.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.156.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.157.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.157.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.157.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.158.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.158.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.158.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.159.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.159.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.159.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.16.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.16.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.16.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.17.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.17.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.17.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.18.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.18.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.18.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.19.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.19.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.19.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.2.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.2.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.2.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.20.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.20.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.20.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.21.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.21.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.21.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.22.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.22.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.22.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.23.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.23.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.23.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.24.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.24.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.24.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.25.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.25.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.25.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.26.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.26.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.26.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.27.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.27.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.27.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.28.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.28.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.28.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.29.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.29.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.29.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.3.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.3.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.3.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.30.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.30.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.30.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.31.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.31.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.31.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.32.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.32.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.32.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.33.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.33.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.33.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.34.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.34.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.34.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.35.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.35.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.35.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.36.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.36.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.36.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.37.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.37.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.37.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.38.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.38.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.38.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.39.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.39.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.39.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.4.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.4.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.4.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.40.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.40.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.40.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.41.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.41.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.41.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.42.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.42.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.42.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.43.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.43.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.43.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.44.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.44.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.44.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.45.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.45.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.45.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.46.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.46.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.46.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.47.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.47.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.47.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.48.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.48.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.48.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.49.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.49.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.49.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.5.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.5.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.5.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.50.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.50.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.50.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.51.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.51.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.51.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.52.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.52.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.52.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.53.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.53.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.53.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.54.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.54.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.54.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.55.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.55.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.55.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.56.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.56.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.56.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.57.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.57.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.57.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.58.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.58.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.58.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.59.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.59.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.59.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.6.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.6.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.6.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.60.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.60.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.60.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.61.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.61.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.61.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.62.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.62.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.62.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.63.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.63.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.63.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.64.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.64.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.64.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.65.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.65.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.65.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.66.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.66.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.66.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.67.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.67.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.67.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.68.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.68.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.68.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.69.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.69.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.69.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.7.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.7.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.7.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.70.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.70.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.70.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.71.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.71.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.71.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.72.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.72.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.72.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.73.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.73.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.73.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.74.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.74.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.74.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.75.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.75.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.75.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.76.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.76.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.76.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.77.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.77.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.77.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.78.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.78.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.78.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.79.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.79.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.79.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.8.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.8.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.8.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.80.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.80.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.80.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.81.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.81.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.81.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.82.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.82.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.82.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.83.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.83.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.83.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.84.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.84.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.84.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.85.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.85.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.85.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.86.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.86.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.86.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.87.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.87.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.87.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.88.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.88.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.88.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.89.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.89.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.89.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.9.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.9.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.9.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.90.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.90.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.90.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.91.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.91.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.91.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.92.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.92.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.92.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.93.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.93.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.93.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.94.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.94.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.94.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.95.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.95.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.95.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.96.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.96.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.96.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.97.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.97.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.97.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.98.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.98.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.98.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.99.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.99.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.experts.99.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.gate.e_score_correction_bias": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.gate.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.shared_experts.down_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.shared_experts.gate_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.mlp.shared_experts.up_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.post_attention_layernorm.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.k_norm.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.k_proj.bias": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.k_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.o_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.q_norm.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.q_proj.bias": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.q_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.v_proj.bias": "model-00063-of-00092.safetensors",
+ "model.layers.62.self_attn.v_proj.weight": "model-00063-of-00092.safetensors",
+ "model.layers.63.input_layernorm.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.0.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.0.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.0.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.1.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.1.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.1.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.10.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.10.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.10.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.100.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.100.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.100.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.101.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.101.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.101.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.102.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.102.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.102.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.103.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.103.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.103.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.104.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.104.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.104.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.105.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.105.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.105.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.106.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.106.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.106.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.107.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.107.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.107.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.108.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.108.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.108.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.109.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.109.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.109.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.11.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.11.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.11.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.110.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.110.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.110.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.111.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.111.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.111.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.112.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.112.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.112.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.113.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.113.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.113.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.114.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.114.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.114.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.115.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.115.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.115.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.116.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.116.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.116.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.117.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.117.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.117.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.118.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.118.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.118.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.119.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.119.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.119.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.12.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.12.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.12.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.120.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.120.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.120.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.121.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.121.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.121.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.122.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.122.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.122.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.123.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.123.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.123.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.124.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.124.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.124.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.125.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.125.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.125.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.126.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.126.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.126.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.127.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.127.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.127.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.128.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.128.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.128.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.129.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.129.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.129.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.13.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.13.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.13.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.130.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.130.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.130.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.131.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.131.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.131.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.132.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.132.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.132.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.133.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.133.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.133.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.134.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.134.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.134.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.135.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.135.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.135.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.136.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.136.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.136.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.137.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.137.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.137.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.138.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.138.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.138.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.139.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.139.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.139.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.14.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.14.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.14.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.140.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.140.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.140.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.141.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.141.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.141.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.142.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.142.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.142.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.143.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.143.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.143.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.144.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.144.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.144.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.145.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.145.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.145.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.146.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.146.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.146.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.147.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.147.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.147.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.148.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.148.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.148.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.149.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.149.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.149.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.15.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.15.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.15.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.150.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.150.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.150.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.151.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.151.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.151.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.152.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.152.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.152.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.153.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.153.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.153.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.154.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.154.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.154.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.155.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.155.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.155.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.156.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.156.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.156.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.157.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.157.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.157.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.158.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.158.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.158.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.159.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.159.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.159.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.16.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.16.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.16.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.17.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.17.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.17.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.18.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.18.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.18.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.19.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.19.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.19.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.2.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.2.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.2.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.20.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.20.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.20.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.21.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.21.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.21.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.22.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.22.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.22.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.23.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.23.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.23.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.24.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.24.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.24.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.25.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.25.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.25.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.26.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.26.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.26.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.27.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.27.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.27.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.28.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.28.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.28.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.29.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.29.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.29.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.3.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.3.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.3.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.30.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.30.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.30.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.31.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.31.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.31.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.32.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.32.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.32.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.33.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.33.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.33.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.34.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.34.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.34.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.35.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.35.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.35.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.36.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.36.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.36.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.37.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.37.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.37.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.38.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.38.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.38.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.39.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.39.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.39.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.4.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.4.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.4.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.40.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.40.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.40.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.41.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.41.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.41.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.42.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.42.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.42.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.43.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.43.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.43.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.44.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.44.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.44.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.45.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.45.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.45.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.46.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.46.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.46.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.47.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.47.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.47.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.48.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.48.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.48.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.49.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.49.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.49.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.5.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.5.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.5.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.50.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.50.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.50.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.51.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.51.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.51.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.52.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.52.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.52.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.53.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.53.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.53.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.54.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.54.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.54.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.55.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.55.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.55.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.56.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.56.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.56.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.57.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.57.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.57.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.58.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.58.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.58.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.59.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.59.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.59.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.6.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.6.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.6.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.60.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.60.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.60.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.61.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.61.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.61.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.62.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.62.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.62.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.63.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.63.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.63.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.64.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.64.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.64.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.65.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.65.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.65.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.66.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.66.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.66.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.67.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.67.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.67.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.68.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.68.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.68.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.69.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.69.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.69.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.7.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.7.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.7.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.70.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.70.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.70.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.71.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.71.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.71.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.72.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.72.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.72.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.73.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.73.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.73.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.74.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.74.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.74.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.75.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.75.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.75.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.76.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.76.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.76.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.77.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.77.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.77.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.78.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.78.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.78.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.79.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.79.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.79.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.8.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.8.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.8.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.80.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.80.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.80.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.81.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.81.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.81.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.82.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.82.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.82.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.83.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.83.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.83.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.84.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.84.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.84.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.85.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.85.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.85.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.86.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.86.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.86.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.87.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.87.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.87.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.88.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.88.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.88.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.89.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.89.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.89.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.9.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.9.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.9.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.90.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.90.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.90.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.91.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.91.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.91.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.92.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.92.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.92.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.93.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.93.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.93.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.94.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.94.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.94.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.95.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.95.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.95.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.96.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.96.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.96.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.97.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.97.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.97.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.98.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.98.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.98.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.99.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.99.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.experts.99.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.gate.e_score_correction_bias": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.gate.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.shared_experts.down_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.shared_experts.gate_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.mlp.shared_experts.up_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.post_attention_layernorm.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.k_norm.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.k_proj.bias": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.k_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.o_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.q_norm.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.q_proj.bias": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.q_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.v_proj.bias": "model-00064-of-00092.safetensors",
+ "model.layers.63.self_attn.v_proj.weight": "model-00064-of-00092.safetensors",
+ "model.layers.64.input_layernorm.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.0.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.0.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.0.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.1.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.1.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.1.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.10.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.10.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.10.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.100.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.100.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.100.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.101.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.101.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.101.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.102.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.102.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.102.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.103.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.103.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.103.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.104.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.104.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.104.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.105.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.105.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.105.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.106.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.106.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.106.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.107.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.107.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.107.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.108.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.108.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.108.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.109.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.109.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.109.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.11.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.11.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.11.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.110.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.110.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.110.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.111.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.111.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.111.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.112.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.112.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.112.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.113.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.113.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.113.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.114.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.114.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.114.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.115.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.115.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.115.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.116.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.116.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.116.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.117.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.117.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.117.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.118.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.118.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.118.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.119.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.119.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.119.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.12.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.12.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.12.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.120.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.120.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.120.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.121.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.121.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.121.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.122.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.122.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.122.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.123.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.123.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.123.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.124.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.124.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.124.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.125.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.125.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.125.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.126.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.126.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.126.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.127.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.127.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.127.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.128.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.128.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.128.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.129.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.129.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.129.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.13.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.13.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.13.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.130.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.130.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.130.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.131.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.131.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.131.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.132.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.132.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.132.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.133.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.133.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.133.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.134.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.134.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.134.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.135.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.135.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.135.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.136.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.136.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.136.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.137.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.137.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.137.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.138.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.138.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.138.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.139.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.139.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.139.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.14.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.14.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.14.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.140.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.140.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.140.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.141.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.141.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.141.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.142.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.142.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.142.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.143.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.143.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.143.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.144.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.144.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.144.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.145.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.145.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.145.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.146.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.146.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.146.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.147.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.147.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.147.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.148.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.148.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.148.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.149.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.149.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.149.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.15.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.15.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.15.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.150.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.150.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.150.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.151.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.151.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.151.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.152.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.152.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.152.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.153.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.153.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.153.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.154.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.154.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.154.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.155.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.155.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.155.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.156.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.156.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.156.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.157.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.157.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.157.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.158.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.158.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.158.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.159.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.159.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.159.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.16.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.16.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.16.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.17.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.17.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.17.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.18.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.18.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.18.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.19.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.19.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.19.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.2.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.2.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.2.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.20.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.20.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.20.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.21.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.21.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.21.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.22.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.22.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.22.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.23.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.23.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.23.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.24.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.24.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.24.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.25.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.25.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.25.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.26.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.26.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.26.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.27.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.27.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.27.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.28.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.28.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.28.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.29.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.29.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.29.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.3.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.3.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.3.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.30.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.30.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.30.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.31.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.31.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.31.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.32.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.32.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.32.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.33.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.33.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.33.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.34.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.34.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.34.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.35.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.35.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.35.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.36.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.36.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.36.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.37.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.37.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.37.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.38.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.38.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.38.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.39.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.39.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.39.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.4.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.4.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.4.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.40.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.40.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.40.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.41.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.41.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.41.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.42.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.42.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.42.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.43.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.43.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.43.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.44.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.44.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.44.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.45.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.45.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.45.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.46.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.46.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.46.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.47.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.47.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.47.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.48.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.48.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.48.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.49.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.49.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.49.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.5.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.5.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.5.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.50.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.50.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.50.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.51.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.51.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.51.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.52.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.52.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.52.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.53.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.53.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.53.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.54.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.54.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.54.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.55.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.55.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.55.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.56.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.56.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.56.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.57.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.57.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.57.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.58.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.58.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.58.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.59.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.59.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.59.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.6.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.6.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.6.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.60.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.60.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.60.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.61.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.61.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.61.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.62.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.62.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.62.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.63.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.63.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.63.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.64.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.64.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.64.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.65.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.65.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.65.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.66.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.66.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.66.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.67.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.67.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.67.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.68.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.68.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.68.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.69.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.69.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.69.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.7.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.7.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.7.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.70.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.70.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.70.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.71.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.71.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.71.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.72.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.72.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.72.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.73.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.73.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.73.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.74.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.74.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.74.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.75.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.75.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.75.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.76.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.76.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.76.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.77.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.77.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.77.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.78.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.78.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.78.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.79.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.79.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.79.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.8.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.8.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.8.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.80.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.80.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.80.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.81.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.81.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.81.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.82.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.82.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.82.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.83.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.83.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.83.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.84.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.84.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.84.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.85.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.85.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.85.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.86.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.86.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.86.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.87.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.87.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.87.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.88.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.88.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.88.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.89.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.89.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.89.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.9.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.9.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.9.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.90.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.90.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.90.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.91.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.91.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.91.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.92.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.92.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.92.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.93.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.93.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.93.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.94.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.94.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.94.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.95.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.95.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.95.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.96.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.96.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.96.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.97.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.97.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.97.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.98.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.98.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.98.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.99.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.99.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.experts.99.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.gate.e_score_correction_bias": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.gate.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.shared_experts.down_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.shared_experts.gate_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.mlp.shared_experts.up_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.post_attention_layernorm.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.k_norm.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.k_proj.bias": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.k_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.o_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.q_norm.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.q_proj.bias": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.q_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.v_proj.bias": "model-00065-of-00092.safetensors",
+ "model.layers.64.self_attn.v_proj.weight": "model-00065-of-00092.safetensors",
+ "model.layers.65.input_layernorm.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.0.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.0.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.0.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.1.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.1.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.1.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.10.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.10.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.10.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.100.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.100.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.100.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.101.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.101.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.101.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.102.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.102.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.102.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.103.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.103.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.103.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.104.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.104.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.104.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.105.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.105.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.105.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.106.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.106.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.106.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.107.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.107.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.107.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.108.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.108.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.108.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.109.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.109.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.109.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.11.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.11.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.11.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.110.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.110.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.110.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.111.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.111.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.111.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.112.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.112.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.112.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.113.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.113.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.113.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.114.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.114.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.114.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.115.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.115.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.115.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.116.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.116.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.116.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.117.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.117.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.117.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.118.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.118.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.118.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.119.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.119.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.119.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.12.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.12.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.12.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.120.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.120.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.120.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.121.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.121.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.121.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.122.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.122.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.122.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.123.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.123.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.123.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.124.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.124.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.124.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.125.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.125.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.125.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.126.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.126.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.126.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.127.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.127.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.127.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.128.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.128.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.128.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.129.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.129.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.129.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.13.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.13.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.13.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.130.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.130.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.130.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.131.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.131.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.131.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.132.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.132.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.132.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.133.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.133.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.133.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.134.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.134.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.134.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.135.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.135.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.135.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.136.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.136.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.136.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.137.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.137.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.137.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.138.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.138.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.138.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.139.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.139.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.139.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.14.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.14.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.14.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.140.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.140.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.140.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.141.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.141.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.141.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.142.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.142.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.142.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.143.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.143.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.143.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.144.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.144.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.144.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.145.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.145.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.145.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.146.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.146.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.146.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.147.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.147.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.147.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.148.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.148.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.148.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.149.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.149.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.149.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.15.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.15.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.15.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.150.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.150.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.150.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.151.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.151.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.151.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.152.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.152.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.152.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.153.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.153.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.153.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.154.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.154.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.154.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.155.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.155.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.155.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.156.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.156.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.156.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.157.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.157.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.157.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.158.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.158.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.158.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.159.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.159.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.159.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.16.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.16.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.16.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.17.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.17.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.17.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.18.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.18.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.18.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.19.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.19.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.19.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.2.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.2.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.2.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.20.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.20.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.20.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.21.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.21.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.21.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.22.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.22.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.22.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.23.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.23.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.23.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.24.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.24.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.24.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.25.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.25.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.25.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.26.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.26.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.26.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.27.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.27.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.27.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.28.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.28.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.28.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.29.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.29.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.29.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.3.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.3.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.3.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.30.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.30.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.30.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.31.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.31.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.31.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.32.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.32.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.32.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.33.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.33.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.33.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.34.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.34.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.34.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.35.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.35.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.35.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.36.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.36.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.36.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.37.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.37.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.37.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.38.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.38.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.38.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.39.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.39.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.39.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.4.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.4.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.4.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.40.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.40.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.40.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.41.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.41.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.41.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.42.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.42.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.42.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.43.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.43.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.43.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.44.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.44.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.44.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.45.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.45.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.45.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.46.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.46.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.46.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.47.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.47.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.47.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.48.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.48.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.48.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.49.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.49.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.49.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.5.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.5.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.5.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.50.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.50.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.50.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.51.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.51.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.51.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.52.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.52.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.52.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.53.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.53.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.53.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.54.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.54.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.54.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.55.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.55.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.55.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.56.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.56.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.56.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.57.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.57.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.57.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.58.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.58.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.58.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.59.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.59.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.59.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.6.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.6.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.6.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.60.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.60.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.60.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.61.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.61.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.61.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.62.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.62.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.62.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.63.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.63.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.63.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.64.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.64.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.64.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.65.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.65.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.65.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.66.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.66.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.66.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.67.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.67.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.67.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.68.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.68.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.68.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.69.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.69.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.69.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.7.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.7.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.7.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.70.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.70.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.70.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.71.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.71.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.71.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.72.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.72.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.72.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.73.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.73.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.73.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.74.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.74.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.74.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.75.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.75.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.75.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.76.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.76.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.76.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.77.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.77.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.77.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.78.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.78.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.78.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.79.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.79.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.79.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.8.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.8.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.8.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.80.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.80.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.80.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.81.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.81.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.81.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.82.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.82.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.82.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.83.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.83.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.83.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.84.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.84.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.84.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.85.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.85.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.85.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.86.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.86.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.86.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.87.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.87.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.87.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.88.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.88.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.88.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.89.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.89.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.89.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.9.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.9.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.9.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.90.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.90.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.90.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.91.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.91.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.91.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.92.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.92.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.92.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.93.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.93.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.93.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.94.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.94.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.94.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.95.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.95.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.95.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.96.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.96.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.96.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.97.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.97.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.97.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.98.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.98.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.98.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.99.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.99.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.experts.99.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.gate.e_score_correction_bias": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.gate.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.shared_experts.down_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.shared_experts.gate_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.mlp.shared_experts.up_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.post_attention_layernorm.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.k_norm.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.k_proj.bias": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.k_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.o_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.q_norm.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.q_proj.bias": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.q_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.v_proj.bias": "model-00066-of-00092.safetensors",
+ "model.layers.65.self_attn.v_proj.weight": "model-00066-of-00092.safetensors",
+ "model.layers.66.input_layernorm.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.0.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.0.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.0.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.1.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.1.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.1.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.10.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.10.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.10.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.100.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.100.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.100.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.101.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.101.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.101.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.102.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.102.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.102.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.103.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.103.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.103.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.104.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.104.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.104.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.105.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.105.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.105.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.106.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.106.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.106.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.107.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.107.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.107.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.108.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.108.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.108.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.109.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.109.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.109.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.11.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.11.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.11.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.110.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.110.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.110.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.111.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.111.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.111.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.112.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.112.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.112.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.113.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.113.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.113.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.114.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.114.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.114.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.115.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.115.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.115.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.116.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.116.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.116.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.117.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.117.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.117.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.118.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.118.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.118.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.119.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.119.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.119.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.12.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.12.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.12.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.120.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.120.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.120.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.121.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.121.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.121.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.122.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.122.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.122.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.123.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.123.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.123.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.124.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.124.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.124.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.125.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.125.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.125.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.126.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.126.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.126.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.127.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.127.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.127.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.128.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.128.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.128.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.129.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.129.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.129.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.13.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.13.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.13.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.130.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.130.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.130.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.131.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.131.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.131.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.132.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.132.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.132.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.133.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.133.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.133.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.134.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.134.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.134.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.135.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.135.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.135.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.136.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.136.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.136.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.137.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.137.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.137.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.138.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.138.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.138.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.139.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.139.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.139.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.14.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.14.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.14.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.140.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.140.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.140.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.141.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.141.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.141.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.142.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.142.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.142.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.143.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.143.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.143.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.144.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.144.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.144.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.145.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.145.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.145.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.146.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.146.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.146.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.147.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.147.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.147.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.148.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.148.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.148.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.149.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.149.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.149.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.15.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.15.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.15.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.150.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.150.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.150.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.151.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.151.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.151.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.152.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.152.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.152.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.153.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.153.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.153.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.154.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.154.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.154.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.155.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.155.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.155.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.156.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.156.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.156.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.157.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.157.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.157.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.158.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.158.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.158.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.159.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.159.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.159.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.16.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.16.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.16.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.17.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.17.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.17.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.18.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.18.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.18.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.19.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.19.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.19.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.2.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.2.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.2.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.20.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.20.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.20.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.21.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.21.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.21.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.22.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.22.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.22.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.23.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.23.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.23.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.24.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.24.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.24.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.25.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.25.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.25.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.26.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.26.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.26.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.27.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.27.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.27.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.28.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.28.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.28.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.29.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.29.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.29.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.3.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.3.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.3.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.30.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.30.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.30.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.31.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.31.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.31.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.32.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.32.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.32.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.33.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.33.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.33.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.34.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.34.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.34.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.35.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.35.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.35.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.36.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.36.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.36.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.37.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.37.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.37.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.38.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.38.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.38.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.39.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.39.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.39.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.4.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.4.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.4.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.40.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.40.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.40.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.41.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.41.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.41.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.42.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.42.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.42.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.43.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.43.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.43.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.44.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.44.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.44.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.45.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.45.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.45.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.46.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.46.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.46.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.47.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.47.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.47.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.48.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.48.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.48.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.49.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.49.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.49.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.5.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.5.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.5.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.50.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.50.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.50.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.51.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.51.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.51.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.52.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.52.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.52.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.53.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.53.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.53.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.54.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.54.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.54.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.55.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.55.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.55.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.56.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.56.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.56.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.57.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.57.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.57.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.58.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.58.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.58.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.59.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.59.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.59.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.6.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.6.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.6.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.60.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.60.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.60.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.61.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.61.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.61.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.62.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.62.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.62.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.63.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.63.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.63.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.64.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.64.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.64.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.65.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.65.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.65.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.66.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.66.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.66.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.67.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.67.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.67.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.68.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.68.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.68.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.69.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.69.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.69.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.7.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.7.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.7.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.70.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.70.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.70.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.71.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.71.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.71.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.72.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.72.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.72.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.73.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.73.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.73.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.74.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.74.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.74.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.75.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.75.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.75.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.76.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.76.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.76.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.77.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.77.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.77.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.78.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.78.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.78.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.79.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.79.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.79.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.8.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.8.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.8.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.80.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.80.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.80.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.81.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.81.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.81.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.82.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.82.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.82.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.83.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.83.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.83.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.84.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.84.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.84.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.85.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.85.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.85.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.86.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.86.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.86.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.87.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.87.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.87.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.88.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.88.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.88.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.89.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.89.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.89.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.9.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.9.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.9.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.90.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.90.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.90.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.91.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.91.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.91.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.92.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.92.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.92.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.93.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.93.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.93.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.94.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.94.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.94.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.95.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.95.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.95.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.96.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.96.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.96.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.97.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.97.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.97.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.98.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.98.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.98.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.99.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.99.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.experts.99.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.gate.e_score_correction_bias": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.gate.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.shared_experts.down_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.shared_experts.gate_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.mlp.shared_experts.up_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.post_attention_layernorm.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.k_norm.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.k_proj.bias": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.k_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.o_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.q_norm.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.q_proj.bias": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.q_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.v_proj.bias": "model-00067-of-00092.safetensors",
+ "model.layers.66.self_attn.v_proj.weight": "model-00067-of-00092.safetensors",
+ "model.layers.67.input_layernorm.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.0.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.0.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.0.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.1.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.1.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.1.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.10.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.10.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.10.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.100.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.100.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.100.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.101.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.101.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.101.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.102.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.102.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.102.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.103.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.103.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.103.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.104.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.104.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.104.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.105.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.105.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.105.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.106.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.106.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.106.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.107.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.107.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.107.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.108.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.108.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.108.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.109.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.109.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.109.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.11.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.11.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.11.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.110.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.110.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.110.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.111.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.111.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.111.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.112.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.112.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.112.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.113.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.113.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.113.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.114.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.114.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.114.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.115.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.115.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.115.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.116.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.116.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.116.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.117.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.117.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.117.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.118.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.118.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.118.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.119.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.119.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.119.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.12.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.12.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.12.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.120.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.120.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.120.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.121.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.121.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.121.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.122.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.122.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.122.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.123.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.123.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.123.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.124.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.124.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.124.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.125.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.125.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.125.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.126.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.126.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.126.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.127.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.127.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.127.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.128.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.128.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.128.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.129.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.129.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.129.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.13.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.13.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.13.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.130.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.130.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.130.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.131.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.131.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.131.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.132.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.132.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.132.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.133.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.133.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.133.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.134.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.134.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.134.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.135.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.135.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.135.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.136.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.136.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.136.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.137.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.137.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.137.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.138.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.138.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.138.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.139.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.139.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.139.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.14.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.14.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.14.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.140.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.140.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.140.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.141.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.141.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.141.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.142.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.142.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.142.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.143.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.143.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.143.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.144.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.144.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.144.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.145.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.145.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.145.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.146.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.146.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.146.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.147.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.147.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.147.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.148.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.148.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.148.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.149.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.149.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.149.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.15.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.15.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.15.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.150.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.150.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.150.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.151.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.151.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.151.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.152.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.152.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.152.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.153.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.153.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.153.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.154.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.154.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.154.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.155.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.155.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.155.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.156.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.156.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.156.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.157.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.157.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.157.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.158.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.158.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.158.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.159.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.159.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.159.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.16.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.16.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.16.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.17.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.17.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.17.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.18.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.18.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.18.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.19.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.19.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.19.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.2.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.2.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.2.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.20.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.20.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.20.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.21.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.21.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.21.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.22.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.22.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.22.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.23.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.23.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.23.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.24.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.24.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.24.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.25.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.25.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.25.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.26.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.26.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.26.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.27.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.27.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.27.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.28.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.28.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.28.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.29.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.29.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.29.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.3.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.3.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.3.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.30.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.30.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.30.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.31.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.31.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.31.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.32.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.32.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.32.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.33.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.33.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.33.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.34.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.34.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.34.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.35.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.35.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.35.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.36.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.36.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.36.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.37.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.37.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.37.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.38.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.38.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.38.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.39.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.39.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.39.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.4.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.4.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.4.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.40.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.40.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.40.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.41.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.41.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.41.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.42.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.42.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.42.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.43.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.43.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.43.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.44.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.44.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.44.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.45.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.45.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.45.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.46.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.46.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.46.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.47.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.47.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.47.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.48.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.48.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.48.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.49.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.49.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.49.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.5.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.5.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.5.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.50.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.50.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.50.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.51.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.51.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.51.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.52.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.52.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.52.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.53.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.53.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.53.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.54.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.54.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.54.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.55.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.55.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.55.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.56.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.56.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.56.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.57.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.57.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.57.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.58.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.58.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.58.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.59.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.59.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.59.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.6.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.6.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.6.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.60.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.60.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.60.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.61.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.61.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.61.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.62.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.62.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.62.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.63.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.63.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.63.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.64.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.64.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.64.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.65.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.65.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.65.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.66.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.66.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.66.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.67.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.67.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.67.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.68.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.68.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.68.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.69.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.69.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.69.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.7.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.7.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.7.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.70.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.70.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.70.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.71.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.71.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.71.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.72.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.72.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.72.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.73.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.73.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.73.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.74.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.74.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.74.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.75.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.75.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.75.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.76.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.76.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.76.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.77.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.77.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.77.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.78.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.78.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.78.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.79.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.79.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.79.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.8.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.8.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.8.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.80.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.80.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.80.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.81.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.81.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.81.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.82.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.82.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.82.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.83.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.83.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.83.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.84.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.84.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.84.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.85.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.85.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.85.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.86.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.86.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.86.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.87.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.87.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.87.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.88.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.88.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.88.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.89.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.89.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.89.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.9.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.9.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.9.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.90.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.90.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.90.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.91.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.91.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.91.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.92.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.92.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.92.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.93.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.93.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.93.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.94.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.94.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.94.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.95.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.95.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.95.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.96.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.96.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.96.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.97.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.97.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.97.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.98.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.98.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.98.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.99.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.99.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.experts.99.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.gate.e_score_correction_bias": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.gate.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.shared_experts.down_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.shared_experts.gate_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.mlp.shared_experts.up_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.post_attention_layernorm.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.k_norm.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.k_proj.bias": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.k_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.o_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.q_norm.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.q_proj.bias": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.q_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.v_proj.bias": "model-00068-of-00092.safetensors",
+ "model.layers.67.self_attn.v_proj.weight": "model-00068-of-00092.safetensors",
+ "model.layers.68.input_layernorm.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.0.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.0.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.0.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.1.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.1.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.1.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.10.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.10.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.10.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.100.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.100.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.100.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.101.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.101.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.101.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.102.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.102.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.102.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.103.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.103.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.103.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.104.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.104.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.104.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.105.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.105.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.105.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.106.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.106.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.106.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.107.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.107.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.107.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.108.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.108.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.108.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.109.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.109.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.109.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.11.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.11.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.11.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.110.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.110.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.110.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.111.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.111.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.111.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.112.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.112.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.112.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.113.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.113.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.113.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.114.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.114.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.114.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.115.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.115.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.115.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.116.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.116.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.116.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.117.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.117.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.117.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.118.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.118.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.118.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.119.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.119.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.119.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.12.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.12.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.12.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.120.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.120.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.120.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.121.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.121.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.121.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.122.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.122.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.122.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.123.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.123.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.123.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.124.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.124.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.124.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.125.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.125.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.125.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.126.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.126.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.126.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.127.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.127.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.127.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.128.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.128.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.128.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.129.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.129.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.129.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.13.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.13.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.13.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.130.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.130.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.130.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.131.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.131.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.131.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.132.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.132.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.132.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.133.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.133.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.133.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.134.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.134.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.134.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.135.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.135.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.135.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.136.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.136.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.136.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.137.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.137.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.137.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.138.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.138.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.138.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.139.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.139.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.139.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.14.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.14.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.14.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.140.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.140.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.140.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.141.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.141.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.141.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.142.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.142.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.142.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.143.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.143.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.143.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.144.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.144.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.144.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.145.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.145.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.145.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.146.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.146.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.146.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.147.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.147.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.147.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.148.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.148.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.148.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.149.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.149.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.149.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.15.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.15.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.15.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.150.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.150.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.150.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.151.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.151.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.151.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.152.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.152.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.152.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.153.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.153.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.153.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.154.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.154.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.154.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.155.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.155.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.155.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.156.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.156.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.156.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.157.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.157.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.157.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.158.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.158.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.158.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.159.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.159.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.159.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.16.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.16.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.16.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.17.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.17.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.17.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.18.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.18.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.18.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.19.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.19.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.19.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.2.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.2.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.2.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.20.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.20.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.20.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.21.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.21.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.21.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.22.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.22.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.22.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.23.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.23.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.23.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.24.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.24.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.24.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.25.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.25.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.25.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.26.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.26.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.26.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.27.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.27.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.27.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.28.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.28.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.28.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.29.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.29.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.29.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.3.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.3.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.3.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.30.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.30.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.30.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.31.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.31.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.31.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.32.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.32.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.32.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.33.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.33.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.33.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.34.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.34.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.34.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.35.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.35.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.35.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.36.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.36.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.36.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.37.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.37.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.37.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.38.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.38.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.38.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.39.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.39.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.39.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.4.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.4.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.4.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.40.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.40.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.40.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.41.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.41.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.41.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.42.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.42.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.42.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.43.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.43.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.43.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.44.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.44.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.44.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.45.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.45.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.45.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.46.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.46.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.46.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.47.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.47.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.47.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.48.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.48.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.48.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.49.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.49.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.49.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.5.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.5.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.5.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.50.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.50.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.50.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.51.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.51.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.51.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.52.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.52.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.52.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.53.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.53.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.53.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.54.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.54.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.54.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.55.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.55.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.55.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.56.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.56.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.56.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.57.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.57.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.57.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.58.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.58.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.58.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.59.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.59.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.59.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.6.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.6.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.6.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.60.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.60.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.60.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.61.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.61.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.61.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.62.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.62.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.62.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.63.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.63.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.63.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.64.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.64.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.64.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.65.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.65.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.65.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.66.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.66.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.66.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.67.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.67.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.67.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.68.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.68.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.68.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.69.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.69.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.69.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.7.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.7.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.7.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.70.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.70.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.70.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.71.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.71.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.71.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.72.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.72.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.72.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.73.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.73.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.73.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.74.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.74.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.74.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.75.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.75.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.75.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.76.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.76.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.76.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.77.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.77.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.77.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.78.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.78.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.78.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.79.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.79.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.79.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.8.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.8.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.8.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.80.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.80.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.80.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.81.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.81.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.81.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.82.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.82.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.82.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.83.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.83.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.83.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.84.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.84.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.84.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.85.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.85.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.85.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.86.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.86.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.86.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.87.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.87.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.87.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.88.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.88.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.88.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.89.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.89.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.89.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.9.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.9.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.9.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.90.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.90.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.90.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.91.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.91.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.91.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.92.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.92.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.92.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.93.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.93.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.93.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.94.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.94.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.94.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.95.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.95.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.95.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.96.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.96.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.96.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.97.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.97.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.97.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.98.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.98.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.98.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.99.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.99.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.experts.99.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.gate.e_score_correction_bias": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.gate.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.shared_experts.down_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.shared_experts.gate_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.mlp.shared_experts.up_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.post_attention_layernorm.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.k_norm.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.k_proj.bias": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.k_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.o_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.q_norm.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.q_proj.bias": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.q_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.v_proj.bias": "model-00069-of-00092.safetensors",
+ "model.layers.68.self_attn.v_proj.weight": "model-00069-of-00092.safetensors",
+ "model.layers.69.input_layernorm.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.0.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.0.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.0.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.1.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.1.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.1.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.10.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.10.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.10.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.100.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.100.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.100.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.101.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.101.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.101.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.102.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.102.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.102.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.103.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.103.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.103.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.104.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.104.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.104.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.105.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.105.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.105.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.106.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.106.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.106.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.107.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.107.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.107.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.108.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.108.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.108.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.109.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.109.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.109.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.11.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.11.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.11.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.110.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.110.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.110.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.111.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.111.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.111.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.112.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.112.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.112.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.113.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.113.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.113.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.114.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.114.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.114.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.115.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.115.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.115.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.116.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.116.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.116.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.117.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.117.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.117.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.118.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.118.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.118.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.119.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.119.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.119.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.12.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.12.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.12.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.120.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.120.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.120.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.121.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.121.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.121.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.122.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.122.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.122.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.123.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.123.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.123.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.124.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.124.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.124.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.125.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.125.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.125.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.126.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.126.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.126.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.127.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.127.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.127.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.128.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.128.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.128.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.129.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.129.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.129.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.13.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.13.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.13.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.130.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.130.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.130.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.131.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.131.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.131.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.132.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.132.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.132.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.133.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.133.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.133.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.134.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.134.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.134.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.135.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.135.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.135.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.136.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.136.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.136.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.137.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.137.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.137.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.138.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.138.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.138.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.139.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.139.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.139.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.14.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.14.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.14.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.140.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.140.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.140.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.141.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.141.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.141.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.142.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.142.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.142.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.143.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.143.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.143.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.144.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.144.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.144.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.145.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.145.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.145.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.146.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.146.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.146.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.147.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.147.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.147.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.148.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.148.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.148.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.149.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.149.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.149.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.15.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.15.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.15.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.150.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.150.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.150.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.151.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.151.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.151.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.152.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.152.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.152.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.153.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.153.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.153.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.154.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.154.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.154.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.155.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.155.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.155.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.156.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.156.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.156.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.157.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.157.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.157.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.158.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.158.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.158.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.159.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.159.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.159.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.16.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.16.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.16.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.17.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.17.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.17.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.18.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.18.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.18.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.19.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.19.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.19.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.2.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.2.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.2.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.20.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.20.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.20.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.21.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.21.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.21.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.22.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.22.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.22.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.23.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.23.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.23.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.24.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.24.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.24.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.25.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.25.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.25.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.26.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.26.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.26.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.27.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.27.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.27.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.28.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.28.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.28.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.29.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.29.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.29.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.3.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.3.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.3.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.30.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.30.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.30.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.31.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.31.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.31.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.32.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.32.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.32.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.33.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.33.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.33.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.34.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.34.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.34.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.35.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.35.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.35.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.36.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.36.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.36.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.37.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.37.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.37.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.38.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.38.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.38.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.39.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.39.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.39.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.4.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.4.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.4.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.40.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.40.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.40.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.41.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.41.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.41.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.42.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.42.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.42.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.43.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.43.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.43.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.44.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.44.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.44.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.45.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.45.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.45.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.46.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.46.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.46.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.47.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.47.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.47.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.48.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.48.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.48.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.49.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.49.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.49.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.5.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.5.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.5.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.50.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.50.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.50.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.51.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.51.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.51.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.52.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.52.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.52.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.53.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.53.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.53.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.54.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.54.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.54.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.55.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.55.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.55.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.56.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.56.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.56.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.57.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.57.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.57.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.58.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.58.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.58.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.59.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.59.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.59.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.6.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.6.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.6.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.60.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.60.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.60.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.61.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.61.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.61.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.62.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.62.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.62.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.63.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.63.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.63.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.64.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.64.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.64.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.65.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.65.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.65.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.66.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.66.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.66.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.67.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.67.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.67.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.68.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.68.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.68.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.69.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.69.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.69.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.7.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.7.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.7.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.70.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.70.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.70.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.71.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.71.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.71.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.72.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.72.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.72.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.73.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.73.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.73.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.74.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.74.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.74.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.75.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.75.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.75.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.76.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.76.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.76.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.77.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.77.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.77.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.78.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.78.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.78.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.79.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.79.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.79.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.8.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.8.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.8.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.80.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.80.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.80.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.81.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.81.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.81.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.82.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.82.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.82.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.83.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.83.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.83.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.84.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.84.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.84.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.85.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.85.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.85.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.86.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.86.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.86.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.87.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.87.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.87.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.88.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.88.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.88.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.89.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.89.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.89.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.9.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.9.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.9.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.90.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.90.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.90.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.91.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.91.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.91.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.92.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.92.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.92.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.93.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.93.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.93.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.94.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.94.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.94.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.95.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.95.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.95.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.96.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.96.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.96.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.97.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.97.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.97.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.98.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.98.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.98.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.99.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.99.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.experts.99.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.gate.e_score_correction_bias": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.gate.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.shared_experts.down_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.shared_experts.gate_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.mlp.shared_experts.up_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.post_attention_layernorm.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.k_norm.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.k_proj.bias": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.k_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.o_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.q_norm.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.q_proj.bias": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.q_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.v_proj.bias": "model-00070-of-00092.safetensors",
+ "model.layers.69.self_attn.v_proj.weight": "model-00070-of-00092.safetensors",
+ "model.layers.70.input_layernorm.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.0.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.0.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.0.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.1.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.1.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.1.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.10.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.10.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.10.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.100.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.100.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.100.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.101.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.101.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.101.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.102.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.102.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.102.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.103.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.103.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.103.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.104.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.104.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.104.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.105.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.105.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.105.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.106.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.106.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.106.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.107.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.107.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.107.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.108.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.108.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.108.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.109.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.109.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.109.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.11.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.11.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.11.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.110.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.110.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.110.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.111.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.111.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.111.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.112.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.112.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.112.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.113.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.113.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.113.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.114.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.114.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.114.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.115.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.115.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.115.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.116.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.116.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.116.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.117.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.117.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.117.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.118.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.118.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.118.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.119.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.119.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.119.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.12.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.12.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.12.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.120.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.120.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.120.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.121.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.121.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.121.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.122.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.122.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.122.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.123.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.123.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.123.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.124.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.124.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.124.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.125.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.125.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.125.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.126.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.126.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.126.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.127.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.127.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.127.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.128.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.128.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.128.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.129.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.129.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.129.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.13.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.13.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.13.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.130.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.130.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.130.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.131.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.131.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.131.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.132.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.132.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.132.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.133.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.133.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.133.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.134.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.134.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.134.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.135.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.135.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.135.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.136.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.136.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.136.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.137.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.137.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.137.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.138.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.138.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.138.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.139.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.139.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.139.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.14.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.14.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.14.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.140.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.140.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.140.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.141.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.141.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.141.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.142.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.142.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.142.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.143.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.143.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.143.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.144.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.144.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.144.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.145.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.145.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.145.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.146.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.146.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.146.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.147.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.147.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.147.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.148.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.148.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.148.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.149.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.149.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.149.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.15.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.15.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.15.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.150.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.150.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.150.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.151.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.151.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.151.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.152.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.152.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.152.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.153.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.153.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.153.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.154.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.154.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.154.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.155.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.155.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.155.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.156.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.156.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.156.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.157.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.157.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.157.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.158.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.158.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.158.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.159.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.159.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.159.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.16.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.16.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.16.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.17.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.17.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.17.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.18.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.18.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.18.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.19.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.19.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.19.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.2.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.2.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.2.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.20.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.20.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.20.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.21.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.21.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.21.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.22.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.22.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.22.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.23.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.23.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.23.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.24.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.24.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.24.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.25.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.25.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.25.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.26.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.26.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.26.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.27.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.27.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.27.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.28.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.28.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.28.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.29.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.29.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.29.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.3.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.3.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.3.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.30.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.30.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.30.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.31.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.31.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.31.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.32.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.32.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.32.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.33.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.33.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.33.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.34.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.34.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.34.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.35.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.35.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.35.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.36.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.36.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.36.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.37.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.37.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.37.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.38.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.38.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.38.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.39.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.39.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.39.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.4.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.4.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.4.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.40.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.40.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.40.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.41.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.41.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.41.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.42.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.42.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.42.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.43.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.43.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.43.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.44.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.44.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.44.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.45.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.45.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.45.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.46.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.46.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.46.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.47.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.47.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.47.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.48.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.48.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.48.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.49.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.49.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.49.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.5.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.5.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.5.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.50.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.50.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.50.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.51.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.51.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.51.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.52.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.52.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.52.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.53.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.53.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.53.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.54.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.54.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.54.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.55.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.55.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.55.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.56.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.56.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.56.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.57.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.57.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.57.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.58.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.58.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.58.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.59.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.59.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.59.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.6.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.6.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.6.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.60.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.60.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.60.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.61.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.61.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.61.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.62.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.62.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.62.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.63.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.63.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.63.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.64.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.64.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.64.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.65.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.65.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.65.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.66.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.66.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.66.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.67.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.67.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.67.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.68.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.68.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.68.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.69.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.69.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.69.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.7.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.7.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.7.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.70.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.70.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.70.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.71.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.71.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.71.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.72.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.72.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.72.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.73.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.73.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.73.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.74.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.74.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.74.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.75.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.75.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.75.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.76.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.76.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.76.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.77.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.77.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.77.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.78.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.78.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.78.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.79.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.79.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.79.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.8.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.8.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.8.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.80.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.80.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.80.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.81.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.81.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.81.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.82.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.82.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.82.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.83.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.83.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.83.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.84.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.84.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.84.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.85.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.85.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.85.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.86.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.86.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.86.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.87.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.87.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.87.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.88.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.88.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.88.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.89.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.89.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.89.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.9.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.9.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.9.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.90.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.90.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.90.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.91.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.91.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.91.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.92.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.92.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.92.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.93.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.93.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.93.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.94.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.94.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.94.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.95.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.95.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.95.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.96.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.96.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.96.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.97.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.97.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.97.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.98.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.98.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.98.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.99.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.99.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.experts.99.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.gate.e_score_correction_bias": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.gate.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.shared_experts.down_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.shared_experts.gate_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.mlp.shared_experts.up_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.post_attention_layernorm.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.k_norm.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.k_proj.bias": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.k_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.o_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.q_norm.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.q_proj.bias": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.q_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.v_proj.bias": "model-00071-of-00092.safetensors",
+ "model.layers.70.self_attn.v_proj.weight": "model-00071-of-00092.safetensors",
+ "model.layers.71.input_layernorm.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.0.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.0.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.0.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.1.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.1.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.1.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.10.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.10.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.10.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.100.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.100.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.100.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.101.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.101.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.101.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.102.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.102.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.102.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.103.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.103.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.103.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.104.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.104.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.104.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.105.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.105.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.105.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.106.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.106.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.106.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.107.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.107.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.107.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.108.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.108.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.108.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.109.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.109.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.109.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.11.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.11.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.11.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.110.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.110.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.110.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.111.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.111.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.111.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.112.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.112.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.112.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.113.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.113.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.113.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.114.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.114.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.114.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.115.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.115.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.115.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.116.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.116.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.116.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.117.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.117.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.117.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.118.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.118.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.118.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.119.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.119.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.119.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.12.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.12.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.12.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.120.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.120.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.120.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.121.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.121.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.121.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.122.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.122.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.122.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.123.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.123.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.123.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.124.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.124.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.124.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.125.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.125.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.125.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.126.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.126.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.126.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.127.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.127.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.127.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.128.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.128.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.128.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.129.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.129.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.129.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.13.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.13.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.13.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.130.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.130.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.130.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.131.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.131.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.131.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.132.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.132.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.132.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.133.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.133.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.133.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.134.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.134.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.134.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.135.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.135.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.135.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.136.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.136.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.136.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.137.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.137.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.137.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.138.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.138.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.138.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.139.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.139.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.139.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.14.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.14.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.14.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.140.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.140.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.140.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.141.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.141.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.141.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.142.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.142.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.142.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.143.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.143.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.143.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.144.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.144.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.144.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.145.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.145.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.145.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.146.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.146.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.146.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.147.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.147.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.147.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.148.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.148.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.148.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.149.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.149.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.149.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.15.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.15.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.15.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.150.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.150.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.150.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.151.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.151.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.151.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.152.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.152.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.152.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.153.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.153.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.153.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.154.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.154.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.154.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.155.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.155.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.155.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.156.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.156.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.156.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.157.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.157.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.157.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.158.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.158.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.158.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.159.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.159.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.159.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.16.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.16.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.16.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.17.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.17.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.17.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.18.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.18.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.18.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.19.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.19.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.19.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.2.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.2.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.2.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.20.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.20.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.20.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.21.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.21.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.21.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.22.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.22.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.22.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.23.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.23.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.23.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.24.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.24.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.24.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.25.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.25.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.25.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.26.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.26.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.26.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.27.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.27.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.27.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.28.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.28.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.28.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.29.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.29.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.29.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.3.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.3.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.3.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.30.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.30.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.30.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.31.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.31.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.31.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.32.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.32.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.32.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.33.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.33.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.33.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.34.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.34.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.34.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.35.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.35.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.35.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.36.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.36.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.36.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.37.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.37.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.37.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.38.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.38.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.38.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.39.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.39.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.39.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.4.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.4.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.4.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.40.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.40.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.40.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.41.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.41.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.41.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.42.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.42.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.42.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.43.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.43.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.43.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.44.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.44.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.44.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.45.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.45.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.45.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.46.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.46.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.46.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.47.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.47.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.47.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.48.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.48.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.48.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.49.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.49.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.49.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.5.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.5.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.5.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.50.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.50.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.50.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.51.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.51.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.51.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.52.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.52.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.52.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.53.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.53.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.53.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.54.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.54.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.54.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.55.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.55.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.55.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.56.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.56.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.56.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.57.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.57.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.57.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.58.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.58.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.58.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.59.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.59.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.59.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.6.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.6.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.6.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.60.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.60.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.60.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.61.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.61.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.61.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.62.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.62.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.62.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.63.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.63.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.63.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.64.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.64.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.64.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.65.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.65.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.65.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.66.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.66.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.66.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.67.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.67.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.67.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.68.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.68.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.68.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.69.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.69.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.69.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.7.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.7.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.7.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.70.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.70.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.70.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.71.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.71.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.71.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.72.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.72.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.72.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.73.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.73.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.73.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.74.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.74.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.74.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.75.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.75.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.75.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.76.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.76.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.76.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.77.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.77.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.77.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.78.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.78.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.78.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.79.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.79.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.79.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.8.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.8.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.8.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.80.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.80.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.80.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.81.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.81.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.81.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.82.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.82.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.82.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.83.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.83.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.83.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.84.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.84.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.84.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.85.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.85.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.85.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.86.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.86.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.86.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.87.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.87.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.87.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.88.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.88.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.88.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.89.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.89.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.89.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.9.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.9.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.9.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.90.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.90.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.90.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.91.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.91.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.91.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.92.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.92.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.92.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.93.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.93.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.93.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.94.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.94.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.94.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.95.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.95.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.95.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.96.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.96.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.96.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.97.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.97.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.97.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.98.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.98.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.98.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.99.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.99.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.experts.99.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.gate.e_score_correction_bias": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.gate.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.shared_experts.down_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.shared_experts.gate_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.mlp.shared_experts.up_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.post_attention_layernorm.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.k_norm.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.k_proj.bias": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.k_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.o_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.q_norm.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.q_proj.bias": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.q_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.v_proj.bias": "model-00072-of-00092.safetensors",
+ "model.layers.71.self_attn.v_proj.weight": "model-00072-of-00092.safetensors",
+ "model.layers.72.input_layernorm.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.0.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.0.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.0.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.1.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.1.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.1.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.10.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.10.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.10.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.100.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.100.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.100.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.101.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.101.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.101.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.102.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.102.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.102.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.103.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.103.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.103.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.104.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.104.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.104.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.105.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.105.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.105.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.106.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.106.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.106.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.107.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.107.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.107.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.108.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.108.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.108.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.109.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.109.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.109.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.11.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.11.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.11.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.110.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.110.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.110.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.111.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.111.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.111.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.112.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.112.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.112.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.113.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.113.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.113.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.114.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.114.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.114.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.115.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.115.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.115.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.116.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.116.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.116.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.117.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.117.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.117.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.118.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.118.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.118.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.119.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.119.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.119.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.12.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.12.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.12.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.120.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.120.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.120.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.121.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.121.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.121.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.122.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.122.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.122.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.123.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.123.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.123.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.124.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.124.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.124.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.125.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.125.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.125.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.126.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.126.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.126.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.127.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.127.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.127.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.128.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.128.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.128.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.129.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.129.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.129.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.13.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.13.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.13.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.130.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.130.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.130.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.131.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.131.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.131.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.132.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.132.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.132.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.133.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.133.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.133.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.134.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.134.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.134.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.135.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.135.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.135.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.136.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.136.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.136.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.137.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.137.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.137.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.138.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.138.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.138.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.139.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.139.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.139.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.14.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.14.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.14.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.140.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.140.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.140.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.141.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.141.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.141.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.142.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.142.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.142.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.143.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.143.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.143.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.144.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.144.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.144.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.145.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.145.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.145.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.146.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.146.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.146.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.147.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.147.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.147.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.148.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.148.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.148.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.149.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.149.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.149.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.15.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.15.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.15.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.150.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.150.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.150.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.151.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.151.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.151.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.152.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.152.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.152.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.153.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.153.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.153.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.154.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.154.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.154.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.155.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.155.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.155.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.156.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.156.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.156.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.157.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.157.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.157.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.158.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.158.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.158.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.159.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.159.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.159.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.16.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.16.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.16.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.17.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.17.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.17.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.18.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.18.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.18.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.19.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.19.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.19.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.2.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.2.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.2.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.20.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.20.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.20.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.21.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.21.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.21.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.22.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.22.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.22.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.23.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.23.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.23.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.24.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.24.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.24.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.25.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.25.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.25.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.26.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.26.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.26.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.27.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.27.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.27.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.28.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.28.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.28.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.29.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.29.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.29.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.3.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.3.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.3.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.30.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.30.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.30.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.31.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.31.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.31.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.32.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.32.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.32.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.33.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.33.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.33.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.34.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.34.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.34.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.35.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.35.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.35.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.36.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.36.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.36.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.37.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.37.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.37.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.38.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.38.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.38.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.39.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.39.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.39.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.4.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.4.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.4.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.40.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.40.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.40.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.41.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.41.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.41.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.42.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.42.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.42.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.43.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.43.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.43.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.44.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.44.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.44.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.45.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.45.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.45.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.46.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.46.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.46.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.47.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.47.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.47.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.48.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.48.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.48.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.49.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.49.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.49.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.5.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.5.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.5.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.50.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.50.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.50.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.51.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.51.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.51.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.52.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.52.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.52.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.53.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.53.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.53.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.54.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.54.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.54.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.55.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.55.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.55.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.56.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.56.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.56.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.57.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.57.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.57.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.58.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.58.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.58.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.59.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.59.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.59.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.6.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.6.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.6.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.60.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.60.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.60.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.61.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.61.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.61.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.62.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.62.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.62.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.63.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.63.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.63.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.64.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.64.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.64.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.65.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.65.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.65.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.66.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.66.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.66.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.67.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.67.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.67.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.68.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.68.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.68.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.69.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.69.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.69.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.7.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.7.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.7.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.70.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.70.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.70.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.71.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.71.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.71.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.72.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.72.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.72.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.73.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.73.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.73.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.74.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.74.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.74.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.75.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.75.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.75.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.76.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.76.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.76.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.77.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.77.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.77.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.78.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.78.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.78.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.79.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.79.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.79.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.8.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.8.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.8.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.80.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.80.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.80.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.81.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.81.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.81.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.82.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.82.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.82.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.83.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.83.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.83.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.84.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.84.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.84.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.85.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.85.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.85.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.86.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.86.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.86.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.87.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.87.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.87.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.88.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.88.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.88.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.89.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.89.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.89.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.9.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.9.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.9.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.90.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.90.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.90.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.91.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.91.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.91.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.92.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.92.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.92.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.93.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.93.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.93.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.94.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.94.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.94.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.95.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.95.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.95.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.96.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.96.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.96.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.97.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.97.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.97.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.98.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.98.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.98.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.99.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.99.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.experts.99.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.gate.e_score_correction_bias": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.gate.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.shared_experts.down_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.shared_experts.gate_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.mlp.shared_experts.up_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.post_attention_layernorm.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.k_norm.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.k_proj.bias": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.k_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.o_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.q_norm.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.q_proj.bias": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.q_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.v_proj.bias": "model-00073-of-00092.safetensors",
+ "model.layers.72.self_attn.v_proj.weight": "model-00073-of-00092.safetensors",
+ "model.layers.73.input_layernorm.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.0.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.0.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.0.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.1.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.1.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.1.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.10.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.10.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.10.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.100.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.100.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.100.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.101.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.101.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.101.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.102.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.102.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.102.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.103.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.103.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.103.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.104.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.104.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.104.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.105.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.105.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.105.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.106.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.106.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.106.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.107.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.107.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.107.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.108.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.108.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.108.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.109.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.109.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.109.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.11.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.11.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.11.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.110.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.110.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.110.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.111.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.111.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.111.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.112.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.112.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.112.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.113.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.113.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.113.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.114.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.114.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.114.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.115.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.115.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.115.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.116.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.116.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.116.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.117.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.117.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.117.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.118.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.118.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.118.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.119.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.119.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.119.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.12.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.12.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.12.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.120.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.120.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.120.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.121.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.121.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.121.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.122.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.122.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.122.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.123.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.123.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.123.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.124.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.124.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.124.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.125.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.125.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.125.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.126.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.126.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.126.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.127.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.127.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.127.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.128.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.128.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.128.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.129.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.129.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.129.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.13.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.13.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.13.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.130.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.130.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.130.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.131.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.131.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.131.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.132.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.132.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.132.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.133.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.133.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.133.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.134.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.134.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.134.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.135.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.135.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.135.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.136.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.136.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.136.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.137.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.137.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.137.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.138.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.138.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.138.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.139.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.139.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.139.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.14.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.14.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.14.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.140.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.140.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.140.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.141.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.141.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.141.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.142.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.142.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.142.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.143.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.143.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.143.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.144.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.144.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.144.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.145.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.145.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.145.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.146.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.146.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.146.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.147.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.147.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.147.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.148.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.148.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.148.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.149.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.149.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.149.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.15.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.15.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.15.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.150.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.150.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.150.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.151.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.151.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.151.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.152.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.152.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.152.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.153.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.153.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.153.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.154.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.154.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.154.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.155.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.155.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.155.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.156.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.156.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.156.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.157.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.157.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.157.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.158.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.158.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.158.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.159.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.159.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.159.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.16.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.16.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.16.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.17.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.17.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.17.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.18.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.18.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.18.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.19.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.19.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.19.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.2.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.2.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.2.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.20.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.20.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.20.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.21.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.21.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.21.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.22.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.22.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.22.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.23.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.23.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.23.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.24.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.24.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.24.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.25.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.25.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.25.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.26.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.26.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.26.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.27.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.27.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.27.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.28.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.28.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.28.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.29.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.29.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.29.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.3.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.3.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.3.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.30.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.30.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.30.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.31.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.31.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.31.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.32.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.32.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.32.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.33.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.33.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.33.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.34.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.34.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.34.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.35.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.35.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.35.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.36.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.36.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.36.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.37.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.37.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.37.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.38.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.38.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.38.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.39.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.39.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.39.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.4.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.4.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.4.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.40.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.40.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.40.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.41.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.41.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.41.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.42.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.42.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.42.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.43.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.43.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.43.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.44.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.44.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.44.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.45.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.45.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.45.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.46.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.46.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.46.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.47.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.47.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.47.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.48.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.48.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.48.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.49.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.49.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.49.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.5.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.5.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.5.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.50.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.50.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.50.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.51.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.51.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.51.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.52.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.52.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.52.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.53.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.53.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.53.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.54.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.54.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.54.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.55.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.55.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.55.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.56.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.56.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.56.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.57.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.57.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.57.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.58.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.58.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.58.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.59.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.59.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.59.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.6.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.6.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.6.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.60.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.60.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.60.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.61.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.61.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.61.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.62.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.62.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.62.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.63.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.63.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.63.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.64.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.64.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.64.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.65.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.65.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.65.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.66.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.66.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.66.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.67.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.67.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.67.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.68.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.68.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.68.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.69.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.69.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.69.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.7.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.7.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.7.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.70.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.70.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.70.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.71.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.71.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.71.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.72.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.72.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.72.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.73.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.73.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.73.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.74.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.74.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.74.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.75.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.75.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.75.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.76.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.76.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.76.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.77.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.77.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.77.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.78.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.78.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.78.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.79.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.79.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.79.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.8.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.8.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.8.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.80.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.80.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.80.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.81.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.81.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.81.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.82.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.82.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.82.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.83.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.83.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.83.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.84.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.84.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.84.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.85.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.85.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.85.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.86.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.86.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.86.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.87.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.87.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.87.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.88.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.88.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.88.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.89.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.89.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.89.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.9.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.9.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.9.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.90.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.90.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.90.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.91.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.91.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.91.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.92.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.92.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.92.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.93.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.93.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.93.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.94.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.94.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.94.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.95.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.95.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.95.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.96.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.96.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.96.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.97.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.97.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.97.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.98.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.98.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.98.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.99.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.99.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.experts.99.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.gate.e_score_correction_bias": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.gate.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.shared_experts.down_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.shared_experts.gate_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.mlp.shared_experts.up_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.post_attention_layernorm.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.k_norm.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.k_proj.bias": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.k_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.o_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.q_norm.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.q_proj.bias": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.q_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.v_proj.bias": "model-00074-of-00092.safetensors",
+ "model.layers.73.self_attn.v_proj.weight": "model-00074-of-00092.safetensors",
+ "model.layers.74.input_layernorm.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.0.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.0.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.0.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.1.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.1.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.1.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.10.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.10.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.10.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.100.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.100.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.100.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.101.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.101.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.101.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.102.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.102.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.102.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.103.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.103.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.103.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.104.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.104.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.104.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.105.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.105.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.105.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.106.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.106.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.106.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.107.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.107.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.107.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.108.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.108.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.108.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.109.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.109.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.109.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.11.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.11.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.11.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.110.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.110.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.110.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.111.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.111.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.111.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.112.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.112.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.112.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.113.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.113.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.113.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.114.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.114.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.114.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.115.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.115.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.115.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.116.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.116.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.116.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.117.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.117.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.117.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.118.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.118.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.118.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.119.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.119.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.119.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.12.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.12.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.12.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.120.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.120.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.120.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.121.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.121.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.121.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.122.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.122.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.122.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.123.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.123.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.123.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.124.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.124.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.124.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.125.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.125.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.125.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.126.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.126.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.126.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.127.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.127.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.127.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.128.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.128.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.128.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.129.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.129.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.129.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.13.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.13.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.13.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.130.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.130.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.130.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.131.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.131.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.131.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.132.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.132.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.132.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.133.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.133.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.133.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.134.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.134.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.134.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.135.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.135.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.135.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.136.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.136.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.136.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.137.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.137.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.137.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.138.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.138.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.138.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.139.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.139.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.139.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.14.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.14.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.14.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.140.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.140.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.140.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.141.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.141.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.141.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.142.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.142.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.142.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.143.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.143.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.143.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.144.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.144.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.144.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.145.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.145.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.145.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.146.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.146.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.146.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.147.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.147.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.147.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.148.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.148.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.148.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.149.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.149.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.149.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.15.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.15.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.15.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.150.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.150.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.150.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.151.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.151.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.151.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.152.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.152.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.152.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.153.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.153.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.153.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.154.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.154.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.154.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.155.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.155.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.155.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.156.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.156.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.156.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.157.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.157.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.157.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.158.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.158.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.158.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.159.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.159.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.159.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.16.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.16.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.16.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.17.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.17.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.17.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.18.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.18.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.18.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.19.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.19.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.19.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.2.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.2.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.2.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.20.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.20.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.20.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.21.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.21.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.21.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.22.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.22.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.22.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.23.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.23.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.23.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.24.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.24.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.24.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.25.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.25.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.25.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.26.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.26.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.26.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.27.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.27.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.27.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.28.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.28.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.28.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.29.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.29.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.29.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.3.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.3.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.3.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.30.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.30.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.30.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.31.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.31.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.31.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.32.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.32.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.32.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.33.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.33.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.33.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.34.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.34.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.34.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.35.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.35.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.35.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.36.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.36.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.36.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.37.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.37.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.37.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.38.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.38.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.38.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.39.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.39.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.39.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.4.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.4.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.4.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.40.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.40.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.40.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.41.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.41.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.41.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.42.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.42.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.42.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.43.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.43.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.43.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.44.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.44.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.44.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.45.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.45.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.45.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.46.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.46.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.46.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.47.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.47.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.47.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.48.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.48.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.48.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.49.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.49.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.49.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.5.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.5.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.5.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.50.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.50.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.50.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.51.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.51.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.51.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.52.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.52.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.52.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.53.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.53.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.53.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.54.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.54.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.54.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.55.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.55.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.55.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.56.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.56.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.56.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.57.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.57.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.57.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.58.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.58.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.58.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.59.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.59.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.59.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.6.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.6.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.6.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.60.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.60.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.60.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.61.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.61.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.61.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.62.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.62.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.62.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.63.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.63.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.63.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.64.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.64.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.64.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.65.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.65.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.65.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.66.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.66.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.66.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.67.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.67.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.67.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.68.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.68.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.68.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.69.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.69.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.69.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.7.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.7.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.7.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.70.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.70.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.70.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.71.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.71.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.71.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.72.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.72.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.72.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.73.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.73.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.73.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.74.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.74.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.74.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.75.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.75.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.75.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.76.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.76.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.76.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.77.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.77.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.77.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.78.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.78.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.78.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.79.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.79.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.79.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.8.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.8.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.8.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.80.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.80.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.80.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.81.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.81.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.81.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.82.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.82.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.82.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.83.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.83.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.83.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.84.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.84.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.84.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.85.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.85.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.85.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.86.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.86.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.86.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.87.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.87.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.87.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.88.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.88.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.88.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.89.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.89.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.89.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.9.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.9.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.9.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.90.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.90.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.90.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.91.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.91.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.91.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.92.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.92.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.92.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.93.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.93.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.93.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.94.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.94.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.94.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.95.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.95.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.95.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.96.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.96.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.96.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.97.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.97.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.97.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.98.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.98.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.98.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.99.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.99.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.experts.99.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.gate.e_score_correction_bias": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.gate.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.shared_experts.down_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.shared_experts.gate_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.mlp.shared_experts.up_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.post_attention_layernorm.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.k_norm.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.k_proj.bias": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.k_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.o_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.q_norm.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.q_proj.bias": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.q_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.v_proj.bias": "model-00075-of-00092.safetensors",
+ "model.layers.74.self_attn.v_proj.weight": "model-00075-of-00092.safetensors",
+ "model.layers.75.input_layernorm.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.0.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.0.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.0.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.1.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.1.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.1.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.10.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.10.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.10.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.100.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.100.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.100.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.101.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.101.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.101.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.102.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.102.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.102.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.103.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.103.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.103.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.104.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.104.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.104.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.105.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.105.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.105.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.106.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.106.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.106.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.107.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.107.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.107.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.108.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.108.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.108.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.109.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.109.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.109.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.11.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.11.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.11.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.110.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.110.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.110.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.111.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.111.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.111.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.112.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.112.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.112.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.113.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.113.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.113.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.114.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.114.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.114.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.115.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.115.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.115.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.116.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.116.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.116.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.117.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.117.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.117.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.118.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.118.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.118.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.119.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.119.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.119.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.12.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.12.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.12.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.120.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.120.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.120.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.121.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.121.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.121.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.122.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.122.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.122.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.123.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.123.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.123.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.124.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.124.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.124.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.125.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.125.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.125.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.126.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.126.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.126.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.127.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.127.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.127.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.128.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.128.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.128.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.129.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.129.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.129.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.13.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.13.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.13.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.130.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.130.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.130.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.131.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.131.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.131.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.132.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.132.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.132.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.133.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.133.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.133.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.134.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.134.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.134.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.135.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.135.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.135.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.136.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.136.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.136.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.137.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.137.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.137.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.138.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.138.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.138.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.139.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.139.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.139.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.14.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.14.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.14.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.140.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.140.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.140.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.141.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.141.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.141.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.142.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.142.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.142.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.143.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.143.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.143.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.144.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.144.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.144.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.145.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.145.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.145.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.146.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.146.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.146.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.147.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.147.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.147.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.148.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.148.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.148.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.149.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.149.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.149.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.15.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.15.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.15.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.150.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.150.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.150.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.151.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.151.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.151.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.152.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.152.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.152.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.153.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.153.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.153.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.154.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.154.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.154.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.155.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.155.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.155.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.156.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.156.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.156.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.157.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.157.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.157.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.158.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.158.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.158.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.159.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.159.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.159.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.16.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.16.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.16.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.17.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.17.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.17.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.18.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.18.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.18.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.19.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.19.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.19.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.2.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.2.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.2.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.20.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.20.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.20.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.21.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.21.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.21.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.22.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.22.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.22.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.23.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.23.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.23.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.24.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.24.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.24.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.25.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.25.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.25.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.26.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.26.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.26.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.27.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.27.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.27.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.28.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.28.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.28.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.29.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.29.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.29.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.3.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.3.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.3.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.30.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.30.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.30.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.31.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.31.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.31.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.32.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.32.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.32.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.33.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.33.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.33.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.34.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.34.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.34.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.35.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.35.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.35.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.36.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.36.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.36.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.37.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.37.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.37.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.38.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.38.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.38.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.39.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.39.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.39.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.4.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.4.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.4.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.40.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.40.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.40.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.41.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.41.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.41.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.42.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.42.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.42.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.43.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.43.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.43.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.44.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.44.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.44.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.45.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.45.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.45.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.46.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.46.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.46.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.47.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.47.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.47.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.48.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.48.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.48.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.49.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.49.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.49.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.5.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.5.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.5.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.50.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.50.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.50.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.51.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.51.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.51.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.52.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.52.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.52.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.53.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.53.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.53.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.54.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.54.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.54.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.55.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.55.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.55.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.56.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.56.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.56.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.57.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.57.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.57.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.58.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.58.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.58.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.59.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.59.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.59.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.6.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.6.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.6.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.60.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.60.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.60.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.61.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.61.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.61.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.62.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.62.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.62.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.63.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.63.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.63.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.64.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.64.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.64.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.65.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.65.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.65.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.66.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.66.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.66.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.67.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.67.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.67.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.68.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.68.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.68.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.69.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.69.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.69.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.7.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.7.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.7.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.70.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.70.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.70.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.71.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.71.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.71.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.72.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.72.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.72.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.73.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.73.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.73.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.74.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.74.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.74.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.75.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.75.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.75.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.76.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.76.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.76.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.77.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.77.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.77.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.78.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.78.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.78.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.79.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.79.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.79.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.8.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.8.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.8.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.80.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.80.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.80.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.81.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.81.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.81.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.82.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.82.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.82.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.83.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.83.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.83.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.84.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.84.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.84.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.85.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.85.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.85.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.86.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.86.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.86.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.87.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.87.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.87.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.88.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.88.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.88.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.89.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.89.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.89.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.9.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.9.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.9.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.90.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.90.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.90.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.91.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.91.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.91.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.92.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.92.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.92.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.93.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.93.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.93.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.94.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.94.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.94.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.95.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.95.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.95.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.96.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.96.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.96.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.97.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.97.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.97.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.98.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.98.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.98.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.99.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.99.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.experts.99.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.gate.e_score_correction_bias": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.gate.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.shared_experts.down_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.shared_experts.gate_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.mlp.shared_experts.up_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.post_attention_layernorm.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.k_norm.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.k_proj.bias": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.k_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.o_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.q_norm.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.q_proj.bias": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.q_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.v_proj.bias": "model-00076-of-00092.safetensors",
+ "model.layers.75.self_attn.v_proj.weight": "model-00076-of-00092.safetensors",
+ "model.layers.76.input_layernorm.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.0.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.0.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.0.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.1.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.1.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.1.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.10.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.10.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.10.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.100.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.100.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.100.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.101.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.101.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.101.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.102.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.102.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.102.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.103.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.103.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.103.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.104.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.104.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.104.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.105.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.105.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.105.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.106.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.106.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.106.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.107.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.107.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.107.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.108.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.108.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.108.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.109.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.109.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.109.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.11.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.11.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.11.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.110.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.110.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.110.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.111.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.111.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.111.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.112.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.112.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.112.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.113.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.113.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.113.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.114.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.114.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.114.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.115.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.115.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.115.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.116.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.116.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.116.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.117.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.117.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.117.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.118.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.118.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.118.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.119.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.119.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.119.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.12.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.12.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.12.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.120.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.120.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.120.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.121.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.121.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.121.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.122.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.122.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.122.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.123.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.123.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.123.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.124.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.124.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.124.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.125.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.125.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.125.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.126.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.126.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.126.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.127.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.127.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.127.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.128.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.128.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.128.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.129.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.129.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.129.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.13.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.13.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.13.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.130.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.130.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.130.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.131.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.131.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.131.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.132.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.132.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.132.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.133.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.133.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.133.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.134.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.134.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.134.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.135.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.135.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.135.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.136.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.136.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.136.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.137.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.137.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.137.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.138.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.138.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.138.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.139.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.139.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.139.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.14.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.14.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.14.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.140.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.140.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.140.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.141.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.141.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.141.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.142.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.142.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.142.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.143.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.143.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.143.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.144.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.144.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.144.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.145.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.145.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.145.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.146.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.146.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.146.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.147.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.147.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.147.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.148.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.148.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.148.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.149.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.149.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.149.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.15.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.15.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.15.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.150.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.150.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.150.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.151.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.151.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.151.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.152.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.152.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.152.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.153.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.153.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.153.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.154.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.154.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.154.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.155.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.155.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.155.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.156.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.156.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.156.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.157.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.157.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.157.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.158.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.158.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.158.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.159.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.159.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.159.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.16.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.16.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.16.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.17.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.17.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.17.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.18.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.18.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.18.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.19.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.19.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.19.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.2.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.2.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.2.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.20.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.20.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.20.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.21.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.21.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.21.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.22.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.22.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.22.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.23.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.23.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.23.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.24.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.24.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.24.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.25.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.25.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.25.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.26.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.26.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.26.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.27.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.27.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.27.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.28.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.28.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.28.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.29.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.29.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.29.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.3.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.3.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.3.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.30.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.30.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.30.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.31.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.31.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.31.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.32.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.32.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.32.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.33.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.33.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.33.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.34.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.34.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.34.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.35.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.35.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.35.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.36.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.36.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.36.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.37.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.37.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.37.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.38.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.38.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.38.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.39.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.39.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.39.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.4.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.4.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.4.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.40.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.40.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.40.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.41.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.41.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.41.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.42.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.42.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.42.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.43.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.43.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.43.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.44.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.44.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.44.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.45.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.45.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.45.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.46.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.46.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.46.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.47.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.47.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.47.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.48.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.48.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.48.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.49.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.49.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.49.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.5.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.5.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.5.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.50.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.50.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.50.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.51.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.51.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.51.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.52.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.52.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.52.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.53.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.53.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.53.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.54.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.54.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.54.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.55.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.55.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.55.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.56.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.56.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.56.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.57.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.57.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.57.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.58.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.58.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.58.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.59.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.59.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.59.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.6.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.6.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.6.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.60.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.60.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.60.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.61.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.61.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.61.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.62.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.62.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.62.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.63.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.63.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.63.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.64.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.64.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.64.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.65.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.65.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.65.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.66.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.66.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.66.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.67.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.67.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.67.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.68.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.68.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.68.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.69.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.69.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.69.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.7.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.7.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.7.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.70.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.70.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.70.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.71.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.71.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.71.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.72.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.72.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.72.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.73.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.73.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.73.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.74.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.74.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.74.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.75.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.75.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.75.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.76.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.76.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.76.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.77.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.77.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.77.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.78.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.78.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.78.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.79.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.79.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.79.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.8.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.8.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.8.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.80.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.80.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.80.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.81.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.81.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.81.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.82.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.82.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.82.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.83.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.83.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.83.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.84.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.84.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.84.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.85.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.85.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.85.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.86.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.86.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.86.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.87.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.87.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.87.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.88.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.88.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.88.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.89.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.89.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.89.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.9.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.9.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.9.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.90.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.90.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.90.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.91.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.91.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.91.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.92.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.92.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.92.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.93.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.93.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.93.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.94.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.94.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.94.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.95.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.95.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.95.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.96.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.96.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.96.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.97.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.97.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.97.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.98.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.98.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.98.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.99.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.99.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.experts.99.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.gate.e_score_correction_bias": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.gate.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.shared_experts.down_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.shared_experts.gate_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.mlp.shared_experts.up_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.post_attention_layernorm.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.k_norm.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.k_proj.bias": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.k_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.o_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.q_norm.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.q_proj.bias": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.q_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.v_proj.bias": "model-00077-of-00092.safetensors",
+ "model.layers.76.self_attn.v_proj.weight": "model-00077-of-00092.safetensors",
+ "model.layers.77.input_layernorm.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.0.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.0.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.0.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.1.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.1.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.1.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.10.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.10.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.10.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.100.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.100.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.100.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.101.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.101.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.101.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.102.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.102.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.102.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.103.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.103.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.103.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.104.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.104.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.104.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.105.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.105.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.105.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.106.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.106.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.106.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.107.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.107.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.107.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.108.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.108.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.108.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.109.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.109.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.109.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.11.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.11.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.11.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.110.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.110.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.110.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.111.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.111.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.111.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.112.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.112.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.112.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.113.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.113.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.113.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.114.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.114.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.114.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.115.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.115.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.115.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.116.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.116.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.116.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.117.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.117.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.117.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.118.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.118.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.118.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.119.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.119.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.119.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.12.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.12.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.12.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.120.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.120.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.120.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.121.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.121.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.121.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.122.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.122.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.122.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.123.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.123.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.123.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.124.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.124.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.124.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.125.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.125.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.125.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.126.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.126.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.126.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.127.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.127.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.127.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.128.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.128.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.128.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.129.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.129.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.129.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.13.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.13.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.13.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.130.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.130.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.130.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.131.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.131.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.131.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.132.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.132.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.132.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.133.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.133.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.133.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.134.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.134.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.134.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.135.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.135.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.135.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.136.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.136.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.136.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.137.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.137.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.137.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.138.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.138.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.138.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.139.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.139.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.139.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.14.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.14.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.14.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.140.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.140.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.140.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.141.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.141.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.141.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.142.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.142.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.142.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.143.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.143.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.143.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.144.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.144.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.144.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.145.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.145.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.145.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.146.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.146.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.146.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.147.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.147.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.147.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.148.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.148.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.148.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.149.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.149.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.149.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.15.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.15.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.15.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.150.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.150.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.150.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.151.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.151.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.151.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.152.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.152.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.152.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.153.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.153.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.153.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.154.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.154.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.154.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.155.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.155.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.155.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.156.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.156.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.156.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.157.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.157.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.157.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.158.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.158.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.158.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.159.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.159.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.159.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.16.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.16.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.16.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.17.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.17.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.17.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.18.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.18.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.18.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.19.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.19.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.19.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.2.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.2.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.2.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.20.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.20.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.20.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.21.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.21.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.21.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.22.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.22.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.22.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.23.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.23.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.23.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.24.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.24.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.24.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.25.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.25.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.25.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.26.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.26.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.26.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.27.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.27.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.27.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.28.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.28.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.28.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.29.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.29.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.29.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.3.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.3.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.3.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.30.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.30.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.30.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.31.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.31.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.31.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.32.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.32.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.32.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.33.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.33.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.33.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.34.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.34.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.34.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.35.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.35.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.35.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.36.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.36.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.36.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.37.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.37.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.37.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.38.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.38.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.38.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.39.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.39.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.39.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.4.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.4.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.4.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.40.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.40.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.40.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.41.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.41.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.41.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.42.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.42.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.42.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.43.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.43.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.43.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.44.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.44.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.44.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.45.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.45.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.45.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.46.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.46.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.46.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.47.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.47.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.47.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.48.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.48.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.48.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.49.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.49.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.49.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.5.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.5.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.5.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.50.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.50.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.50.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.51.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.51.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.51.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.52.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.52.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.52.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.53.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.53.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.53.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.54.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.54.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.54.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.55.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.55.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.55.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.56.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.56.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.56.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.57.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.57.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.57.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.58.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.58.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.58.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.59.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.59.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.59.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.6.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.6.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.6.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.60.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.60.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.60.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.61.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.61.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.61.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.62.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.62.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.62.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.63.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.63.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.63.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.64.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.64.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.64.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.65.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.65.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.65.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.66.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.66.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.66.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.67.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.67.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.67.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.68.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.68.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.68.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.69.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.69.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.69.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.7.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.7.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.7.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.70.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.70.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.70.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.71.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.71.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.71.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.72.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.72.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.72.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.73.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.73.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.73.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.74.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.74.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.74.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.75.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.75.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.75.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.76.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.76.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.76.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.77.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.77.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.77.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.78.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.78.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.78.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.79.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.79.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.79.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.8.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.8.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.8.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.80.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.80.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.80.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.81.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.81.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.81.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.82.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.82.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.82.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.83.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.83.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.83.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.84.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.84.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.84.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.85.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.85.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.85.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.86.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.86.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.86.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.87.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.87.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.87.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.88.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.88.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.88.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.89.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.89.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.89.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.9.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.9.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.9.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.90.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.90.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.90.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.91.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.91.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.91.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.92.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.92.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.92.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.93.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.93.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.93.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.94.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.94.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.94.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.95.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.95.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.95.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.96.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.96.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.96.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.97.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.97.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.97.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.98.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.98.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.98.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.99.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.99.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.experts.99.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.gate.e_score_correction_bias": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.gate.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.shared_experts.down_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.shared_experts.gate_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.mlp.shared_experts.up_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.post_attention_layernorm.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.k_norm.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.k_proj.bias": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.k_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.o_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.q_norm.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.q_proj.bias": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.q_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.v_proj.bias": "model-00078-of-00092.safetensors",
+ "model.layers.77.self_attn.v_proj.weight": "model-00078-of-00092.safetensors",
+ "model.layers.78.input_layernorm.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.0.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.0.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.0.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.1.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.1.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.1.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.10.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.10.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.10.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.100.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.100.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.100.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.101.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.101.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.101.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.102.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.102.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.102.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.103.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.103.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.103.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.104.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.104.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.104.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.105.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.105.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.105.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.106.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.106.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.106.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.107.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.107.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.107.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.108.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.108.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.108.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.109.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.109.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.109.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.11.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.11.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.11.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.110.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.110.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.110.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.111.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.111.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.111.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.112.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.112.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.112.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.113.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.113.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.113.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.114.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.114.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.114.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.115.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.115.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.115.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.116.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.116.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.116.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.117.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.117.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.117.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.118.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.118.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.118.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.119.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.119.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.119.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.12.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.12.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.12.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.120.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.120.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.120.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.121.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.121.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.121.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.122.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.122.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.122.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.123.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.123.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.123.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.124.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.124.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.124.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.125.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.125.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.125.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.126.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.126.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.126.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.127.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.127.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.127.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.128.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.128.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.128.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.129.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.129.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.129.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.13.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.13.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.13.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.130.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.130.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.130.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.131.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.131.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.131.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.132.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.132.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.132.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.133.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.133.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.133.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.134.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.134.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.134.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.135.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.135.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.135.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.136.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.136.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.136.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.137.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.137.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.137.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.138.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.138.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.138.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.139.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.139.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.139.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.14.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.14.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.14.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.140.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.140.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.140.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.141.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.141.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.141.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.142.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.142.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.142.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.143.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.143.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.143.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.144.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.144.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.144.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.145.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.145.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.145.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.146.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.146.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.146.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.147.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.147.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.147.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.148.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.148.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.148.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.149.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.149.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.149.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.15.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.15.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.15.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.150.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.150.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.150.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.151.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.151.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.151.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.152.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.152.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.152.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.153.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.153.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.153.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.154.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.154.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.154.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.155.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.155.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.155.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.156.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.156.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.156.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.157.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.157.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.157.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.158.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.158.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.158.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.159.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.159.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.159.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.16.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.16.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.16.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.17.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.17.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.17.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.18.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.18.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.18.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.19.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.19.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.19.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.2.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.2.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.2.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.20.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.20.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.20.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.21.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.21.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.21.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.22.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.22.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.22.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.23.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.23.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.23.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.24.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.24.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.24.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.25.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.25.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.25.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.26.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.26.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.26.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.27.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.27.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.27.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.28.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.28.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.28.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.29.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.29.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.29.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.3.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.3.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.3.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.30.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.30.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.30.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.31.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.31.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.31.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.32.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.32.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.32.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.33.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.33.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.33.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.34.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.34.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.34.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.35.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.35.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.35.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.36.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.36.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.36.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.37.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.37.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.37.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.38.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.38.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.38.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.39.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.39.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.39.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.4.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.4.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.4.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.40.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.40.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.40.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.41.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.41.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.41.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.42.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.42.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.42.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.43.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.43.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.43.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.44.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.44.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.44.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.45.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.45.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.45.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.46.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.46.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.46.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.47.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.47.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.47.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.48.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.48.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.48.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.49.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.49.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.49.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.5.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.5.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.5.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.50.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.50.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.50.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.51.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.51.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.51.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.52.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.52.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.52.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.53.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.53.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.53.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.54.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.54.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.54.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.55.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.55.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.55.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.56.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.56.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.56.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.57.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.57.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.57.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.58.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.58.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.58.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.59.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.59.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.59.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.6.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.6.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.6.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.60.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.60.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.60.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.61.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.61.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.61.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.62.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.62.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.62.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.63.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.63.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.63.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.64.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.64.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.64.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.65.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.65.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.65.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.66.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.66.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.66.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.67.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.67.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.67.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.68.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.68.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.68.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.69.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.69.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.69.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.7.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.7.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.7.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.70.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.70.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.70.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.71.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.71.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.71.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.72.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.72.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.72.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.73.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.73.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.73.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.74.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.74.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.74.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.75.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.75.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.75.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.76.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.76.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.76.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.77.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.77.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.77.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.78.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.78.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.78.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.79.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.79.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.79.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.8.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.8.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.8.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.80.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.80.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.80.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.81.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.81.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.81.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.82.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.82.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.82.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.83.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.83.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.83.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.84.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.84.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.84.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.85.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.85.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.85.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.86.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.86.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.86.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.87.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.87.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.87.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.88.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.88.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.88.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.89.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.89.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.89.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.9.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.9.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.9.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.90.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.90.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.90.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.91.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.91.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.91.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.92.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.92.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.92.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.93.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.93.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.93.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.94.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.94.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.94.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.95.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.95.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.95.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.96.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.96.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.96.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.97.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.97.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.97.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.98.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.98.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.98.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.99.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.99.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.experts.99.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.gate.e_score_correction_bias": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.gate.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.shared_experts.down_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.shared_experts.gate_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.mlp.shared_experts.up_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.post_attention_layernorm.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.k_norm.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.k_proj.bias": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.k_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.o_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.q_norm.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.q_proj.bias": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.q_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.v_proj.bias": "model-00079-of-00092.safetensors",
+ "model.layers.78.self_attn.v_proj.weight": "model-00079-of-00092.safetensors",
+ "model.layers.79.input_layernorm.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.0.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.0.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.0.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.1.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.1.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.1.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.10.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.10.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.10.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.100.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.100.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.100.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.101.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.101.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.101.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.102.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.102.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.102.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.103.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.103.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.103.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.104.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.104.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.104.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.105.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.105.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.105.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.106.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.106.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.106.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.107.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.107.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.107.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.108.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.108.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.108.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.109.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.109.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.109.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.11.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.11.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.11.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.110.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.110.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.110.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.111.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.111.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.111.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.112.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.112.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.112.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.113.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.113.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.113.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.114.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.114.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.114.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.115.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.115.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.115.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.116.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.116.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.116.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.117.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.117.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.117.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.118.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.118.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.118.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.119.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.119.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.119.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.12.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.12.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.12.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.120.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.120.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.120.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.121.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.121.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.121.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.122.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.122.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.122.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.123.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.123.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.123.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.124.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.124.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.124.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.125.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.125.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.125.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.126.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.126.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.126.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.127.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.127.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.127.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.128.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.128.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.128.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.129.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.129.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.129.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.13.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.13.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.13.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.130.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.130.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.130.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.131.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.131.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.131.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.132.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.132.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.132.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.133.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.133.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.133.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.134.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.134.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.134.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.135.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.135.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.135.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.136.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.136.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.136.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.137.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.137.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.137.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.138.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.138.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.138.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.139.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.139.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.139.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.14.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.14.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.14.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.140.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.140.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.140.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.141.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.141.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.141.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.142.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.142.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.142.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.143.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.143.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.143.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.144.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.144.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.144.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.145.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.145.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.145.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.146.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.146.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.146.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.147.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.147.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.147.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.148.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.148.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.148.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.149.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.149.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.149.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.15.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.15.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.15.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.150.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.150.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.150.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.151.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.151.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.151.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.152.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.152.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.152.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.153.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.153.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.153.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.154.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.154.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.154.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.155.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.155.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.155.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.156.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.156.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.156.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.157.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.157.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.157.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.158.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.158.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.158.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.159.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.159.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.159.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.16.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.16.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.16.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.17.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.17.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.17.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.18.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.18.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.18.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.19.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.19.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.19.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.2.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.2.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.2.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.20.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.20.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.20.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.21.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.21.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.21.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.22.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.22.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.22.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.23.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.23.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.23.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.24.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.24.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.24.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.25.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.25.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.25.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.26.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.26.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.26.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.27.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.27.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.27.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.28.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.28.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.28.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.29.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.29.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.29.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.3.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.3.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.3.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.30.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.30.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.30.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.31.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.31.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.31.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.32.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.32.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.32.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.33.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.33.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.33.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.34.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.34.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.34.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.35.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.35.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.35.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.36.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.36.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.36.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.37.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.37.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.37.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.38.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.38.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.38.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.39.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.39.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.39.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.4.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.4.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.4.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.40.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.40.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.40.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.41.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.41.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.41.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.42.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.42.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.42.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.43.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.43.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.43.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.44.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.44.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.44.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.45.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.45.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.45.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.46.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.46.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.46.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.47.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.47.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.47.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.48.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.48.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.48.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.49.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.49.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.49.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.5.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.5.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.5.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.50.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.50.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.50.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.51.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.51.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.51.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.52.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.52.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.52.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.53.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.53.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.53.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.54.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.54.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.54.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.55.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.55.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.55.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.56.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.56.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.56.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.57.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.57.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.57.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.58.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.58.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.58.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.59.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.59.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.59.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.6.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.6.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.6.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.60.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.60.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.60.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.61.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.61.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.61.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.62.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.62.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.62.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.63.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.63.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.63.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.64.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.64.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.64.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.65.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.65.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.65.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.66.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.66.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.66.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.67.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.67.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.67.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.68.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.68.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.68.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.69.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.69.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.69.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.7.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.7.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.7.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.70.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.70.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.70.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.71.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.71.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.71.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.72.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.72.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.72.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.73.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.73.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.73.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.74.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.74.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.74.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.75.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.75.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.75.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.76.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.76.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.76.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.77.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.77.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.77.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.78.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.78.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.78.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.79.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.79.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.79.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.8.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.8.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.8.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.80.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.80.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.80.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.81.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.81.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.81.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.82.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.82.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.82.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.83.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.83.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.83.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.84.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.84.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.84.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.85.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.85.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.85.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.86.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.86.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.86.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.87.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.87.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.87.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.88.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.88.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.88.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.89.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.89.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.89.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.9.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.9.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.9.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.90.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.90.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.90.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.91.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.91.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.91.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.92.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.92.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.92.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.93.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.93.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.93.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.94.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.94.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.94.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.95.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.95.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.95.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.96.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.96.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.96.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.97.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.97.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.97.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.98.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.98.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.98.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.99.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.99.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.experts.99.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.gate.e_score_correction_bias": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.gate.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.shared_experts.down_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.shared_experts.gate_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.mlp.shared_experts.up_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.post_attention_layernorm.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.k_norm.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.k_proj.bias": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.k_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.o_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.q_norm.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.q_proj.bias": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.q_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.v_proj.bias": "model-00080-of-00092.safetensors",
+ "model.layers.79.self_attn.v_proj.weight": "model-00080-of-00092.safetensors",
+ "model.layers.80.input_layernorm.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.0.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.0.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.0.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.1.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.1.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.1.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.10.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.10.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.10.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.100.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.100.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.100.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.101.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.101.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.101.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.102.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.102.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.102.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.103.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.103.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.103.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.104.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.104.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.104.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.105.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.105.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.105.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.106.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.106.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.106.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.107.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.107.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.107.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.108.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.108.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.108.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.109.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.109.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.109.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.11.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.11.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.11.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.110.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.110.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.110.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.111.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.111.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.111.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.112.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.112.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.112.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.113.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.113.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.113.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.114.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.114.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.114.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.115.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.115.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.115.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.116.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.116.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.116.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.117.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.117.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.117.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.118.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.118.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.118.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.119.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.119.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.119.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.12.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.12.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.12.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.120.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.120.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.120.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.121.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.121.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.121.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.122.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.122.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.122.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.123.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.123.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.123.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.124.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.124.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.124.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.125.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.125.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.125.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.126.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.126.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.126.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.127.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.127.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.127.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.128.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.128.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.128.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.129.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.129.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.129.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.13.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.13.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.13.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.130.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.130.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.130.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.131.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.131.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.131.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.132.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.132.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.132.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.133.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.133.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.133.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.134.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.134.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.134.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.135.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.135.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.135.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.136.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.136.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.136.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.137.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.137.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.137.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.138.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.138.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.138.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.139.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.139.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.139.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.14.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.14.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.14.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.140.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.140.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.140.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.141.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.141.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.141.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.142.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.142.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.142.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.143.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.143.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.143.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.144.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.144.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.144.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.145.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.145.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.145.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.146.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.146.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.146.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.147.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.147.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.147.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.148.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.148.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.148.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.149.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.149.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.149.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.15.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.15.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.15.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.150.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.150.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.150.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.151.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.151.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.151.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.152.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.152.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.152.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.153.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.153.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.153.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.154.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.154.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.154.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.155.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.155.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.155.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.156.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.156.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.156.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.157.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.157.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.157.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.158.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.158.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.158.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.159.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.159.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.159.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.16.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.16.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.16.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.17.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.17.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.17.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.18.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.18.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.18.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.19.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.19.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.19.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.2.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.2.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.2.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.20.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.20.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.20.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.21.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.21.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.21.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.22.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.22.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.22.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.23.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.23.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.23.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.24.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.24.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.24.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.25.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.25.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.25.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.26.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.26.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.26.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.27.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.27.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.27.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.28.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.28.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.28.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.29.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.29.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.29.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.3.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.3.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.3.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.30.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.30.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.30.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.31.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.31.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.31.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.32.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.32.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.32.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.33.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.33.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.33.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.34.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.34.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.34.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.35.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.35.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.35.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.36.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.36.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.36.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.37.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.37.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.37.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.38.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.38.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.38.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.39.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.39.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.39.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.4.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.4.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.4.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.40.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.40.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.40.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.41.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.41.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.41.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.42.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.42.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.42.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.43.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.43.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.43.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.44.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.44.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.44.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.45.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.45.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.45.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.46.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.46.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.46.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.47.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.47.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.47.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.48.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.48.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.48.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.49.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.49.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.49.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.5.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.5.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.5.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.50.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.50.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.50.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.51.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.51.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.51.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.52.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.52.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.52.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.53.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.53.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.53.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.54.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.54.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.54.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.55.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.55.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.55.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.56.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.56.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.56.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.57.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.57.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.57.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.58.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.58.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.58.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.59.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.59.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.59.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.6.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.6.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.6.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.60.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.60.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.60.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.61.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.61.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.61.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.62.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.62.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.62.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.63.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.63.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.63.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.64.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.64.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.64.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.65.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.65.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.65.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.66.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.66.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.66.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.67.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.67.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.67.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.68.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.68.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.68.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.69.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.69.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.69.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.7.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.7.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.7.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.70.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.70.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.70.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.71.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.71.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.71.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.72.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.72.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.72.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.73.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.73.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.73.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.74.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.74.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.74.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.75.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.75.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.75.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.76.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.76.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.76.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.77.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.77.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.77.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.78.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.78.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.78.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.79.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.79.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.79.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.8.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.8.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.8.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.80.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.80.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.80.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.81.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.81.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.81.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.82.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.82.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.82.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.83.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.83.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.83.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.84.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.84.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.84.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.85.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.85.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.85.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.86.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.86.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.86.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.87.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.87.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.87.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.88.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.88.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.88.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.89.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.89.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.89.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.9.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.9.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.9.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.90.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.90.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.90.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.91.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.91.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.91.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.92.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.92.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.92.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.93.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.93.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.93.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.94.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.94.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.94.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.95.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.95.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.95.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.96.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.96.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.96.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.97.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.97.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.97.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.98.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.98.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.98.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.99.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.99.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.experts.99.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.gate.e_score_correction_bias": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.gate.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.shared_experts.down_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.shared_experts.gate_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.mlp.shared_experts.up_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.post_attention_layernorm.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.k_norm.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.k_proj.bias": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.k_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.o_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.q_norm.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.q_proj.bias": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.q_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.v_proj.bias": "model-00081-of-00092.safetensors",
+ "model.layers.80.self_attn.v_proj.weight": "model-00081-of-00092.safetensors",
+ "model.layers.81.input_layernorm.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.0.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.0.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.0.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.1.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.1.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.1.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.10.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.10.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.10.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.100.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.100.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.100.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.101.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.101.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.101.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.102.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.102.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.102.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.103.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.103.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.103.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.104.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.104.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.104.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.105.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.105.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.105.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.106.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.106.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.106.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.107.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.107.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.107.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.108.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.108.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.108.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.109.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.109.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.109.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.11.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.11.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.11.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.110.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.110.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.110.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.111.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.111.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.111.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.112.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.112.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.112.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.113.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.113.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.113.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.114.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.114.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.114.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.115.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.115.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.115.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.116.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.116.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.116.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.117.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.117.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.117.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.118.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.118.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.118.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.119.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.119.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.119.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.12.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.12.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.12.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.120.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.120.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.120.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.121.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.121.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.121.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.122.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.122.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.122.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.123.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.123.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.123.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.124.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.124.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.124.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.125.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.125.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.125.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.126.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.126.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.126.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.127.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.127.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.127.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.128.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.128.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.128.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.129.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.129.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.129.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.13.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.13.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.13.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.130.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.130.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.130.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.131.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.131.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.131.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.132.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.132.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.132.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.133.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.133.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.133.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.134.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.134.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.134.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.135.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.135.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.135.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.136.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.136.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.136.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.137.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.137.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.137.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.138.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.138.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.138.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.139.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.139.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.139.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.14.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.14.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.14.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.140.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.140.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.140.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.141.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.141.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.141.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.142.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.142.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.142.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.143.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.143.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.143.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.144.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.144.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.144.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.145.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.145.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.145.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.146.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.146.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.146.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.147.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.147.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.147.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.148.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.148.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.148.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.149.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.149.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.149.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.15.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.15.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.15.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.150.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.150.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.150.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.151.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.151.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.151.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.152.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.152.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.152.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.153.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.153.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.153.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.154.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.154.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.154.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.155.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.155.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.155.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.156.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.156.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.156.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.157.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.157.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.157.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.158.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.158.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.158.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.159.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.159.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.159.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.16.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.16.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.16.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.17.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.17.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.17.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.18.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.18.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.18.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.19.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.19.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.19.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.2.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.2.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.2.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.20.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.20.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.20.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.21.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.21.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.21.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.22.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.22.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.22.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.23.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.23.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.23.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.24.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.24.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.24.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.25.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.25.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.25.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.26.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.26.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.26.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.27.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.27.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.27.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.28.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.28.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.28.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.29.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.29.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.29.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.3.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.3.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.3.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.30.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.30.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.30.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.31.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.31.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.31.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.32.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.32.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.32.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.33.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.33.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.33.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.34.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.34.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.34.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.35.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.35.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.35.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.36.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.36.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.36.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.37.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.37.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.37.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.38.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.38.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.38.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.39.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.39.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.39.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.4.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.4.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.4.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.40.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.40.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.40.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.41.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.41.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.41.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.42.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.42.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.42.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.43.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.43.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.43.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.44.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.44.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.44.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.45.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.45.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.45.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.46.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.46.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.46.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.47.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.47.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.47.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.48.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.48.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.48.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.49.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.49.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.49.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.5.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.5.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.5.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.50.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.50.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.50.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.51.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.51.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.51.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.52.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.52.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.52.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.53.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.53.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.53.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.54.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.54.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.54.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.55.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.55.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.55.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.56.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.56.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.56.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.57.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.57.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.57.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.58.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.58.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.58.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.59.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.59.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.59.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.6.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.6.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.6.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.60.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.60.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.60.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.61.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.61.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.61.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.62.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.62.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.62.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.63.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.63.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.63.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.64.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.64.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.64.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.65.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.65.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.65.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.66.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.66.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.66.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.67.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.67.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.67.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.68.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.68.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.68.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.69.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.69.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.69.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.7.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.7.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.7.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.70.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.70.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.70.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.71.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.71.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.71.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.72.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.72.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.72.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.73.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.73.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.73.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.74.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.74.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.74.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.75.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.75.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.75.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.76.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.76.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.76.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.77.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.77.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.77.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.78.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.78.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.78.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.79.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.79.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.79.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.8.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.8.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.8.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.80.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.80.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.80.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.81.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.81.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.81.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.82.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.82.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.82.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.83.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.83.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.83.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.84.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.84.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.84.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.85.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.85.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.85.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.86.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.86.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.86.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.87.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.87.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.87.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.88.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.88.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.88.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.89.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.89.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.89.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.9.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.9.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.9.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.90.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.90.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.90.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.91.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.91.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.91.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.92.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.92.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.92.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.93.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.93.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.93.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.94.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.94.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.94.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.95.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.95.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.95.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.96.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.96.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.96.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.97.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.97.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.97.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.98.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.98.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.98.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.99.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.99.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.experts.99.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.gate.e_score_correction_bias": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.gate.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.shared_experts.down_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.shared_experts.gate_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.mlp.shared_experts.up_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.post_attention_layernorm.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.k_norm.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.k_proj.bias": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.k_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.o_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.q_norm.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.q_proj.bias": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.q_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.v_proj.bias": "model-00082-of-00092.safetensors",
+ "model.layers.81.self_attn.v_proj.weight": "model-00082-of-00092.safetensors",
+ "model.layers.82.input_layernorm.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.0.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.0.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.0.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.1.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.1.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.1.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.10.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.10.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.10.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.100.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.100.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.100.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.101.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.101.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.101.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.102.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.102.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.102.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.103.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.103.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.103.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.104.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.104.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.104.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.105.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.105.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.105.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.106.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.106.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.106.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.107.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.107.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.107.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.108.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.108.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.108.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.109.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.109.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.109.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.11.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.11.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.11.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.110.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.110.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.110.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.111.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.111.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.111.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.112.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.112.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.112.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.113.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.113.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.113.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.114.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.114.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.114.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.115.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.115.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.115.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.116.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.116.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.116.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.117.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.117.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.117.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.118.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.118.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.118.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.119.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.119.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.119.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.12.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.12.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.12.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.120.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.120.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.120.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.121.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.121.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.121.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.122.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.122.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.122.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.123.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.123.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.123.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.124.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.124.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.124.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.125.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.125.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.125.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.126.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.126.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.126.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.127.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.127.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.127.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.128.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.128.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.128.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.129.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.129.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.129.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.13.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.13.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.13.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.130.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.130.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.130.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.131.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.131.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.131.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.132.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.132.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.132.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.133.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.133.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.133.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.134.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.134.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.134.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.135.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.135.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.135.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.136.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.136.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.136.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.137.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.137.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.137.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.138.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.138.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.138.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.139.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.139.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.139.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.14.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.14.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.14.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.140.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.140.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.140.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.141.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.141.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.141.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.142.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.142.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.142.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.143.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.143.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.143.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.144.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.144.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.144.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.145.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.145.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.145.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.146.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.146.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.146.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.147.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.147.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.147.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.148.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.148.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.148.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.149.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.149.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.149.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.15.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.15.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.15.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.150.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.150.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.150.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.151.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.151.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.151.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.152.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.152.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.152.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.153.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.153.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.153.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.154.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.154.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.154.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.155.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.155.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.155.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.156.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.156.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.156.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.157.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.157.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.157.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.158.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.158.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.158.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.159.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.159.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.159.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.16.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.16.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.16.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.17.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.17.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.17.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.18.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.18.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.18.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.19.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.19.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.19.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.2.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.2.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.2.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.20.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.20.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.20.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.21.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.21.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.21.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.22.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.22.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.22.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.23.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.23.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.23.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.24.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.24.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.24.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.25.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.25.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.25.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.26.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.26.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.26.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.27.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.27.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.27.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.28.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.28.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.28.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.29.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.29.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.29.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.3.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.3.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.3.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.30.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.30.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.30.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.31.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.31.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.31.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.32.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.32.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.32.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.33.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.33.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.33.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.34.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.34.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.34.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.35.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.35.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.35.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.36.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.36.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.36.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.37.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.37.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.37.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.38.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.38.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.38.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.39.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.39.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.39.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.4.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.4.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.4.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.40.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.40.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.40.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.41.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.41.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.41.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.42.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.42.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.42.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.43.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.43.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.43.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.44.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.44.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.44.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.45.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.45.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.45.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.46.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.46.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.46.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.47.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.47.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.47.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.48.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.48.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.48.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.49.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.49.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.49.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.5.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.5.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.5.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.50.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.50.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.50.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.51.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.51.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.51.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.52.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.52.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.52.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.53.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.53.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.53.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.54.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.54.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.54.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.55.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.55.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.55.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.56.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.56.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.56.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.57.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.57.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.57.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.58.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.58.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.58.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.59.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.59.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.59.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.6.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.6.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.6.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.60.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.60.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.60.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.61.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.61.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.61.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.62.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.62.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.62.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.63.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.63.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.63.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.64.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.64.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.64.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.65.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.65.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.65.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.66.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.66.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.66.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.67.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.67.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.67.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.68.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.68.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.68.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.69.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.69.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.69.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.7.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.7.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.7.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.70.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.70.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.70.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.71.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.71.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.71.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.72.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.72.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.72.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.73.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.73.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.73.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.74.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.74.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.74.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.75.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.75.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.75.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.76.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.76.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.76.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.77.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.77.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.77.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.78.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.78.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.78.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.79.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.79.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.79.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.8.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.8.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.8.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.80.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.80.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.80.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.81.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.81.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.81.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.82.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.82.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.82.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.83.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.83.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.83.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.84.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.84.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.84.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.85.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.85.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.85.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.86.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.86.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.86.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.87.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.87.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.87.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.88.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.88.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.88.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.89.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.89.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.89.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.9.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.9.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.9.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.90.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.90.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.90.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.91.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.91.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.91.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.92.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.92.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.92.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.93.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.93.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.93.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.94.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.94.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.94.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.95.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.95.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.95.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.96.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.96.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.96.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.97.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.97.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.97.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.98.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.98.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.98.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.99.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.99.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.experts.99.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.gate.e_score_correction_bias": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.gate.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.shared_experts.down_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.shared_experts.gate_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.mlp.shared_experts.up_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.post_attention_layernorm.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.k_norm.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.k_proj.bias": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.k_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.o_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.q_norm.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.q_proj.bias": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.q_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.v_proj.bias": "model-00083-of-00092.safetensors",
+ "model.layers.82.self_attn.v_proj.weight": "model-00083-of-00092.safetensors",
+ "model.layers.83.input_layernorm.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.0.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.0.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.0.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.1.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.1.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.1.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.10.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.10.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.10.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.100.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.100.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.100.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.101.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.101.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.101.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.102.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.102.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.102.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.103.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.103.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.103.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.104.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.104.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.104.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.105.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.105.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.105.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.106.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.106.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.106.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.107.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.107.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.107.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.108.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.108.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.108.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.109.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.109.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.109.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.11.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.11.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.11.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.110.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.110.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.110.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.111.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.111.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.111.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.112.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.112.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.112.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.113.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.113.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.113.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.114.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.114.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.114.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.115.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.115.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.115.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.116.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.116.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.116.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.117.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.117.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.117.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.118.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.118.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.118.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.119.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.119.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.119.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.12.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.12.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.12.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.120.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.120.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.120.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.121.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.121.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.121.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.122.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.122.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.122.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.123.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.123.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.123.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.124.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.124.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.124.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.125.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.125.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.125.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.126.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.126.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.126.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.127.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.127.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.127.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.128.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.128.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.128.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.129.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.129.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.129.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.13.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.13.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.13.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.130.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.130.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.130.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.131.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.131.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.131.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.132.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.132.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.132.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.133.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.133.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.133.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.134.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.134.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.134.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.135.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.135.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.135.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.136.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.136.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.136.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.137.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.137.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.137.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.138.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.138.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.138.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.139.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.139.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.139.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.14.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.14.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.14.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.140.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.140.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.140.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.141.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.141.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.141.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.142.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.142.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.142.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.143.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.143.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.143.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.144.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.144.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.144.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.145.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.145.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.145.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.146.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.146.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.146.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.147.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.147.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.147.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.148.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.148.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.148.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.149.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.149.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.149.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.15.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.15.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.15.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.150.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.150.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.150.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.151.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.151.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.151.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.152.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.152.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.152.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.153.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.153.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.153.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.154.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.154.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.154.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.155.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.155.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.155.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.156.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.156.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.156.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.157.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.157.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.157.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.158.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.158.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.158.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.159.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.159.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.159.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.16.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.16.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.16.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.17.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.17.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.17.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.18.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.18.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.18.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.19.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.19.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.19.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.2.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.2.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.2.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.20.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.20.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.20.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.21.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.21.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.21.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.22.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.22.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.22.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.23.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.23.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.23.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.24.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.24.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.24.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.25.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.25.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.25.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.26.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.26.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.26.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.27.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.27.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.27.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.28.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.28.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.28.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.29.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.29.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.29.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.3.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.3.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.3.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.30.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.30.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.30.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.31.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.31.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.31.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.32.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.32.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.32.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.33.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.33.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.33.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.34.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.34.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.34.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.35.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.35.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.35.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.36.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.36.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.36.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.37.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.37.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.37.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.38.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.38.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.38.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.39.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.39.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.39.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.4.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.4.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.4.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.40.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.40.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.40.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.41.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.41.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.41.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.42.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.42.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.42.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.43.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.43.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.43.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.44.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.44.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.44.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.45.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.45.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.45.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.46.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.46.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.46.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.47.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.47.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.47.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.48.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.48.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.48.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.49.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.49.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.49.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.5.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.5.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.5.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.50.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.50.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.50.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.51.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.51.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.51.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.52.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.52.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.52.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.53.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.53.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.53.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.54.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.54.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.54.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.55.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.55.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.55.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.56.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.56.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.56.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.57.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.57.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.57.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.58.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.58.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.58.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.59.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.59.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.59.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.6.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.6.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.6.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.60.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.60.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.60.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.61.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.61.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.61.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.62.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.62.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.62.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.63.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.63.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.63.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.64.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.64.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.64.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.65.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.65.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.65.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.66.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.66.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.66.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.67.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.67.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.67.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.68.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.68.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.68.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.69.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.69.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.69.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.7.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.7.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.7.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.70.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.70.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.70.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.71.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.71.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.71.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.72.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.72.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.72.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.73.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.73.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.73.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.74.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.74.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.74.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.75.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.75.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.75.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.76.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.76.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.76.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.77.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.77.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.77.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.78.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.78.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.78.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.79.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.79.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.79.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.8.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.8.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.8.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.80.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.80.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.80.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.81.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.81.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.81.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.82.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.82.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.82.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.83.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.83.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.83.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.84.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.84.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.84.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.85.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.85.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.85.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.86.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.86.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.86.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.87.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.87.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.87.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.88.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.88.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.88.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.89.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.89.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.89.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.9.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.9.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.9.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.90.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.90.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.90.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.91.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.91.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.91.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.92.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.92.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.92.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.93.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.93.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.93.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.94.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.94.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.94.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.95.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.95.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.95.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.96.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.96.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.96.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.97.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.97.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.97.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.98.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.98.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.98.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.99.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.99.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.experts.99.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.gate.e_score_correction_bias": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.gate.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.shared_experts.down_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.shared_experts.gate_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.mlp.shared_experts.up_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.post_attention_layernorm.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.k_norm.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.k_proj.bias": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.k_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.o_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.q_norm.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.q_proj.bias": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.q_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.v_proj.bias": "model-00084-of-00092.safetensors",
+ "model.layers.83.self_attn.v_proj.weight": "model-00084-of-00092.safetensors",
+ "model.layers.84.input_layernorm.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.0.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.0.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.0.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.1.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.1.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.1.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.10.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.10.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.10.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.100.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.100.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.100.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.101.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.101.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.101.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.102.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.102.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.102.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.103.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.103.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.103.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.104.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.104.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.104.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.105.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.105.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.105.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.106.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.106.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.106.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.107.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.107.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.107.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.108.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.108.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.108.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.109.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.109.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.109.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.11.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.11.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.11.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.110.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.110.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.110.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.111.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.111.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.111.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.112.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.112.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.112.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.113.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.113.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.113.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.114.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.114.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.114.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.115.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.115.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.115.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.116.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.116.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.116.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.117.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.117.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.117.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.118.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.118.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.118.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.119.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.119.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.119.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.12.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.12.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.12.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.120.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.120.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.120.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.121.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.121.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.121.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.122.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.122.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.122.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.123.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.123.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.123.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.124.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.124.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.124.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.125.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.125.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.125.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.126.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.126.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.126.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.127.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.127.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.127.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.128.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.128.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.128.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.129.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.129.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.129.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.13.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.13.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.13.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.130.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.130.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.130.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.131.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.131.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.131.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.132.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.132.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.132.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.133.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.133.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.133.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.134.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.134.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.134.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.135.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.135.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.135.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.136.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.136.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.136.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.137.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.137.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.137.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.138.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.138.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.138.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.139.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.139.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.139.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.14.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.14.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.14.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.140.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.140.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.140.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.141.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.141.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.141.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.142.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.142.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.142.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.143.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.143.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.143.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.144.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.144.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.144.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.145.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.145.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.145.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.146.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.146.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.146.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.147.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.147.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.147.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.148.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.148.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.148.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.149.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.149.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.149.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.15.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.15.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.15.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.150.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.150.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.150.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.151.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.151.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.151.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.152.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.152.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.152.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.153.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.153.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.153.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.154.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.154.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.154.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.155.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.155.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.155.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.156.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.156.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.156.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.157.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.157.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.157.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.158.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.158.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.158.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.159.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.159.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.159.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.16.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.16.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.16.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.17.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.17.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.17.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.18.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.18.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.18.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.19.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.19.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.19.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.2.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.2.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.2.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.20.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.20.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.20.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.21.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.21.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.21.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.22.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.22.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.22.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.23.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.23.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.23.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.24.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.24.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.24.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.25.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.25.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.25.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.26.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.26.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.26.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.27.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.27.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.27.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.28.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.28.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.28.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.29.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.29.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.29.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.3.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.3.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.3.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.30.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.30.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.30.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.31.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.31.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.31.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.32.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.32.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.32.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.33.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.33.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.33.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.34.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.34.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.34.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.35.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.35.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.35.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.36.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.36.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.36.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.37.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.37.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.37.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.38.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.38.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.38.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.39.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.39.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.39.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.4.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.4.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.4.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.40.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.40.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.40.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.41.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.41.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.41.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.42.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.42.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.42.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.43.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.43.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.43.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.44.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.44.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.44.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.45.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.45.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.45.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.46.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.46.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.46.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.47.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.47.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.47.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.48.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.48.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.48.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.49.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.49.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.49.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.5.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.5.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.5.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.50.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.50.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.50.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.51.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.51.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.51.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.52.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.52.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.52.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.53.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.53.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.53.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.54.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.54.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.54.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.55.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.55.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.55.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.56.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.56.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.56.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.57.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.57.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.57.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.58.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.58.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.58.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.59.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.59.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.59.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.6.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.6.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.6.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.60.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.60.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.60.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.61.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.61.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.61.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.62.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.62.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.62.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.63.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.63.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.63.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.64.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.64.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.64.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.65.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.65.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.65.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.66.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.66.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.66.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.67.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.67.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.67.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.68.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.68.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.68.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.69.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.69.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.69.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.7.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.7.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.7.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.70.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.70.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.70.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.71.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.71.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.71.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.72.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.72.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.72.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.73.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.73.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.73.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.74.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.74.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.74.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.75.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.75.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.75.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.76.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.76.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.76.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.77.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.77.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.77.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.78.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.78.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.78.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.79.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.79.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.79.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.8.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.8.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.8.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.80.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.80.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.80.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.81.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.81.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.81.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.82.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.82.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.82.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.83.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.83.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.83.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.84.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.84.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.84.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.85.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.85.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.85.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.86.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.86.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.86.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.87.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.87.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.87.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.88.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.88.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.88.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.89.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.89.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.89.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.9.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.9.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.9.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.90.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.90.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.90.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.91.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.91.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.91.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.92.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.92.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.92.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.93.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.93.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.93.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.94.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.94.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.94.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.95.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.95.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.95.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.96.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.96.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.96.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.97.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.97.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.97.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.98.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.98.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.98.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.99.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.99.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.experts.99.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.gate.e_score_correction_bias": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.gate.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.shared_experts.down_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.shared_experts.gate_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.mlp.shared_experts.up_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.post_attention_layernorm.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.k_norm.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.k_proj.bias": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.k_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.o_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.q_norm.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.q_proj.bias": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.q_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.v_proj.bias": "model-00085-of-00092.safetensors",
+ "model.layers.84.self_attn.v_proj.weight": "model-00085-of-00092.safetensors",
+ "model.layers.85.input_layernorm.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.0.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.0.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.0.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.1.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.1.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.1.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.10.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.10.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.10.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.100.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.100.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.100.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.101.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.101.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.101.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.102.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.102.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.102.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.103.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.103.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.103.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.104.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.104.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.104.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.105.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.105.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.105.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.106.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.106.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.106.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.107.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.107.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.107.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.108.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.108.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.108.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.109.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.109.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.109.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.11.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.11.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.11.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.110.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.110.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.110.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.111.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.111.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.111.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.112.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.112.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.112.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.113.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.113.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.113.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.114.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.114.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.114.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.115.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.115.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.115.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.116.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.116.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.116.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.117.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.117.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.117.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.118.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.118.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.118.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.119.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.119.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.119.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.12.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.12.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.12.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.120.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.120.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.120.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.121.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.121.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.121.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.122.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.122.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.122.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.123.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.123.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.123.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.124.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.124.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.124.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.125.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.125.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.125.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.126.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.126.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.126.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.127.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.127.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.127.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.128.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.128.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.128.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.129.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.129.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.129.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.13.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.13.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.13.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.130.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.130.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.130.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.131.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.131.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.131.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.132.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.132.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.132.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.133.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.133.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.133.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.134.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.134.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.134.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.135.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.135.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.135.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.136.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.136.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.136.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.137.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.137.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.137.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.138.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.138.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.138.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.139.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.139.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.139.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.14.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.14.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.14.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.140.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.140.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.140.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.141.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.141.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.141.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.142.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.142.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.142.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.143.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.143.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.143.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.144.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.144.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.144.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.145.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.145.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.145.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.146.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.146.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.146.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.147.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.147.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.147.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.148.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.148.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.148.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.149.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.149.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.149.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.15.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.15.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.15.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.150.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.150.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.150.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.151.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.151.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.151.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.152.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.152.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.152.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.153.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.153.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.153.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.154.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.154.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.154.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.155.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.155.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.155.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.156.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.156.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.156.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.157.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.157.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.157.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.158.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.158.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.158.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.159.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.159.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.159.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.16.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.16.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.16.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.17.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.17.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.17.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.18.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.18.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.18.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.19.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.19.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.19.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.2.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.2.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.2.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.20.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.20.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.20.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.21.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.21.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.21.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.22.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.22.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.22.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.23.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.23.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.23.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.24.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.24.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.24.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.25.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.25.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.25.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.26.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.26.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.26.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.27.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.27.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.27.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.28.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.28.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.28.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.29.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.29.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.29.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.3.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.3.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.3.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.30.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.30.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.30.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.31.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.31.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.31.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.32.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.32.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.32.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.33.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.33.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.33.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.34.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.34.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.34.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.35.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.35.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.35.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.36.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.36.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.36.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.37.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.37.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.37.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.38.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.38.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.38.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.39.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.39.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.39.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.4.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.4.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.4.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.40.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.40.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.40.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.41.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.41.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.41.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.42.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.42.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.42.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.43.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.43.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.43.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.44.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.44.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.44.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.45.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.45.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.45.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.46.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.46.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.46.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.47.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.47.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.47.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.48.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.48.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.48.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.49.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.49.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.49.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.5.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.5.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.5.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.50.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.50.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.50.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.51.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.51.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.51.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.52.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.52.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.52.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.53.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.53.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.53.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.54.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.54.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.54.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.55.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.55.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.55.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.56.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.56.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.56.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.57.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.57.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.57.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.58.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.58.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.58.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.59.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.59.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.59.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.6.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.6.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.6.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.60.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.60.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.60.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.61.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.61.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.61.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.62.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.62.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.62.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.63.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.63.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.63.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.64.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.64.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.64.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.65.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.65.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.65.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.66.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.66.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.66.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.67.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.67.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.67.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.68.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.68.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.68.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.69.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.69.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.69.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.7.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.7.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.7.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.70.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.70.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.70.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.71.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.71.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.71.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.72.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.72.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.72.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.73.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.73.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.73.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.74.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.74.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.74.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.75.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.75.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.75.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.76.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.76.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.76.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.77.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.77.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.77.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.78.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.78.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.78.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.79.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.79.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.79.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.8.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.8.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.8.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.80.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.80.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.80.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.81.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.81.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.81.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.82.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.82.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.82.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.83.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.83.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.83.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.84.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.84.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.84.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.85.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.85.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.85.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.86.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.86.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.86.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.87.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.87.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.87.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.88.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.88.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.88.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.89.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.89.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.89.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.9.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.9.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.9.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.90.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.90.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.90.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.91.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.91.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.91.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.92.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.92.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.92.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.93.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.93.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.93.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.94.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.94.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.94.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.95.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.95.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.95.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.96.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.96.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.96.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.97.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.97.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.97.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.98.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.98.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.98.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.99.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.99.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.experts.99.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.gate.e_score_correction_bias": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.gate.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.shared_experts.down_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.shared_experts.gate_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.mlp.shared_experts.up_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.post_attention_layernorm.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.k_norm.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.k_proj.bias": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.k_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.o_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.q_norm.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.q_proj.bias": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.q_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.v_proj.bias": "model-00086-of-00092.safetensors",
+ "model.layers.85.self_attn.v_proj.weight": "model-00086-of-00092.safetensors",
+ "model.layers.86.input_layernorm.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.0.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.0.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.0.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.1.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.1.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.1.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.10.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.10.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.10.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.100.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.100.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.100.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.101.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.101.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.101.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.102.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.102.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.102.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.103.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.103.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.103.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.104.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.104.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.104.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.105.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.105.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.105.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.106.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.106.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.106.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.107.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.107.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.107.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.108.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.108.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.108.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.109.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.109.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.109.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.11.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.11.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.11.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.110.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.110.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.110.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.111.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.111.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.111.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.112.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.112.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.112.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.113.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.113.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.113.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.114.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.114.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.114.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.115.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.115.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.115.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.116.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.116.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.116.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.117.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.117.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.117.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.118.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.118.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.118.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.119.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.119.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.119.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.12.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.12.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.12.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.120.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.120.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.120.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.121.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.121.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.121.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.122.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.122.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.122.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.123.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.123.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.123.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.124.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.124.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.124.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.125.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.125.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.125.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.126.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.126.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.126.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.127.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.127.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.127.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.128.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.128.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.128.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.129.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.129.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.129.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.13.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.13.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.13.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.130.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.130.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.130.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.131.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.131.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.131.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.132.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.132.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.132.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.133.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.133.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.133.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.134.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.134.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.134.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.135.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.135.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.135.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.136.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.136.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.136.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.137.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.137.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.137.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.138.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.138.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.138.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.139.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.139.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.139.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.14.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.14.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.14.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.140.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.140.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.140.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.141.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.141.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.141.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.142.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.142.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.142.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.143.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.143.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.143.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.144.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.144.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.144.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.145.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.145.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.145.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.146.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.146.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.146.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.147.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.147.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.147.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.148.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.148.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.148.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.149.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.149.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.149.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.15.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.15.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.15.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.150.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.150.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.150.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.151.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.151.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.151.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.152.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.152.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.152.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.153.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.153.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.153.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.154.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.154.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.154.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.155.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.155.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.155.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.156.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.156.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.156.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.157.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.157.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.157.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.158.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.158.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.158.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.159.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.159.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.159.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.16.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.16.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.16.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.17.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.17.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.17.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.18.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.18.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.18.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.19.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.19.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.19.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.2.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.2.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.2.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.20.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.20.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.20.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.21.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.21.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.21.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.22.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.22.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.22.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.23.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.23.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.23.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.24.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.24.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.24.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.25.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.25.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.25.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.26.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.26.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.26.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.27.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.27.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.27.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.28.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.28.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.28.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.29.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.29.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.29.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.3.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.3.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.3.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.30.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.30.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.30.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.31.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.31.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.31.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.32.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.32.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.32.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.33.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.33.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.33.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.34.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.34.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.34.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.35.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.35.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.35.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.36.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.36.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.36.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.37.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.37.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.37.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.38.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.38.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.38.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.39.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.39.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.39.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.4.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.4.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.4.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.40.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.40.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.40.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.41.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.41.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.41.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.42.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.42.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.42.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.43.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.43.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.43.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.44.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.44.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.44.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.45.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.45.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.45.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.46.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.46.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.46.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.47.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.47.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.47.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.48.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.48.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.48.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.49.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.49.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.49.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.5.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.5.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.5.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.50.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.50.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.50.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.51.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.51.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.51.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.52.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.52.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.52.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.53.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.53.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.53.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.54.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.54.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.54.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.55.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.55.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.55.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.56.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.56.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.56.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.57.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.57.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.57.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.58.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.58.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.58.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.59.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.59.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.59.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.6.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.6.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.6.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.60.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.60.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.60.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.61.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.61.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.61.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.62.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.62.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.62.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.63.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.63.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.63.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.64.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.64.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.64.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.65.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.65.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.65.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.66.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.66.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.66.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.67.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.67.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.67.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.68.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.68.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.68.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.69.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.69.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.69.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.7.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.7.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.7.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.70.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.70.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.70.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.71.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.71.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.71.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.72.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.72.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.72.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.73.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.73.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.73.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.74.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.74.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.74.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.75.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.75.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.75.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.76.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.76.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.76.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.77.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.77.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.77.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.78.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.78.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.78.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.79.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.79.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.79.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.8.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.8.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.8.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.80.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.80.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.80.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.81.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.81.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.81.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.82.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.82.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.82.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.83.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.83.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.83.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.84.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.84.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.84.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.85.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.85.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.85.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.86.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.86.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.86.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.87.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.87.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.87.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.88.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.88.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.88.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.89.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.89.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.89.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.9.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.9.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.9.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.90.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.90.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.90.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.91.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.91.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.91.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.92.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.92.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.92.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.93.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.93.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.93.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.94.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.94.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.94.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.95.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.95.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.95.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.96.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.96.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.96.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.97.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.97.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.97.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.98.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.98.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.98.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.99.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.99.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.experts.99.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.gate.e_score_correction_bias": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.gate.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.shared_experts.down_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.shared_experts.gate_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.mlp.shared_experts.up_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.post_attention_layernorm.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.k_norm.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.k_proj.bias": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.k_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.o_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.q_norm.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.q_proj.bias": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.q_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.v_proj.bias": "model-00087-of-00092.safetensors",
+ "model.layers.86.self_attn.v_proj.weight": "model-00087-of-00092.safetensors",
+ "model.layers.87.input_layernorm.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.0.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.0.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.0.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.1.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.1.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.1.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.10.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.10.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.10.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.100.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.100.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.100.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.101.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.101.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.101.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.102.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.102.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.102.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.103.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.103.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.103.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.104.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.104.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.104.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.105.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.105.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.105.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.106.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.106.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.106.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.107.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.107.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.107.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.108.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.108.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.108.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.109.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.109.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.109.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.11.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.11.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.11.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.110.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.110.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.110.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.111.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.111.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.111.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.112.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.112.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.112.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.113.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.113.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.113.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.114.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.114.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.114.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.115.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.115.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.115.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.116.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.116.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.116.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.117.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.117.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.117.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.118.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.118.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.118.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.119.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.119.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.119.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.12.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.12.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.12.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.120.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.120.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.120.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.121.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.121.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.121.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.122.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.122.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.122.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.123.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.123.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.123.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.124.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.124.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.124.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.125.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.125.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.125.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.126.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.126.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.126.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.127.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.127.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.127.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.128.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.128.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.128.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.129.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.129.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.129.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.13.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.13.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.13.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.130.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.130.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.130.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.131.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.131.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.131.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.132.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.132.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.132.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.133.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.133.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.133.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.134.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.134.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.134.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.135.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.135.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.135.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.136.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.136.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.136.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.137.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.137.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.137.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.138.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.138.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.138.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.139.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.139.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.139.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.14.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.14.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.14.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.140.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.140.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.140.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.141.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.141.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.141.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.142.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.142.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.142.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.143.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.143.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.143.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.144.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.144.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.144.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.145.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.145.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.145.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.146.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.146.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.146.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.147.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.147.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.147.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.148.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.148.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.148.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.149.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.149.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.149.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.15.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.15.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.15.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.150.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.150.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.150.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.151.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.151.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.151.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.152.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.152.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.152.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.153.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.153.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.153.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.154.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.154.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.154.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.155.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.155.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.155.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.156.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.156.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.156.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.157.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.157.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.157.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.158.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.158.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.158.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.159.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.159.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.159.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.16.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.16.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.16.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.17.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.17.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.17.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.18.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.18.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.18.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.19.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.19.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.19.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.2.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.2.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.2.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.20.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.20.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.20.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.21.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.21.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.21.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.22.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.22.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.22.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.23.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.23.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.23.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.24.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.24.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.24.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.25.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.25.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.25.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.26.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.26.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.26.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.27.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.27.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.27.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.28.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.28.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.28.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.29.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.29.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.29.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.3.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.3.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.3.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.30.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.30.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.30.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.31.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.31.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.31.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.32.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.32.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.32.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.33.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.33.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.33.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.34.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.34.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.34.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.35.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.35.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.35.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.36.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.36.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.36.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.37.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.37.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.37.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.38.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.38.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.38.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.39.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.39.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.39.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.4.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.4.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.4.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.40.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.40.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.40.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.41.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.41.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.41.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.42.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.42.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.42.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.43.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.43.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.43.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.44.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.44.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.44.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.45.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.45.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.45.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.46.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.46.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.46.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.47.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.47.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.47.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.48.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.48.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.48.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.49.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.49.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.49.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.5.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.5.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.5.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.50.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.50.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.50.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.51.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.51.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.51.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.52.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.52.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.52.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.53.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.53.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.53.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.54.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.54.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.54.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.55.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.55.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.55.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.56.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.56.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.56.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.57.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.57.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.57.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.58.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.58.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.58.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.59.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.59.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.59.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.6.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.6.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.6.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.60.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.60.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.60.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.61.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.61.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.61.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.62.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.62.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.62.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.63.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.63.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.63.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.64.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.64.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.64.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.65.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.65.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.65.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.66.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.66.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.66.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.67.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.67.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.67.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.68.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.68.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.68.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.69.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.69.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.69.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.7.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.7.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.7.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.70.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.70.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.70.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.71.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.71.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.71.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.72.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.72.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.72.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.73.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.73.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.73.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.74.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.74.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.74.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.75.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.75.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.75.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.76.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.76.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.76.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.77.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.77.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.77.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.78.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.78.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.78.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.79.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.79.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.79.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.8.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.8.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.8.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.80.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.80.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.80.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.81.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.81.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.81.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.82.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.82.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.82.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.83.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.83.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.83.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.84.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.84.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.84.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.85.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.85.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.85.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.86.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.86.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.86.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.87.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.87.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.87.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.88.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.88.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.88.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.89.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.89.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.89.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.9.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.9.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.9.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.90.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.90.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.90.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.91.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.91.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.91.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.92.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.92.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.92.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.93.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.93.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.93.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.94.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.94.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.94.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.95.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.95.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.95.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.96.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.96.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.96.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.97.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.97.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.97.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.98.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.98.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.98.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.99.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.99.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.experts.99.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.gate.e_score_correction_bias": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.gate.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.shared_experts.down_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.shared_experts.gate_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.mlp.shared_experts.up_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.post_attention_layernorm.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.k_norm.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.k_proj.bias": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.k_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.o_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.q_norm.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.q_proj.bias": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.q_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.v_proj.bias": "model-00088-of-00092.safetensors",
+ "model.layers.87.self_attn.v_proj.weight": "model-00088-of-00092.safetensors",
+ "model.layers.88.input_layernorm.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.0.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.0.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.0.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.1.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.1.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.1.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.10.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.10.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.10.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.100.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.100.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.100.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.101.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.101.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.101.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.102.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.102.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.102.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.103.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.103.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.103.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.104.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.104.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.104.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.105.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.105.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.105.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.106.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.106.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.106.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.107.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.107.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.107.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.108.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.108.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.108.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.109.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.109.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.109.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.11.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.11.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.11.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.110.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.110.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.110.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.111.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.111.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.111.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.112.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.112.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.112.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.113.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.113.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.113.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.114.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.114.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.114.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.115.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.115.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.115.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.116.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.116.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.116.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.117.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.117.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.117.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.118.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.118.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.118.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.119.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.119.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.119.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.12.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.12.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.12.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.120.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.120.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.120.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.121.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.121.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.121.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.122.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.122.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.122.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.123.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.123.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.123.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.124.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.124.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.124.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.125.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.125.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.125.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.126.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.126.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.126.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.127.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.127.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.127.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.128.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.128.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.128.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.129.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.129.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.129.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.13.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.13.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.13.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.130.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.130.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.130.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.131.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.131.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.131.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.132.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.132.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.132.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.133.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.133.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.133.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.134.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.134.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.134.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.135.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.135.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.135.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.136.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.136.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.136.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.137.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.137.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.137.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.138.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.138.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.138.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.139.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.139.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.139.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.14.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.14.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.14.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.140.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.140.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.140.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.141.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.141.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.141.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.142.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.142.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.142.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.143.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.143.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.143.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.144.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.144.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.144.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.145.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.145.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.145.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.146.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.146.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.146.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.147.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.147.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.147.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.148.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.148.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.148.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.149.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.149.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.149.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.15.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.15.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.15.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.150.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.150.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.150.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.151.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.151.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.151.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.152.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.152.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.152.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.153.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.153.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.153.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.154.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.154.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.154.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.155.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.155.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.155.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.156.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.156.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.156.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.157.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.157.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.157.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.158.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.158.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.158.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.159.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.159.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.159.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.16.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.16.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.16.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.17.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.17.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.17.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.18.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.18.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.18.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.19.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.19.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.19.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.2.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.2.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.2.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.20.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.20.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.20.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.21.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.21.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.21.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.22.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.22.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.22.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.23.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.23.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.23.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.24.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.24.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.24.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.25.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.25.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.25.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.26.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.26.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.26.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.27.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.27.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.27.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.28.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.28.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.28.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.29.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.29.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.29.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.3.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.3.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.3.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.30.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.30.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.30.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.31.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.31.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.31.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.32.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.32.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.32.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.33.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.33.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.33.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.34.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.34.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.34.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.35.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.35.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.35.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.36.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.36.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.36.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.37.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.37.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.37.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.38.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.38.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.38.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.39.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.39.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.39.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.4.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.4.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.4.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.40.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.40.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.40.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.41.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.41.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.41.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.42.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.42.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.42.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.43.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.43.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.43.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.44.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.44.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.44.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.45.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.45.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.45.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.46.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.46.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.46.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.47.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.47.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.47.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.48.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.48.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.48.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.49.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.49.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.49.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.5.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.5.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.5.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.50.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.50.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.50.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.51.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.51.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.51.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.52.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.52.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.52.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.53.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.53.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.53.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.54.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.54.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.54.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.55.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.55.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.55.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.56.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.56.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.56.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.57.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.57.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.57.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.58.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.58.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.58.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.59.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.59.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.59.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.6.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.6.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.6.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.60.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.60.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.60.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.61.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.61.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.61.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.62.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.62.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.62.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.63.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.63.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.63.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.64.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.64.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.64.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.65.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.65.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.65.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.66.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.66.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.66.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.67.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.67.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.67.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.68.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.68.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.68.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.69.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.69.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.69.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.7.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.7.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.7.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.70.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.70.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.70.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.71.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.71.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.71.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.72.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.72.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.72.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.73.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.73.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.73.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.74.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.74.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.74.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.75.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.75.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.75.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.76.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.76.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.76.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.77.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.77.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.77.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.78.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.78.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.78.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.79.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.79.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.79.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.8.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.8.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.8.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.80.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.80.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.80.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.81.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.81.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.81.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.82.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.82.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.82.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.83.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.83.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.83.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.84.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.84.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.84.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.85.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.85.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.85.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.86.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.86.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.86.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.87.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.87.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.87.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.88.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.88.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.88.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.89.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.89.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.89.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.9.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.9.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.9.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.90.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.90.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.90.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.91.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.91.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.91.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.92.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.92.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.92.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.93.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.93.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.93.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.94.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.94.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.94.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.95.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.95.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.95.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.96.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.96.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.96.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.97.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.97.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.97.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.98.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.98.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.98.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.99.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.99.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.experts.99.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.gate.e_score_correction_bias": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.gate.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.shared_experts.down_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.shared_experts.gate_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.mlp.shared_experts.up_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.post_attention_layernorm.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.k_norm.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.k_proj.bias": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.k_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.o_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.q_norm.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.q_proj.bias": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.q_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.v_proj.bias": "model-00089-of-00092.safetensors",
+ "model.layers.88.self_attn.v_proj.weight": "model-00089-of-00092.safetensors",
+ "model.layers.89.input_layernorm.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.0.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.0.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.0.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.1.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.1.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.1.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.10.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.10.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.10.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.100.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.100.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.100.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.101.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.101.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.101.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.102.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.102.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.102.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.103.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.103.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.103.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.104.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.104.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.104.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.105.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.105.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.105.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.106.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.106.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.106.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.107.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.107.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.107.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.108.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.108.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.108.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.109.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.109.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.109.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.11.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.11.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.11.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.110.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.110.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.110.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.111.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.111.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.111.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.112.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.112.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.112.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.113.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.113.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.113.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.114.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.114.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.114.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.115.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.115.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.115.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.116.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.116.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.116.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.117.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.117.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.117.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.118.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.118.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.118.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.119.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.119.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.119.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.12.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.12.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.12.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.120.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.120.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.120.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.121.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.121.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.121.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.122.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.122.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.122.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.123.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.123.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.123.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.124.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.124.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.124.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.125.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.125.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.125.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.126.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.126.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.126.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.127.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.127.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.127.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.128.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.128.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.128.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.129.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.129.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.129.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.13.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.13.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.13.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.130.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.130.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.130.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.131.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.131.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.131.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.132.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.132.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.132.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.133.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.133.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.133.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.134.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.134.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.134.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.135.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.135.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.135.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.136.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.136.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.136.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.137.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.137.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.137.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.138.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.138.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.138.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.139.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.139.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.139.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.14.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.14.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.14.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.140.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.140.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.140.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.141.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.141.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.141.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.142.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.142.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.142.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.143.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.143.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.143.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.144.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.144.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.144.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.145.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.145.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.145.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.146.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.146.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.146.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.147.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.147.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.147.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.148.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.148.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.148.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.149.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.149.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.149.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.15.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.15.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.15.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.150.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.150.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.150.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.151.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.151.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.151.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.152.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.152.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.152.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.153.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.153.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.153.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.154.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.154.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.154.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.155.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.155.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.155.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.156.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.156.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.156.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.157.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.157.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.157.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.158.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.158.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.158.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.159.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.159.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.159.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.16.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.16.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.16.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.17.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.17.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.17.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.18.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.18.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.18.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.19.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.19.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.19.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.2.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.2.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.2.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.20.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.20.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.20.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.21.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.21.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.21.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.22.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.22.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.22.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.23.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.23.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.23.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.24.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.24.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.24.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.25.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.25.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.25.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.26.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.26.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.26.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.27.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.27.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.27.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.28.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.28.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.28.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.29.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.29.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.29.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.3.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.3.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.3.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.30.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.30.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.30.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.31.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.31.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.31.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.32.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.32.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.32.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.33.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.33.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.33.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.34.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.34.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.34.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.35.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.35.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.35.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.36.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.36.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.36.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.37.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.37.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.37.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.38.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.38.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.38.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.39.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.39.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.39.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.4.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.4.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.4.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.40.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.40.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.40.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.41.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.41.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.41.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.42.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.42.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.42.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.43.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.43.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.43.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.44.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.44.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.44.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.45.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.45.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.45.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.46.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.46.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.46.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.47.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.47.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.47.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.48.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.48.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.48.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.49.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.49.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.49.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.5.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.5.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.5.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.50.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.50.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.50.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.51.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.51.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.51.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.52.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.52.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.52.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.53.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.53.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.53.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.54.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.54.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.54.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.55.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.55.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.55.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.56.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.56.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.56.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.57.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.57.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.57.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.58.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.58.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.58.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.59.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.59.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.59.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.6.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.6.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.6.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.60.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.60.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.60.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.61.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.61.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.61.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.62.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.62.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.62.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.63.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.63.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.63.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.64.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.64.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.64.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.65.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.65.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.65.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.66.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.66.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.66.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.67.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.67.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.67.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.68.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.68.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.68.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.69.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.69.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.69.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.7.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.7.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.7.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.70.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.70.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.70.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.71.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.71.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.71.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.72.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.72.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.72.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.73.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.73.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.73.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.74.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.74.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.74.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.75.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.75.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.75.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.76.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.76.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.76.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.77.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.77.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.77.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.78.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.78.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.78.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.79.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.79.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.79.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.8.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.8.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.8.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.80.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.80.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.80.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.81.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.81.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.81.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.82.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.82.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.82.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.83.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.83.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.83.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.84.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.84.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.84.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.85.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.85.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.85.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.86.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.86.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.86.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.87.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.87.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.87.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.88.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.88.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.88.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.89.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.89.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.89.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.9.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.9.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.9.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.90.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.90.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.90.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.91.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.91.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.91.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.92.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.92.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.92.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.93.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.93.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.93.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.94.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.94.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.94.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.95.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.95.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.95.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.96.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.96.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.96.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.97.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.97.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.97.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.98.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.98.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.98.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.99.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.99.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.experts.99.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.gate.e_score_correction_bias": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.gate.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.shared_experts.down_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.shared_experts.gate_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.mlp.shared_experts.up_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.post_attention_layernorm.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.k_norm.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.k_proj.bias": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.k_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.o_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.q_norm.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.q_proj.bias": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.q_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.v_proj.bias": "model-00090-of-00092.safetensors",
+ "model.layers.89.self_attn.v_proj.weight": "model-00090-of-00092.safetensors",
+ "model.layers.90.input_layernorm.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.0.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.0.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.0.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.1.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.1.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.1.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.10.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.10.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.10.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.100.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.100.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.100.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.101.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.101.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.101.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.102.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.102.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.102.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.103.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.103.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.103.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.104.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.104.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.104.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.105.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.105.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.105.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.106.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.106.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.106.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.107.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.107.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.107.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.108.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.108.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.108.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.109.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.109.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.109.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.11.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.11.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.11.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.110.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.110.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.110.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.111.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.111.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.111.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.112.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.112.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.112.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.113.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.113.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.113.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.114.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.114.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.114.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.115.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.115.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.115.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.116.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.116.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.116.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.117.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.117.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.117.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.118.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.118.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.118.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.119.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.119.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.119.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.12.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.12.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.12.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.120.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.120.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.120.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.121.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.121.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.121.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.122.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.122.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.122.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.123.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.123.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.123.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.124.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.124.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.124.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.125.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.125.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.125.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.126.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.126.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.126.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.127.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.127.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.127.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.128.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.128.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.128.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.129.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.129.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.129.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.13.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.13.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.13.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.130.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.130.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.130.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.131.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.131.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.131.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.132.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.132.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.132.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.133.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.133.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.133.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.134.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.134.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.134.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.135.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.135.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.135.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.136.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.136.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.136.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.137.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.137.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.137.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.138.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.138.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.138.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.139.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.139.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.139.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.14.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.14.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.14.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.140.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.140.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.140.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.141.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.141.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.141.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.142.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.142.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.142.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.143.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.143.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.143.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.144.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.144.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.144.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.145.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.145.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.145.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.146.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.146.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.146.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.147.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.147.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.147.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.148.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.148.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.148.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.149.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.149.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.149.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.15.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.15.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.15.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.150.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.150.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.150.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.151.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.151.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.151.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.152.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.152.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.152.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.153.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.153.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.153.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.154.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.154.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.154.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.155.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.155.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.155.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.156.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.156.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.156.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.157.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.157.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.157.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.158.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.158.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.158.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.159.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.159.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.159.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.16.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.16.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.16.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.17.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.17.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.17.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.18.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.18.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.18.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.19.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.19.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.19.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.2.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.2.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.2.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.20.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.20.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.20.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.21.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.21.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.21.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.22.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.22.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.22.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.23.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.23.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.23.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.24.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.24.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.24.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.25.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.25.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.25.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.26.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.26.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.26.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.27.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.27.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.27.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.28.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.28.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.28.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.29.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.29.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.29.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.3.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.3.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.3.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.30.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.30.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.30.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.31.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.31.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.31.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.32.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.32.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.32.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.33.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.33.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.33.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.34.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.34.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.34.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.35.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.35.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.35.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.36.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.36.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.36.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.37.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.37.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.37.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.38.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.38.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.38.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.39.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.39.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.39.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.4.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.4.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.4.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.40.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.40.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.40.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.41.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.41.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.41.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.42.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.42.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.42.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.43.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.43.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.43.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.44.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.44.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.44.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.45.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.45.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.45.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.46.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.46.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.46.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.47.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.47.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.47.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.48.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.48.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.48.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.49.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.49.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.49.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.5.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.5.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.5.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.50.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.50.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.50.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.51.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.51.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.51.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.52.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.52.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.52.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.53.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.53.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.53.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.54.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.54.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.54.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.55.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.55.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.55.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.56.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.56.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.56.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.57.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.57.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.57.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.58.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.58.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.58.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.59.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.59.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.59.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.6.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.6.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.6.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.60.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.60.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.60.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.61.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.61.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.61.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.62.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.62.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.62.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.63.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.63.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.63.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.64.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.64.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.64.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.65.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.65.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.65.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.66.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.66.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.66.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.67.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.67.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.67.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.68.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.68.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.68.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.69.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.69.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.69.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.7.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.7.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.7.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.70.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.70.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.70.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.71.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.71.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.71.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.72.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.72.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.72.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.73.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.73.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.73.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.74.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.74.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.74.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.75.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.75.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.75.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.76.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.76.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.76.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.77.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.77.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.77.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.78.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.78.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.78.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.79.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.79.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.79.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.8.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.8.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.8.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.80.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.80.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.80.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.81.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.81.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.81.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.82.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.82.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.82.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.83.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.83.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.83.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.84.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.84.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.84.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.85.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.85.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.85.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.86.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.86.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.86.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.87.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.87.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.87.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.88.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.88.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.88.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.89.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.89.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.89.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.9.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.9.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.9.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.90.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.90.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.90.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.91.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.91.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.91.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.92.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.92.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.92.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.93.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.93.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.93.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.94.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.94.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.94.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.95.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.95.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.95.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.96.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.96.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.96.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.97.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.97.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.97.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.98.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.98.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.98.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.99.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.99.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.experts.99.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.gate.e_score_correction_bias": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.gate.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.shared_experts.down_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.shared_experts.gate_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.mlp.shared_experts.up_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.post_attention_layernorm.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.k_norm.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.k_proj.bias": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.k_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.o_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.q_norm.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.q_proj.bias": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.q_proj.weight": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.v_proj.bias": "model-00091-of-00092.safetensors",
+ "model.layers.90.self_attn.v_proj.weight": "model-00091-of-00092.safetensors",
+ "lm_head.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.input_layernorm.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.0.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.0.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.0.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.1.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.1.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.1.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.10.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.10.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.10.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.100.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.100.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.100.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.101.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.101.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.101.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.102.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.102.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.102.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.103.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.103.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.103.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.104.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.104.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.104.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.105.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.105.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.105.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.106.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.106.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.106.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.107.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.107.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.107.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.108.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.108.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.108.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.109.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.109.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.109.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.11.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.11.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.11.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.110.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.110.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.110.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.111.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.111.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.111.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.112.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.112.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.112.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.113.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.113.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.113.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.114.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.114.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.114.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.115.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.115.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.115.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.116.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.116.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.116.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.117.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.117.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.117.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.118.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.118.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.118.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.119.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.119.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.119.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.12.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.12.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.12.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.120.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.120.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.120.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.121.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.121.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.121.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.122.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.122.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.122.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.123.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.123.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.123.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.124.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.124.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.124.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.125.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.125.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.125.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.126.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.126.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.126.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.127.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.127.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.127.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.128.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.128.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.128.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.129.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.129.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.129.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.13.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.13.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.13.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.130.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.130.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.130.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.131.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.131.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.131.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.132.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.132.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.132.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.133.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.133.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.133.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.134.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.134.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.134.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.135.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.135.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.135.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.136.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.136.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.136.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.137.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.137.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.137.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.138.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.138.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.138.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.139.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.139.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.139.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.14.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.14.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.14.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.140.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.140.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.140.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.141.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.141.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.141.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.142.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.142.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.142.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.143.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.143.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.143.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.144.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.144.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.144.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.145.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.145.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.145.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.146.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.146.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.146.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.147.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.147.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.147.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.148.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.148.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.148.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.149.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.149.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.149.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.15.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.15.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.15.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.150.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.150.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.150.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.151.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.151.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.151.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.152.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.152.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.152.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.153.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.153.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.153.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.154.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.154.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.154.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.155.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.155.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.155.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.156.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.156.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.156.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.157.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.157.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.157.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.158.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.158.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.158.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.159.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.159.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.159.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.16.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.16.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.16.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.17.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.17.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.17.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.18.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.18.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.18.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.19.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.19.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.19.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.2.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.2.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.2.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.20.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.20.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.20.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.21.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.21.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.21.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.22.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.22.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.22.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.23.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.23.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.23.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.24.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.24.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.24.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.25.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.25.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.25.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.26.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.26.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.26.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.27.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.27.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.27.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.28.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.28.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.28.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.29.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.29.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.29.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.3.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.3.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.3.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.30.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.30.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.30.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.31.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.31.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.31.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.32.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.32.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.32.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.33.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.33.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.33.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.34.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.34.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.34.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.35.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.35.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.35.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.36.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.36.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.36.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.37.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.37.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.37.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.38.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.38.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.38.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.39.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.39.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.39.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.4.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.4.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.4.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.40.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.40.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.40.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.41.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.41.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.41.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.42.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.42.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.42.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.43.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.43.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.43.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.44.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.44.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.44.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.45.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.45.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.45.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.46.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.46.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.46.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.47.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.47.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.47.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.48.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.48.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.48.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.49.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.49.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.49.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.5.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.5.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.5.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.50.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.50.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.50.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.51.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.51.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.51.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.52.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.52.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.52.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.53.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.53.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.53.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.54.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.54.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.54.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.55.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.55.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.55.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.56.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.56.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.56.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.57.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.57.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.57.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.58.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.58.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.58.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.59.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.59.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.59.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.6.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.6.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.6.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.60.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.60.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.60.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.61.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.61.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.61.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.62.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.62.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.62.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.63.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.63.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.63.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.64.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.64.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.64.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.65.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.65.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.65.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.66.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.66.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.66.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.67.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.67.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.67.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.68.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.68.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.68.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.69.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.69.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.69.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.7.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.7.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.7.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.70.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.70.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.70.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.71.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.71.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.71.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.72.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.72.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.72.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.73.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.73.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.73.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.74.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.74.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.74.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.75.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.75.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.75.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.76.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.76.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.76.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.77.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.77.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.77.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.78.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.78.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.78.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.79.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.79.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.79.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.8.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.8.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.8.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.80.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.80.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.80.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.81.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.81.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.81.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.82.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.82.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.82.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.83.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.83.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.83.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.84.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.84.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.84.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.85.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.85.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.85.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.86.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.86.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.86.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.87.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.87.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.87.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.88.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.88.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.88.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.89.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.89.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.89.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.9.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.9.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.9.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.90.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.90.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.90.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.91.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.91.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.91.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.92.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.92.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.92.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.93.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.93.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.93.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.94.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.94.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.94.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.95.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.95.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.95.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.96.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.96.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.96.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.97.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.97.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.97.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.98.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.98.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.98.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.99.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.99.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.experts.99.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.gate.e_score_correction_bias": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.gate.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.shared_experts.down_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.shared_experts.gate_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.mlp.shared_experts.up_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.post_attention_layernorm.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.k_norm.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.k_proj.bias": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.k_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.o_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.q_norm.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.q_proj.bias": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.q_proj.weight": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.v_proj.bias": "model-00092-of-00092.safetensors",
+ "model.layers.91.self_attn.v_proj.weight": "model-00092-of-00092.safetensors",
+ "model.norm.weight": "model-00092-of-00092.safetensors",
+ "model.layers.92.eh_proj.weight": "mtp.safetensors",
+ "model.layers.92.embed_tokens.weight": "mtp.safetensors",
+ "model.layers.92.enorm.weight": "mtp.safetensors",
+ "model.layers.92.hnorm.weight": "mtp.safetensors",
+ "model.layers.92.input_layernorm.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.0.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.0.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.0.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.1.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.1.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.1.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.10.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.10.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.10.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.100.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.100.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.100.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.101.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.101.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.101.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.102.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.102.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.102.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.103.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.103.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.103.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.104.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.104.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.104.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.105.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.105.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.105.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.106.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.106.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.106.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.107.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.107.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.107.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.108.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.108.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.108.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.109.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.109.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.109.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.11.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.11.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.11.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.110.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.110.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.110.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.111.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.111.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.111.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.112.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.112.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.112.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.113.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.113.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.113.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.114.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.114.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.114.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.115.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.115.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.115.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.116.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.116.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.116.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.117.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.117.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.117.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.118.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.118.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.118.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.119.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.119.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.119.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.12.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.12.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.12.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.120.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.120.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.120.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.121.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.121.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.121.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.122.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.122.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.122.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.123.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.123.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.123.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.124.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.124.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.124.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.125.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.125.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.125.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.126.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.126.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.126.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.127.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.127.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.127.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.128.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.128.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.128.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.129.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.129.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.129.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.13.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.13.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.13.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.130.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.130.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.130.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.131.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.131.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.131.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.132.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.132.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.132.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.133.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.133.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.133.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.134.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.134.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.134.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.135.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.135.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.135.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.136.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.136.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.136.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.137.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.137.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.137.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.138.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.138.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.138.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.139.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.139.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.139.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.14.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.14.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.14.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.140.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.140.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.140.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.141.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.141.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.141.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.142.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.142.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.142.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.143.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.143.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.143.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.144.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.144.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.144.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.145.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.145.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.145.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.146.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.146.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.146.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.147.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.147.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.147.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.148.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.148.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.148.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.149.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.149.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.149.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.15.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.15.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.15.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.150.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.150.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.150.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.151.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.151.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.151.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.152.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.152.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.152.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.153.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.153.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.153.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.154.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.154.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.154.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.155.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.155.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.155.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.156.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.156.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.156.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.157.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.157.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.157.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.158.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.158.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.158.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.159.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.159.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.159.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.16.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.16.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.16.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.17.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.17.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.17.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.18.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.18.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.18.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.19.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.19.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.19.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.2.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.2.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.2.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.20.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.20.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.20.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.21.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.21.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.21.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.22.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.22.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.22.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.23.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.23.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.23.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.24.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.24.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.24.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.25.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.25.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.25.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.26.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.26.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.26.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.27.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.27.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.27.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.28.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.28.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.28.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.29.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.29.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.29.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.3.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.3.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.3.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.30.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.30.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.30.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.31.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.31.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.31.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.32.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.32.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.32.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.33.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.33.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.33.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.34.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.34.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.34.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.35.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.35.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.35.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.36.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.36.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.36.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.37.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.37.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.37.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.38.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.38.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.38.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.39.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.39.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.39.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.4.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.4.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.4.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.40.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.40.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.40.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.41.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.41.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.41.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.42.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.42.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.42.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.43.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.43.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.43.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.44.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.44.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.44.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.45.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.45.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.45.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.46.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.46.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.46.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.47.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.47.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.47.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.48.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.48.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.48.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.49.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.49.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.49.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.5.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.5.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.5.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.50.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.50.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.50.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.51.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.51.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.51.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.52.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.52.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.52.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.53.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.53.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.53.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.54.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.54.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.54.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.55.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.55.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.55.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.56.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.56.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.56.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.57.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.57.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.57.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.58.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.58.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.58.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.59.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.59.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.59.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.6.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.6.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.6.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.60.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.60.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.60.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.61.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.61.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.61.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.62.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.62.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.62.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.63.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.63.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.63.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.64.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.64.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.64.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.65.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.65.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.65.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.66.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.66.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.66.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.67.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.67.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.67.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.68.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.68.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.68.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.69.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.69.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.69.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.7.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.7.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.7.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.70.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.70.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.70.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.71.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.71.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.71.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.72.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.72.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.72.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.73.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.73.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.73.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.74.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.74.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.74.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.75.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.75.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.75.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.76.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.76.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.76.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.77.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.77.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.77.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.78.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.78.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.78.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.79.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.79.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.79.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.8.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.8.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.8.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.80.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.80.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.80.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.81.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.81.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.81.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.82.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.82.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.82.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.83.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.83.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.83.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.84.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.84.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.84.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.85.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.85.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.85.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.86.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.86.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.86.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.87.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.87.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.87.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.88.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.88.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.88.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.89.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.89.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.89.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.9.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.9.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.9.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.90.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.90.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.90.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.91.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.91.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.91.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.92.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.92.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.92.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.93.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.93.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.93.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.94.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.94.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.94.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.95.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.95.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.95.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.96.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.96.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.96.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.97.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.97.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.97.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.98.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.98.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.98.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.99.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.99.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.experts.99.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.gate.e_score_correction_bias": "mtp.safetensors",
+ "model.layers.92.mlp.gate.weight": "mtp.safetensors",
+ "model.layers.92.mlp.shared_experts.down_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.shared_experts.gate_proj.weight": "mtp.safetensors",
+ "model.layers.92.mlp.shared_experts.up_proj.weight": "mtp.safetensors",
+ "model.layers.92.post_attention_layernorm.weight": "mtp.safetensors",
+ "model.layers.92.self_attn.k_norm.weight": "mtp.safetensors",
+ "model.layers.92.self_attn.k_proj.bias": "mtp.safetensors",
+ "model.layers.92.self_attn.k_proj.weight": "mtp.safetensors",
+ "model.layers.92.self_attn.o_proj.weight": "mtp.safetensors",
+ "model.layers.92.self_attn.q_norm.weight": "mtp.safetensors",
+ "model.layers.92.self_attn.q_proj.bias": "mtp.safetensors",
+ "model.layers.92.self_attn.q_proj.weight": "mtp.safetensors",
+ "model.layers.92.self_attn.v_proj.bias": "mtp.safetensors",
+ "model.layers.92.self_attn.v_proj.weight": "mtp.safetensors",
+ "model.layers.92.shared_head.head.weight": "mtp.safetensors",
+ "model.layers.92.shared_head.norm.weight": "mtp.safetensors"
+ }
+}
\ No newline at end of file
diff --git a/mtp.safetensors b/mtp.safetensors
new file mode 100644
index 0000000000000000000000000000000000000000..338451fe52e9417e8811679d614cb27315a1723e
--- /dev/null
+++ b/mtp.safetensors
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b8ad4e6152da40142d21eb935fea628c500d7529508c86c3c01c1dbce0f34d1a
+size 11079987712
diff --git a/special_tokens_map.json b/special_tokens_map.json
new file mode 100644
index 0000000000000000000000000000000000000000..4df37ba53ea68ab791aff73d452ec032d46b68be
--- /dev/null
+++ b/special_tokens_map.json
@@ -0,0 +1,40 @@
+{
+ "additional_special_tokens": [
+ "<|endoftext|>",
+ "[MASK]",
+ "[gMASK]",
+ "[sMASK]",
+ "<sop>",
+ "<eop>",
+ "<|system|>",
+ "<|user|>",
+ "<|assistant|>",
+ "<|observation|>",
+ "<|begin_of_image|>",
+ "<|end_of_image|>",
+ "<|begin_of_video|>",
+ "<|end_of_video|>",
+ "<|begin_of_audio|>",
+ "<|end_of_audio|>",
+ "<|begin_of_transcription|>",
+ "<|end_of_transcription|>",
+ "<|code_prefix|>",
+ "<|code_middle|>",
+ "<|code_suffix|>",
+ "/nothink"
+ ],
+ "eos_token": {
+ "content": "<|endoftext|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false
+ },
+ "pad_token": {
+ "content": "[MASK]",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false
+ }
+}
diff --git a/tokenizer.json b/tokenizer.json
new file mode 100644
index 0000000000000000000000000000000000000000..e3ed3c66baf1ec4de61840b0abf02142687bfed8
--- /dev/null
+++ b/tokenizer.json
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bda8e2146c3bb7b7e0fc96dcc4f0aeff041c6c27952e3ace0665663ebff346ba
+size 19970700
diff --git a/tokenizer_config.json b/tokenizer_config.json
new file mode 100644
index 0000000000000000000000000000000000000000..3eb05858502df056de30379693d8d507879315a0
--- /dev/null
+++ b/tokenizer_config.json
@@ -0,0 +1,328 @@
+{
+ "added_tokens_decoder": {
+ "151329": {
+ "content": "<|endoftext|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151330": {
+ "content": "[MASK]",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151331": {
+ "content": "[gMASK]",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151332": {
+ "content": "[sMASK]",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151333": {
+ "content": "<sop>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151334": {
+ "content": "<eop>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151335": {
+ "content": "<|system|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151336": {
+ "content": "<|user|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151337": {
+ "content": "<|assistant|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151338": {
+ "content": "<|observation|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151339": {
+ "content": "<|begin_of_image|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151340": {
+ "content": "<|end_of_image|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151341": {
+ "content": "<|begin_of_video|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151342": {
+ "content": "<|end_of_video|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151343": {
+ "content": "<|begin_of_audio|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151344": {
+ "content": "<|end_of_audio|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151345": {
+ "content": "<|begin_of_transcription|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151346": {
+ "content": "<|end_of_transcription|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151347": {
+ "content": "<|code_prefix|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151348": {
+ "content": "<|code_middle|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151349": {
+ "content": "<|code_suffix|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151350": {
+ "content": "<think>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151351": {
+ "content": "</think>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151352": {
+ "content": "<tool_call>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151353": {
+ "content": "</tool_call>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151354": {
+ "content": "<tool_response>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151355": {
+ "content": "</tool_response>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151356": {
+ "content": "<arg_key>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151357": {
+ "content": "</arg_key>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151358": {
+ "content": "<arg_value>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151359": {
+ "content": "</arg_value>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151360": {
+ "content": "/nothink",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": true
+ },
+ "151361": {
+ "content": "<|begin_of_box|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151362": {
+ "content": "<|end_of_box|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151363": {
+ "content": "<|image|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ },
+ "151364": {
+ "content": "<|video|>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false,
+ "special": false
+ }
+ },
+ "additional_special_tokens": [
+ "<|endoftext|>",
+ "[MASK]",
+ "[gMASK]",
+ "[sMASK]",
+ "<sop>",
+ "<eop>",
+ "<|system|>",
+ "<|user|>",
+ "<|assistant|>",
+ "<|observation|>",
+ "<|begin_of_image|>",
+ "<|end_of_image|>",
+ "<|begin_of_video|>",
+ "<|end_of_video|>",
+ "<|begin_of_audio|>",
+ "<|end_of_audio|>",
+ "<|begin_of_transcription|>",
+ "<|end_of_transcription|>",
+ "<|code_prefix|>",
+ "<|code_middle|>",
+ "<|code_suffix|>",
+ "/nothink"
+ ],
+ "bos_token": null,
+ "clean_up_tokenization_spaces": false,
+ "do_lower_case": false,
+ "eos_token": "<|endoftext|>",
+ "extra_special_tokens": {},
+ "model_max_length": 202752,
+ "pad_token": "[MASK]",
+ "padding_side": "left",
+ "remove_space": false,
+ "tokenizer_class": "PreTrainedTokenizerFast",
+ "unk_token": null,
+ "chat_template": "{# Unsloth template fixes #}\n[gMASK]\n{%- if tools -%}\n<|system|>\n# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within XML tags:\n\n{% for tool in tools %}\n{{ tool | tojson|string }}\n{% endfor %}\n\n\nFor each function call, output the function name and arguments within the following XML format:\n{function-name}{arg-key-1}{arg-value-1}{arg-key-2}{arg-value-2}...{%- endif -%}\n{%- macro visible_text(content) -%}\n {%- if content is string -%}\n {{- content }}\n {%- elif content is iterable and content is not mapping -%}\n {%- for item in content -%}\n {%- if item is mapping and item.type == 'text' -%}\n {{- item.text }}\n {%- elif item is string -%}\n {{- item }}\n {%- endif -%}\n {%- endfor -%}\n {%- else -%}\n {{- content }}\n {%- endif -%}\n{%- endmacro -%}\n{%- set ns = namespace(last_user_index=-1) %}\n{%- for m in messages %}\n {%- if m.role == 'user' %}\n {% set ns.last_user_index = loop.index0 -%}\n {%- endif %}\n{%- endfor %}\n{% for m in messages %}\n{%- if m.role == 'user' -%}<|user|>{{ visible_text(m.content) }}\n{%- elif m.role == 'assistant' -%}\n<|assistant|>\n{%- set reasoning_content = '' %}\n{%- set content = visible_text(m.content) %}\n{%- if m.reasoning_content is string %}\n {%- set reasoning_content = m.reasoning_content %}\n{%- else %}\n {%- if '' in content %}\n {%- set reasoning_content = ((content.split('')|first).rstrip('\\n').split('')|last).lstrip('\\n') %}\n {%- set content = (content.split('')|last).lstrip('\\n') %}\n {%- endif %}\n{%- endif %}\n{%- if ((clear_thinking is defined and not clear_thinking) or loop.index0 > ns.last_user_index) and reasoning_content -%}\n{{ '' + reasoning_content.strip() + ''}}\n{%- else -%}\n{{ '' }}\n{%- endif -%}\n{%- if content.strip() -%}\n{{ content.strip() }}\n{%- endif -%}\n{% if m.tool_calls %}\n{% for tc in m.tool_calls %}\n{%- if tc.function %}\n {%- set tc = tc.function %}\n{%- endif %}\n{{- '' + tc.name -}}\n{% set _args = tc.arguments %}{%- if _args is mapping %}{% for k, v in _args|items %}{{ k }}{{ v | tojson|string if v is not string else v }}{% endfor %}{%- endif %}{% endfor %}\n{% endif %}\n{%- elif m.role == 'tool' -%}\n{%- if m.content is string -%}\n{%- if loop.first or (messages[loop.index0 - 1].role != \"tool\") %}\n {{- '<|observation|>' }}\n{%- endif %}\n{{- '' }}\n{{- m.content }}\n{{- '' }}\n{%- else -%}\n<|observation|>{% for tr in m.content %}\n{{ tr.output if tr.output is defined else tr }}{% endfor -%}\n{% endif -%}\n{%- elif m.role == 'system' -%}\n<|system|>{{ visible_text(m.content) }}\n{%- endif -%}\n{%- endfor -%}\n{%- if add_generation_prompt -%}\n <|assistant|>{{- '' if (enable_thinking is defined and not enable_thinking) else '' -}}\n{%- endif -%}\n{# Copyright 2025-present Unsloth. Apache 2.0 License. #}"
+}
\ No newline at end of file