Upload folder using huggingface_hub

Browse files

Files changed (9) hide show

README.md +67 -39
checkpoint-10/README.md +67 -39
checkpoint-10/rng_state.pth +1 -1
checkpoint-10/trainer_state.json +80 -0
checkpoint-10/training_args.bin +1 -1
checkpoint-9/README.md +66 -38
checkpoint-9/rng_state.pth +1 -1
checkpoint-9/trainer_state.json +72 -0
checkpoint-9/training_args.bin +1 -1

README.md CHANGED Viewed

@@ -9,31 +9,31 @@ tags:
 - loss:MultipleNegativesRankingLoss
 base_model: BAAI/bge-large-zh-v1.5
 widget:
-- source_sentence: 機器學習演算法工程師
   sentences:
-  - 採用 React 或 Vue 生態系搭配 TypeScript，能有效提升大型休閒平台前端介面的可維護性與使用者互動體驗。
-  - 運用 PyTorch 或 TensorFlow 建立深度學習模型，並透過 scikit-learn 進行特徵工程
-  - 規劃國內外旅遊路線、協調交通住宿並提供旅客諮詢服務是旅行社企劃人員的主要職責。
-- source_sentence: 開發休閒類行動應用程式的伺服器端框架選擇
   sentences:
-  - 對於需要快速開發與迭代的娛樂型 App，Python 的 FastAPI 或 Django 是建立穩定後端服務的理想選擇。
-  - 可以詢問其對 Vuex 狀態管理、Vue Router 路由控制以及 Nuxt.js 框架優缺點的看法。
-  - 職位要求熟悉 Gin 或 Echo 等 Web 框架，並具備 gRPC 通訊協定的開發實務。
-- source_sentence: 雲端架構維運與 DevOps 技能
   sentences:
-  - 熟練操作 AWS EC2 與 RDS 服務，並具備 Docker 容器化技術與 Kubernetes 叢集管理能力。
-  - 除了基礎的 Express 外，具備 Nest.js 或 Koa 框架開發經驗者在後端職位競爭中更具優勢。
-  - 利用 Jenkins 與 GitLab CI 結合 Docker 容器技術，可以實現遊戲伺服器版本的無縫更新與持續整合 (CI/CD)。
-- source_sentence: React 前端工程師職位要求
   sentences:
-  - 具備 Docker 與 Kubernetes 部署經驗，並能透過 Helm 管理 K8s 應用程式的生命週期。
-  - 擅長運用 Matplotlib、Seaborn 或 Plotly 等工具將複雜數據轉化為直觀的圖表與報表。
-  - 應徵者需熟悉 Redux 狀態管理、React Router 路由配置，並具備 Next.js 伺服器端渲染開發經驗。
-- source_sentence: 尋找熟悉微服務架構的 Java 開發者
   sentences:
-  - 本職位要求應徵者具備 Django 或 FastAPI 的實務經驗，並能運用 Celery 處理非同步任務。
-  - 透過 Jenkins、GitLab CI 或 GitHub Actions 實作 CI/CD 管線以提升軟體交付效率
-  - 精通 Spring Boot 與 Spring Cloud，並具備 Maven 或 Gradle 專案建置經驗
 datasets:
 - yenstdi/embbedding_text_1111
 pipeline_tag: sentence-similarity
@@ -91,9 +91,9 @@ from sentence_transformers import SentenceTransformer
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
-    '尋找熟悉微服務架構的 Java 開發者',
-    '精通 Spring Boot 與 Spring Cloud，並具備 Maven 或 Gradle 專案建置經驗',
-    '透過 Jenkins、GitLab CI 或 GitHub Actions 實作 CI/CD 管線以提升軟體交付效率',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
@@ -102,9 +102,9 @@ print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
-# tensor([[1.0000, 0.6469, 0.3960],
-#         [0.6469, 1.0000, 0.4859],
-#         [0.3960, 0.4859, 1.0000]])
 ```
 <!--
@@ -172,9 +172,37 @@ You can finetune this model on your own dataset.
   }
   ```
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 - `num_train_epochs`: 10
@@ -185,7 +213,7 @@ You can finetune this model on your own dataset.
 - `overwrite_output_dir`: False
 - `do_predict`: False
-- `eval_strategy`: no
 - `prediction_loss_only`: True
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
@@ -306,18 +334,18 @@ You can finetune this model on your own dataset.
 </details>
 ### Training Logs
-| Epoch | Step | Training Loss |
-|:-----:|:----:|:-------------:|
-| 1.0   | 1    | 2.4771        |
-| 2.0   | 2    | 2.5696        |
-| 3.0   | 3    | 2.4096        |
-| 4.0   | 4    | 2.4025        |
-| 5.0   | 5    | 2.2429        |
-| 6.0   | 6    | 2.1532        |
-| 7.0   | 7    | 2.0347        |
-| 8.0   | 8    | 1.8817        |
-| 9.0   | 9    | 1.7143        |
-| 10.0  | 10   | 1.4908        |
 ### Framework Versions

 - loss:MultipleNegativesRankingLoss
 base_model: BAAI/bge-large-zh-v1.5
 widget:
+- source_sentence: 定期定額投資的優缺點
   sentences:
+  - 近年來大型語言模型與擴散模型在圖像與文本生成領域取得突破性進展。
+  - 國際間的生產與物流體系正在發生重大的組織變革與調整。
+  - 透過固定金額長期投入，投資者能有效攤平市場波動帶來的成本風險，但可能在強勁牛市中錯失更高的單筆申購報酬。
+- source_sentence: 京都最適合賞楓的季節是什麼時候？
   sentences:
+  - 秋季前往關西地區，十一月中旬到十二月初通常是觀賞紅葉的最佳時機。
+  - 使用 asyncio 庫可以實現非阻塞的 I/O 操作，顯著提升網路爬蟲或 API 請求的並發性能。
+  - 在快速變遷的職場環境中，持續獲取新知識與技能是維持個人競爭力與適應力的關鍵。
+- source_sentence: 長期失眠該如何改善？
   sentences:
+  - 建立規律的作息時間、減少睡前使用電子產品，並營造舒適的睡眠環境有助於緩解睡眠障礙。
+  - 植物透過葉綠體吸收太陽能，將二氧化碳與水轉化為葡萄糖並釋放氧氣，這是地球能量循環的基礎。
+  - 辦理信用貸款通常要求穩定的收入證明與良好的信用評分。
+- source_sentence: 如何減少日常生活中的碳足跡
   sentences:
+  - 在推動組織數位化過程中，往往會面臨技術債、員工抗拒改變以及缺乏清晰策略等難題。
+  - 該行動裝置的電力持久度表現優異，能滿足長時間使用的需求。
+  - 透過節能家電、搭乘大眾運輸及實踐蔬食生活，能有效降低個人的環境影響。
+- source_sentence: 京都最值得造訪的歷史古蹟
   sentences:
+  - 這座日本古都擁有眾多世界文化遺產，如清水寺、金閣寺與伏見稻荷大社，是體驗傳統文化的必經之地。
+  - 患者通常會感到胸口灼熱（俗稱火燒心）、胃酸逆流，有時還會伴隨慢性咳嗽或喉嚨發炎。
+  - 這種以植物性食物、橄欖油和適量深海魚為主的飲食模式，被證實能有效預防心血管疾病。
 datasets:
 - yenstdi/embbedding_text_1111
 pipeline_tag: sentence-similarity
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
+    '京都最值得造訪的歷史古蹟',
+    '這座日本古都擁有眾多世界文化遺產，如清水寺、金閣寺與伏見稻荷大社，是體驗傳統文化的必經之地。',
+    '患者通常會感到胸口灼熱（俗稱火燒心）、胃酸逆流，有時還會伴隨慢性咳嗽或喉嚨發炎。',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
+# tensor([[1.0000, 0.6986, 0.1182],
+#         [0.6986, 1.0000, 0.1618],
+#         [0.1182, 0.1618, 1.0000]])
 ```
 <!--
   }
   ```
+### Evaluation Dataset
+#### embbedding_text_1111
+* Dataset: [embbedding_text_1111](https://huggingface.co/datasets/yenstdi/embbedding_text_1111) at [610ac14](https://huggingface.co/datasets/yenstdi/embbedding_text_1111/tree/610ac1456cc501416303e62f7813f2ee87ee95e3)
+* Size: 25 evaluation samples
+* Columns: <code>anchor</code> and <code>positive</code>
+* Approximate statistics based on the first 25 samples:
+  |         | anchor                                                                             | positive                                                                           |
+  |:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
+  | type    | string                                                                             | string                                                                             |
+  | details | <ul><li>min: 11 tokens</li><li>mean: 15.16 tokens</li><li>max: 20 tokens</li></ul> | <ul><li>min: 28 tokens</li><li>mean: 39.36 tokens</li><li>max: 54 tokens</li></ul> |
+* Samples:
+  | anchor                         | positive                                                              |
+  |:-------------------------------|:----------------------------------------------------------------------|
+  | <code>這款手機的電池續航力令人印象深刻。</code> | <code>該行動裝置的電力持久度表現優異，能滿足長時間使用的需求。</code>                             |
+  | <code>什麼是機器學習中的過擬合現象？</code>   | <code>當模型在訓練數據上表現極佳，但在未見過的測試數據上預測準確率大幅下降時，通常就是發生了 Overfitting。</code> |
+  | <code>2024年全球永續能源趨勢報告</code>   | <code>隨著各國減碳政策的推進，太陽能與離岸風電在未來幾年將成為再生能源成長的核心動力。</code>                 |
+* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
+  ```json
+  {
+      "scale": 20.0,
+      "similarity_fct": "cos_sim",
+      "gather_across_devices": false
+  }
+  ```
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
+- `eval_strategy`: epoch
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 - `num_train_epochs`: 10
 - `overwrite_output_dir`: False
 - `do_predict`: False
+- `eval_strategy`: epoch
 - `prediction_loss_only`: True
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 </details>
 ### Training Logs
+| Epoch | Step | Training Loss | Validation Loss |
+|:-----:|:----:|:-------------:|:---------------:|
+| 1.0   | 1    | 2.4771        | 0.4011          |
+| 2.0   | 2    | 2.5696        | 0.3978          |
+| 3.0   | 3    | 2.4096        | 0.3917          |
+| 4.0   | 4    | 2.4025        | 0.3832          |
+| 5.0   | 5    | 2.2429        | 0.3730          |
+| 6.0   | 6    | 2.1532        | 0.3615          |
+| 7.0   | 7    | 2.0347        | 0.3499          |
+| 8.0   | 8    | 1.8817        | 0.3384          |
+| 9.0   | 9    | 1.7143        | 0.3277          |
+| 10.0  | 10   | 1.4908        | 0.3180          |
 ### Framework Versions

checkpoint-10/README.md CHANGED Viewed

@@ -9,31 +9,31 @@ tags:
 - loss:MultipleNegativesRankingLoss
 base_model: BAAI/bge-large-zh-v1.5
 widget:
-- source_sentence: 機器學習演算法工程師
   sentences:
-  - 採用 React 或 Vue 生態系搭配 TypeScript，能有效提升大型休閒平台前端介面的可維護性與使用者互動體驗。
-  - 運用 PyTorch 或 TensorFlow 建立深度學習模型，並透過 scikit-learn 進行特徵工程
-  - 規劃國內外旅遊路線、協調交通住宿並提供旅客諮詢服務是旅行社企劃人員的主要職責。
-- source_sentence: 開發休閒類行動應用程式的伺服器端框架選擇
   sentences:
-  - 對於需要快速開發與迭代的娛樂型 App，Python 的 FastAPI 或 Django 是建立穩定後端服務的理想選擇。
-  - 可以詢問其對 Vuex 狀態管理、Vue Router 路由控制以及 Nuxt.js 框架優缺點的看法。
-  - 職位要求熟悉 Gin 或 Echo 等 Web 框架，並具備 gRPC 通訊協定的開發實務。
-- source_sentence: 雲端架構維運與 DevOps 技能
   sentences:
-  - 熟練操作 AWS EC2 與 RDS 服務，並具備 Docker 容器化技術與 Kubernetes 叢集管理能力。
-  - 除了基礎的 Express 外，具備 Nest.js 或 Koa 框架開發經驗者在後端職位競爭中更具優勢。
-  - 利用 Jenkins 與 GitLab CI 結合 Docker 容器技術，可以實現遊戲伺服器版本的無縫更新與持續整合 (CI/CD)。
-- source_sentence: React 前端工程師職位要求
   sentences:
-  - 具備 Docker 與 Kubernetes 部署經驗，並能透過 Helm 管理 K8s 應用程式的生命週期。
-  - 擅長運用 Matplotlib、Seaborn 或 Plotly 等工具將複雜數據轉化為直觀的圖表與報表。
-  - 應徵者需熟悉 Redux 狀態管理、React Router 路由配置，並具備 Next.js 伺服器端渲染開發經驗。
-- source_sentence: 尋找熟悉微服務架構的 Java 開發者
   sentences:
-  - 本職位要求應徵者具備 Django 或 FastAPI 的實務經驗，並能運用 Celery 處理非同步任務。
-  - 透過 Jenkins、GitLab CI 或 GitHub Actions 實作 CI/CD 管線以提升軟體交付效率
-  - 精通 Spring Boot 與 Spring Cloud，並具備 Maven 或 Gradle 專案建置經驗
 datasets:
 - yenstdi/embbedding_text_1111
 pipeline_tag: sentence-similarity
@@ -91,9 +91,9 @@ from sentence_transformers import SentenceTransformer
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
-    '尋找熟悉微服務架構的 Java 開發者',
-    '精通 Spring Boot 與 Spring Cloud，並具備 Maven 或 Gradle 專案建置經驗',
-    '透過 Jenkins、GitLab CI 或 GitHub Actions 實作 CI/CD 管線以提升軟體交付效率',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
@@ -102,9 +102,9 @@ print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
-# tensor([[1.0000, 0.6469, 0.3960],
-#         [0.6469, 1.0000, 0.4859],
-#         [0.3960, 0.4859, 1.0000]])
 ```
 <!--
@@ -172,9 +172,37 @@ You can finetune this model on your own dataset.
   }
   ```
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 - `num_train_epochs`: 10
@@ -185,7 +213,7 @@ You can finetune this model on your own dataset.
 - `overwrite_output_dir`: False
 - `do_predict`: False
-- `eval_strategy`: no
 - `prediction_loss_only`: True
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
@@ -306,18 +334,18 @@ You can finetune this model on your own dataset.
 </details>
 ### Training Logs
-| Epoch | Step | Training Loss |
-|:-----:|:----:|:-------------:|
-| 1.0   | 1    | 2.4771        |
-| 2.0   | 2    | 2.5696        |
-| 3.0   | 3    | 2.4096        |
-| 4.0   | 4    | 2.4025        |
-| 5.0   | 5    | 2.2429        |
-| 6.0   | 6    | 2.1532        |
-| 7.0   | 7    | 2.0347        |
-| 8.0   | 8    | 1.8817        |
-| 9.0   | 9    | 1.7143        |
-| 10.0  | 10   | 1.4908        |
 ### Framework Versions

 - loss:MultipleNegativesRankingLoss
 base_model: BAAI/bge-large-zh-v1.5
 widget:
+- source_sentence: 定期定額投資的優缺點
   sentences:
+  - 近年來大型語言模型與擴散模型在圖像與文本生成領域取得突破性進展。
+  - 國際間的生產與物流體系正在發生重大的組織變革與調整。
+  - 透過固定金額長期投入，投資者能有效攤平市場波動帶來的成本風險，但可能在強勁牛市中錯失更高的單筆申購報酬。
+- source_sentence: 京都最適合賞楓的季節是什麼時候？
   sentences:
+  - 秋季前往關西地區，十一月中旬到十二月初通常是觀賞紅葉的最佳時機。
+  - 使用 asyncio 庫可以實現非阻塞的 I/O 操作，顯著提升網路爬蟲或 API 請求的並發性能。
+  - 在快速變遷的職場環境中，持續獲取新知識與技能是維持個人競爭力與適應力的關鍵。
+- source_sentence: 長期失眠該如何改善？
   sentences:
+  - 建立規律的作息時間、減少睡前使用電子產品，並營造舒適的睡眠環境有助於緩解睡眠障礙。
+  - 植物透過葉綠體吸收太陽能，將二氧化碳與水轉化為葡萄糖並釋放氧氣，這是地球能量循環的基礎。
+  - 辦理信用貸款通常要求穩定的收入證明與良好的信用評分。
+- source_sentence: 如何減少日常生活中的碳足跡
   sentences:
+  - 在推動組織數位化過程中，往往會面臨技術債、員工抗拒改變以及缺乏清晰策略等難題。
+  - 該行動裝置的電力持久度表現優異，能滿足長時間使用的需求。
+  - 透過節能家電、搭乘大眾運輸及實踐蔬食生活，能有效降低個人的環境影響。
+- source_sentence: 京都最值得造訪的歷史古蹟
   sentences:
+  - 這座日本古都擁有眾多世界文化遺產，如清水寺、金閣寺與伏見稻荷大社，是體驗傳統文化的必經之地。
+  - 患者通常會感到胸口灼熱（俗稱火燒心）、胃酸逆流，有時還會伴隨慢性咳嗽或喉嚨發炎。
+  - 這種以植物性食物、橄欖油和適量深海魚為主的飲食模式，被證實能有效預防心血管疾病。
 datasets:
 - yenstdi/embbedding_text_1111
 pipeline_tag: sentence-similarity
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
+    '京都最值得造訪的歷史古蹟',
+    '這座日本古都擁有眾多世界文化遺產，如清水寺、金閣寺與伏見稻荷大社，是體驗傳統文化的必經之地。',
+    '患者通常會感到胸口灼熱（俗稱火燒心）、胃酸逆流，有時還會伴隨慢性咳嗽或喉嚨發炎。',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
+# tensor([[1.0000, 0.6986, 0.1182],
+#         [0.6986, 1.0000, 0.1618],
+#         [0.1182, 0.1618, 1.0000]])
 ```
 <!--
   }
   ```
+### Evaluation Dataset
+#### embbedding_text_1111
+* Dataset: [embbedding_text_1111](https://huggingface.co/datasets/yenstdi/embbedding_text_1111) at [610ac14](https://huggingface.co/datasets/yenstdi/embbedding_text_1111/tree/610ac1456cc501416303e62f7813f2ee87ee95e3)
+* Size: 25 evaluation samples
+* Columns: <code>anchor</code> and <code>positive</code>
+* Approximate statistics based on the first 25 samples:
+  |         | anchor                                                                             | positive                                                                           |
+  |:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
+  | type    | string                                                                             | string                                                                             |
+  | details | <ul><li>min: 11 tokens</li><li>mean: 15.16 tokens</li><li>max: 20 tokens</li></ul> | <ul><li>min: 28 tokens</li><li>mean: 39.36 tokens</li><li>max: 54 tokens</li></ul> |
+* Samples:
+  | anchor                         | positive                                                              |
+  |:-------------------------------|:----------------------------------------------------------------------|
+  | <code>這款手機的電池續航力令人印象深刻。</code> | <code>該行動裝置的電力持久度表現優異，能滿足長時間使用的需求。</code>                             |
+  | <code>什麼是機器學習中的過擬合現象？</code>   | <code>當模型在訓練數據上表現極佳，但在未見過的測試數據上預測準確率大幅下降時，通常就是發生了 Overfitting。</code> |
+  | <code>2024年全球永續能源趨勢報告</code>   | <code>隨著各國減碳政策的推進，太陽能與離岸風電在未來幾年將成為再生能源成長的核心動力。</code>                 |
+* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
+  ```json
+  {
+      "scale": 20.0,
+      "similarity_fct": "cos_sim",
+      "gather_across_devices": false
+  }
+  ```
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
+- `eval_strategy`: epoch
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 - `num_train_epochs`: 10
 - `overwrite_output_dir`: False
 - `do_predict`: False
+- `eval_strategy`: epoch
 - `prediction_loss_only`: True
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 </details>
 ### Training Logs
+| Epoch | Step | Training Loss | Validation Loss |
+|:-----:|:----:|:-------------:|:---------------:|
+| 1.0   | 1    | 2.4771        | 0.4011          |
+| 2.0   | 2    | 2.5696        | 0.3978          |
+| 3.0   | 3    | 2.4096        | 0.3917          |
+| 4.0   | 4    | 2.4025        | 0.3832          |
+| 5.0   | 5    | 2.2429        | 0.3730          |
+| 6.0   | 6    | 2.1532        | 0.3615          |
+| 7.0   | 7    | 2.0347        | 0.3499          |
+| 8.0   | 8    | 1.8817        | 0.3384          |
+| 9.0   | 9    | 1.7143        | 0.3277          |
+| 10.0  | 10   | 1.4908        | 0.3180          |
 ### Framework Versions

checkpoint-10/rng_state.pth CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:765517977b6d782ca23707e28659df678828881b9338528bd66691476ad14606
 size 14645

 version https://git-lfs.github.com/spec/v1
+oid sha256:a9ac82c8753156d3621dbec1995af4174a4a918aea621395fd088d22fb439e8f
 size 14645

checkpoint-10/trainer_state.json CHANGED Viewed

@@ -16,6 +16,14 @@
       "loss": 2.4771,
       "step": 1
     },
     {
       "epoch": 2.0,
       "grad_norm": 17.471435546875,
@@ -23,6 +31,14 @@
       "loss": 2.5696,
       "step": 2
     },
     {
       "epoch": 3.0,
       "grad_norm": 16.502605438232422,
@@ -30,6 +46,14 @@
       "loss": 2.4096,
       "step": 3
     },
     {
       "epoch": 4.0,
       "grad_norm": 15.583235740661621,
@@ -37,6 +61,14 @@
       "loss": 2.4025,
       "step": 4
     },
     {
       "epoch": 5.0,
       "grad_norm": 15.021787643432617,
@@ -44,6 +76,14 @@
       "loss": 2.2429,
       "step": 5
     },
     {
       "epoch": 6.0,
       "grad_norm": 14.483270645141602,
@@ -51,6 +91,14 @@
       "loss": 2.1532,
       "step": 6
     },
     {
       "epoch": 7.0,
       "grad_norm": 13.854901313781738,
@@ -58,6 +106,14 @@
       "loss": 2.0347,
       "step": 7
     },
     {
       "epoch": 8.0,
       "grad_norm": 13.282709121704102,
@@ -65,6 +121,14 @@
       "loss": 1.8817,
       "step": 8
     },
     {
       "epoch": 9.0,
       "grad_norm": 12.319931983947754,
@@ -72,12 +136,28 @@
       "loss": 1.7143,
       "step": 9
     },
     {
       "epoch": 10.0,
       "grad_norm": 11.570226669311523,
       "learning_rate": 4.5e-06,
       "loss": 1.4908,
       "step": 10
     }
   ],
   "logging_steps": 1,

       "loss": 2.4771,
       "step": 1
     },
+    {
+      "epoch": 1.0,
+      "eval_loss": 0.40109243988990784,
+      "eval_runtime": 0.2966,
+      "eval_samples_per_second": 84.299,
+      "eval_steps_per_second": 3.372,
+      "step": 1
+    },
     {
       "epoch": 2.0,
       "grad_norm": 17.471435546875,
       "loss": 2.5696,
       "step": 2
     },
+    {
+      "epoch": 2.0,
+      "eval_loss": 0.3977726995944977,
+      "eval_runtime": 0.3347,
+      "eval_samples_per_second": 74.701,
+      "eval_steps_per_second": 2.988,
+      "step": 2
+    },
     {
       "epoch": 3.0,
       "grad_norm": 16.502605438232422,
       "loss": 2.4096,
       "step": 3
     },
+    {
+      "epoch": 3.0,
+      "eval_loss": 0.3917332589626312,
+      "eval_runtime": 0.3382,
+      "eval_samples_per_second": 73.931,
+      "eval_steps_per_second": 2.957,
+      "step": 3
+    },
     {
       "epoch": 4.0,
       "grad_norm": 15.583235740661621,
       "loss": 2.4025,
       "step": 4
     },
+    {
+      "epoch": 4.0,
+      "eval_loss": 0.38322871923446655,
+      "eval_runtime": 0.3277,
+      "eval_samples_per_second": 76.279,
+      "eval_steps_per_second": 3.051,
+      "step": 4
+    },
     {
       "epoch": 5.0,
       "grad_norm": 15.021787643432617,
       "loss": 2.2429,
       "step": 5
     },
+    {
+      "epoch": 5.0,
+      "eval_loss": 0.37303000688552856,
+      "eval_runtime": 0.3473,
+      "eval_samples_per_second": 71.987,
+      "eval_steps_per_second": 2.879,
+      "step": 5
+    },
     {
       "epoch": 6.0,
       "grad_norm": 14.483270645141602,
       "loss": 2.1532,
       "step": 6
     },
+    {
+      "epoch": 6.0,
+      "eval_loss": 0.3615022897720337,
+      "eval_runtime": 0.3447,
+      "eval_samples_per_second": 72.53,
+      "eval_steps_per_second": 2.901,
+      "step": 6
+    },
     {
       "epoch": 7.0,
       "grad_norm": 13.854901313781738,
       "loss": 2.0347,
       "step": 7
     },
+    {
+      "epoch": 7.0,
+      "eval_loss": 0.3498750925064087,
+      "eval_runtime": 0.3575,
+      "eval_samples_per_second": 69.926,
+      "eval_steps_per_second": 2.797,
+      "step": 7
+    },
     {
       "epoch": 8.0,
       "grad_norm": 13.282709121704102,
       "loss": 1.8817,
       "step": 8
     },
+    {
+      "epoch": 8.0,
+      "eval_loss": 0.33841240406036377,
+      "eval_runtime": 0.3574,
+      "eval_samples_per_second": 69.957,
+      "eval_steps_per_second": 2.798,
+      "step": 8
+    },
     {
       "epoch": 9.0,
       "grad_norm": 12.319931983947754,
       "loss": 1.7143,
       "step": 9
     },
+    {
+      "epoch": 9.0,
+      "eval_loss": 0.3276784420013428,
+      "eval_runtime": 0.3673,
+      "eval_samples_per_second": 68.066,
+      "eval_steps_per_second": 2.723,
+      "step": 9
+    },
     {
       "epoch": 10.0,
       "grad_norm": 11.570226669311523,
       "learning_rate": 4.5e-06,
       "loss": 1.4908,
       "step": 10
+    },
+    {
+      "epoch": 10.0,
+      "eval_loss": 0.3180310130119324,
+      "eval_runtime": 0.3552,
+      "eval_samples_per_second": 70.379,
+      "eval_steps_per_second": 2.815,
+      "step": 10
     }
   ],
   "logging_steps": 1,

checkpoint-10/training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:30733f8369e31c3b52d3aad89cff6e2b530d9e8987b93399d2713a72f96ed2ab
 size 6097

 version https://git-lfs.github.com/spec/v1
+oid sha256:d7b798ac8b0e0d41d12d4c6ffa18e77feeb9b8c3acefa12c37b4d8f455c740ca
 size 6097

checkpoint-9/README.md CHANGED Viewed

@@ -9,31 +9,31 @@ tags:
 - loss:MultipleNegativesRankingLoss
 base_model: BAAI/bge-large-zh-v1.5
 widget:
-- source_sentence: 機器學習演算法工程師
   sentences:
-  - 採用 React 或 Vue 生態系搭配 TypeScript，能有效提升大型休閒平台前端介面的可維護性與使用者互動體驗。
-  - 運用 PyTorch 或 TensorFlow 建立深度學習模型，並透過 scikit-learn 進行特徵工程
-  - 規劃國內外旅遊路線、協調交通住宿並提供旅客諮詢服務是旅行社企劃人員的主要職責。
-- source_sentence: 開發休閒類行動應用程式的伺服器端框架選擇
   sentences:
-  - 對於需要快速開發與迭代的娛樂型 App，Python 的 FastAPI 或 Django 是建立穩定後端服務的理想選擇。
-  - 可以詢問其對 Vuex 狀態管理、Vue Router 路由控制以及 Nuxt.js 框架優缺點的看法。
-  - 職位要求熟悉 Gin 或 Echo 等 Web 框架，並具備 gRPC 通訊協定的開發實務。
-- source_sentence: 雲端架構維運與 DevOps 技能
   sentences:
-  - 熟練操作 AWS EC2 與 RDS 服務，並具備 Docker 容器化技術與 Kubernetes 叢集管理能力。
-  - 除了基礎的 Express 外，具備 Nest.js 或 Koa 框架開發經驗者在後端職位競爭中更具優勢。
-  - 利用 Jenkins 與 GitLab CI 結合 Docker 容器技術，可以實現遊戲伺服器版本的無縫更新與持續整合 (CI/CD)。
-- source_sentence: React 前端工程師職位要求
   sentences:
-  - 具備 Docker 與 Kubernetes 部署經驗，並能透過 Helm 管理 K8s 應用程式的生命週期。
-  - 擅長運用 Matplotlib、Seaborn 或 Plotly 等工具將複雜數據轉化為直觀的圖表與報表。
-  - 應徵者需熟悉 Redux 狀態管理、React Router 路由配置，並具備 Next.js 伺服器端渲染開發經驗。
-- source_sentence: 尋找熟悉微服務架構的 Java 開發者
   sentences:
-  - 本職位要求應徵者具備 Django 或 FastAPI 的實務經驗，並能運用 Celery 處理非同步任務。
-  - 透過 Jenkins、GitLab CI 或 GitHub Actions 實作 CI/CD 管線以提升軟體交付效率
-  - 精通 Spring Boot 與 Spring Cloud，並具備 Maven 或 Gradle 專案建置經驗
 datasets:
 - yenstdi/embbedding_text_1111
 pipeline_tag: sentence-similarity
@@ -91,9 +91,9 @@ from sentence_transformers import SentenceTransformer
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
-    '尋找熟悉微服務架構的 Java 開發者',
-    '精通 Spring Boot 與 Spring Cloud，並具備 Maven 或 Gradle 專案建置經驗',
-    '透過 Jenkins、GitLab CI 或 GitHub Actions 實作 CI/CD 管線以提升軟體交付效率',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
@@ -102,9 +102,9 @@ print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
-# tensor([[1.0000, 0.6457, 0.4137],
-#         [0.6457, 1.0000, 0.5084],
-#         [0.4137, 0.5084, 1.0000]])
 ```
 <!--
@@ -172,9 +172,37 @@ You can finetune this model on your own dataset.
   }
   ```
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 - `num_train_epochs`: 10
@@ -185,7 +213,7 @@ You can finetune this model on your own dataset.
 - `overwrite_output_dir`: False
 - `do_predict`: False
-- `eval_strategy`: no
 - `prediction_loss_only`: True
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
@@ -306,17 +334,17 @@ You can finetune this model on your own dataset.
 </details>
 ### Training Logs
-| Epoch | Step | Training Loss |
-|:-----:|:----:|:-------------:|
-| 1.0   | 1    | 2.4771        |
-| 2.0   | 2    | 2.5696        |
-| 3.0   | 3    | 2.4096        |
-| 4.0   | 4    | 2.4025        |
-| 5.0   | 5    | 2.2429        |
-| 6.0   | 6    | 2.1532        |
-| 7.0   | 7    | 2.0347        |
-| 8.0   | 8    | 1.8817        |
-| 9.0   | 9    | 1.7143        |
 ### Framework Versions

 - loss:MultipleNegativesRankingLoss
 base_model: BAAI/bge-large-zh-v1.5
 widget:
+- source_sentence: 定期定額投資的優缺點
   sentences:
+  - 近年來大型語言模型與擴散模型在圖像與文本生成領域取得突破性進展。
+  - 國際間的生產與物流體系正在發生重大的組織變革與調整。
+  - 透過固定金額長期投入，投資者能有效攤平市場波動帶來的成本風險，但可能在強勁牛市中錯失更高的單筆申購報酬。
+- source_sentence: 京都最適合賞楓的季節是什麼時候？
   sentences:
+  - 秋季前往關西地區，十一月中旬到十二月初通常是觀賞紅葉的最佳時機。
+  - 使用 asyncio 庫可以實現非阻塞的 I/O 操作，顯著提升網路爬蟲或 API 請求的並發性能。
+  - 在快速變遷的職場環境中，持續獲取新知識與技能是維持個人競爭力與適應力的關鍵。
+- source_sentence: 長期失眠該如何改善？
   sentences:
+  - 建立規律的作息時間、減少睡前使用電子產品，並營造舒適的睡眠環境有助於緩解睡眠障礙。
+  - 植物透過葉綠體吸收太陽能，將二氧化碳與水轉化為葡萄糖並釋放氧氣，這是地球能量循環的基礎。
+  - 辦理信用貸款通常要求穩定的收入證明與良好的信用評分。
+- source_sentence: 如何減少日常生活中的碳足跡
   sentences:
+  - 在推動組織數位化過程中，往往會面臨技術債、員工抗拒改變以及缺乏清晰策略等難題。
+  - 該行動裝置的電力持久度表現優異，能滿足長時間使用的需求。
+  - 透過節能家電、搭乘大眾運輸及實踐蔬食生活，能有效降低個人的環境影響。
+- source_sentence: 京都最值得造訪的歷史古蹟
   sentences:
+  - 這座日本古都擁有眾多世界文化遺產，如清水寺、金閣寺與伏見稻荷大社，是體驗傳統文化的必經之地。
+  - 患者通常會感到胸口灼熱（俗稱火燒心）、胃酸逆流，有時還會伴隨慢性咳嗽或喉嚨發炎。
+  - 這種以植物性食物、橄欖油和適量深海魚為主的飲食模式，被證實能有效預防心血管疾病。
 datasets:
 - yenstdi/embbedding_text_1111
 pipeline_tag: sentence-similarity
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
+    '京都最值得造訪的歷史古蹟',
+    '這座日本古都擁有眾多世界文化遺產，如清水寺、金閣寺與伏見稻荷大社，是體驗傳統文化的必經之地。',
+    '患者通常會感到胸口灼熱（俗稱火燒心）、胃酸逆流，有時還會伴隨慢性咳嗽或喉嚨發炎。',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
+# tensor([[1.0000, 0.6910, 0.1253],
+#         [0.6910, 1.0000, 0.1680],
+#         [0.1253, 0.1680, 1.0000]])
 ```
 <!--
   }
   ```
+### Evaluation Dataset
+#### embbedding_text_1111
+* Dataset: [embbedding_text_1111](https://huggingface.co/datasets/yenstdi/embbedding_text_1111) at [610ac14](https://huggingface.co/datasets/yenstdi/embbedding_text_1111/tree/610ac1456cc501416303e62f7813f2ee87ee95e3)
+* Size: 25 evaluation samples
+* Columns: <code>anchor</code> and <code>positive</code>
+* Approximate statistics based on the first 25 samples:
+  |         | anchor                                                                             | positive                                                                           |
+  |:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
+  | type    | string                                                                             | string                                                                             |
+  | details | <ul><li>min: 11 tokens</li><li>mean: 15.16 tokens</li><li>max: 20 tokens</li></ul> | <ul><li>min: 28 tokens</li><li>mean: 39.36 tokens</li><li>max: 54 tokens</li></ul> |
+* Samples:
+  | anchor                         | positive                                                              |
+  |:-------------------------------|:----------------------------------------------------------------------|
+  | <code>這款手機的電池續航力令人印象深刻。</code> | <code>該行動裝置的電力持久度表現優異，能滿足長時間使用的需求。</code>                             |
+  | <code>什麼是機器學習中的過擬合現象？</code>   | <code>當模型在訓練數據上表現極佳，但在未見過的測試數據上預測準確率大幅下降時，通常就是發生了 Overfitting。</code> |
+  | <code>2024年全球永續能源趨勢報告</code>   | <code>隨著各國減碳政策的推進，太陽能與離岸風電在未來幾年將成為再生能源成長的核心動力。</code>                 |
+* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
+  ```json
+  {
+      "scale": 20.0,
+      "similarity_fct": "cos_sim",
+      "gather_across_devices": false
+  }
+  ```
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
+- `eval_strategy`: epoch
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 - `num_train_epochs`: 10
 - `overwrite_output_dir`: False
 - `do_predict`: False
+- `eval_strategy`: epoch
 - `prediction_loss_only`: True
 - `per_device_train_batch_size`: 64
 - `per_device_eval_batch_size`: 64
 </details>
 ### Training Logs
+| Epoch | Step | Training Loss | Validation Loss |
+|:-----:|:----:|:-------------:|:---------------:|
+| 1.0   | 1    | 2.4771        | 0.4011          |
+| 2.0   | 2    | 2.5696        | 0.3978          |
+| 3.0   | 3    | 2.4096        | 0.3917          |
+| 4.0   | 4    | 2.4025        | 0.3832          |
+| 5.0   | 5    | 2.2429        | 0.3730          |
+| 6.0   | 6    | 2.1532        | 0.3615          |
+| 7.0   | 7    | 2.0347        | 0.3499          |
+| 8.0   | 8    | 1.8817        | 0.3384          |
+| 9.0   | 9    | 1.7143        | 0.3277          |
 ### Framework Versions

checkpoint-9/rng_state.pth CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e738727bded44cfa4e19444d45a6f5d3d02c86b73fcf4dde53227b17547553ba
 size 14645

 version https://git-lfs.github.com/spec/v1
+oid sha256:f43fa47aa893388f98cab13c029f33a762562f9bc90991890a71d60799592d2e
 size 14645

checkpoint-9/trainer_state.json CHANGED Viewed

@@ -16,6 +16,14 @@
       "loss": 2.4771,
       "step": 1
     },
     {
       "epoch": 2.0,
       "grad_norm": 17.471435546875,
@@ -23,6 +31,14 @@
       "loss": 2.5696,
       "step": 2
     },
     {
       "epoch": 3.0,
       "grad_norm": 16.502605438232422,
@@ -30,6 +46,14 @@
       "loss": 2.4096,
       "step": 3
     },
     {
       "epoch": 4.0,
       "grad_norm": 15.583235740661621,
@@ -37,6 +61,14 @@
       "loss": 2.4025,
       "step": 4
     },
     {
       "epoch": 5.0,
       "grad_norm": 15.021787643432617,
@@ -44,6 +76,14 @@
       "loss": 2.2429,
       "step": 5
     },
     {
       "epoch": 6.0,
       "grad_norm": 14.483270645141602,
@@ -51,6 +91,14 @@
       "loss": 2.1532,
       "step": 6
     },
     {
       "epoch": 7.0,
       "grad_norm": 13.854901313781738,
@@ -58,6 +106,14 @@
       "loss": 2.0347,
       "step": 7
     },
     {
       "epoch": 8.0,
       "grad_norm": 13.282709121704102,
@@ -65,12 +121,28 @@
       "loss": 1.8817,
       "step": 8
     },
     {
       "epoch": 9.0,
       "grad_norm": 12.319931983947754,
       "learning_rate": 4.000000000000001e-06,
       "loss": 1.7143,
       "step": 9
     }
   ],
   "logging_steps": 1,

       "loss": 2.4771,
       "step": 1
     },
+    {
+      "epoch": 1.0,
+      "eval_loss": 0.40109243988990784,
+      "eval_runtime": 0.2966,
+      "eval_samples_per_second": 84.299,
+      "eval_steps_per_second": 3.372,
+      "step": 1
+    },
     {
       "epoch": 2.0,
       "grad_norm": 17.471435546875,
       "loss": 2.5696,
       "step": 2
     },
+    {
+      "epoch": 2.0,
+      "eval_loss": 0.3977726995944977,
+      "eval_runtime": 0.3347,
+      "eval_samples_per_second": 74.701,
+      "eval_steps_per_second": 2.988,
+      "step": 2
+    },
     {
       "epoch": 3.0,
       "grad_norm": 16.502605438232422,
       "loss": 2.4096,
       "step": 3
     },
+    {
+      "epoch": 3.0,
+      "eval_loss": 0.3917332589626312,
+      "eval_runtime": 0.3382,
+      "eval_samples_per_second": 73.931,
+      "eval_steps_per_second": 2.957,
+      "step": 3
+    },
     {
       "epoch": 4.0,
       "grad_norm": 15.583235740661621,
       "loss": 2.4025,
       "step": 4
     },
+    {
+      "epoch": 4.0,
+      "eval_loss": 0.38322871923446655,
+      "eval_runtime": 0.3277,
+      "eval_samples_per_second": 76.279,
+      "eval_steps_per_second": 3.051,
+      "step": 4
+    },
     {
       "epoch": 5.0,
       "grad_norm": 15.021787643432617,
       "loss": 2.2429,
       "step": 5
     },
+    {
+      "epoch": 5.0,
+      "eval_loss": 0.37303000688552856,
+      "eval_runtime": 0.3473,
+      "eval_samples_per_second": 71.987,
+      "eval_steps_per_second": 2.879,
+      "step": 5
+    },
     {
       "epoch": 6.0,
       "grad_norm": 14.483270645141602,
       "loss": 2.1532,
       "step": 6
     },
+    {
+      "epoch": 6.0,
+      "eval_loss": 0.3615022897720337,
+      "eval_runtime": 0.3447,
+      "eval_samples_per_second": 72.53,
+      "eval_steps_per_second": 2.901,
+      "step": 6
+    },
     {
       "epoch": 7.0,
       "grad_norm": 13.854901313781738,
       "loss": 2.0347,
       "step": 7
     },
+    {
+      "epoch": 7.0,
+      "eval_loss": 0.3498750925064087,
+      "eval_runtime": 0.3575,
+      "eval_samples_per_second": 69.926,
+      "eval_steps_per_second": 2.797,
+      "step": 7
+    },
     {
       "epoch": 8.0,
       "grad_norm": 13.282709121704102,
       "loss": 1.8817,
       "step": 8
     },
+    {
+      "epoch": 8.0,
+      "eval_loss": 0.33841240406036377,
+      "eval_runtime": 0.3574,
+      "eval_samples_per_second": 69.957,
+      "eval_steps_per_second": 2.798,
+      "step": 8
+    },
     {
       "epoch": 9.0,
       "grad_norm": 12.319931983947754,
       "learning_rate": 4.000000000000001e-06,
       "loss": 1.7143,
       "step": 9
+    },
+    {
+      "epoch": 9.0,
+      "eval_loss": 0.3276784420013428,
+      "eval_runtime": 0.3673,
+      "eval_samples_per_second": 68.066,
+      "eval_steps_per_second": 2.723,
+      "step": 9
     }
   ],
   "logging_steps": 1,

checkpoint-9/training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:30733f8369e31c3b52d3aad89cff6e2b530d9e8987b93399d2713a72f96ed2ab
 size 6097

 version https://git-lfs.github.com/spec/v1
+oid sha256:d7b798ac8b0e0d41d12d4c6ffa18e77feeb9b8c3acefa12c37b4d8f455c740ca
 size 6097