ARM-Development
/

Llama-3.1-8B-text-1.0

Model card Files Files and versions

TravisPing commited on Apr 29, 2025

Commit

f82da8e

·

verified ·

1 Parent(s): 1c242d1

Update README.md

Files changed (1) hide show

README.md +8 -18

README.md CHANGED Viewed

@@ -39,32 +39,30 @@ Fine-tuned on ≈ 2.3k ScienceBase “data → metadata” pairs to automate cre
 Generate schema-compliant metadata text from a JSON/CSV representation of a ScienceBase item.
 ### Downstream Use
-Integrate as a micro-service in data-repository pipelines; bootstrap metadata for legacy collections.
 ### Out-of-Scope
-Open-ended content generation, legal/medical decisions, or any application outside metadata curation.
 ---
 ## Bias, Risks, and Limitations
 * Domain-specific bias toward ScienceBase field names.
 * Possible hallucination of fields when prompts are underspecified.
-* Knowledge limited to training corpus and Jan 2025 Llama 3 cutoff.
-**Recommendation:** keep a human curator in the loop and validate output against your schema.
 ---
 ## Training Details
 ### Training Data
-* ~2.3k ScienceBase records with curated metadata.
-* Pre-processing: control-char stripping, field normalisation, incomplete rows removed.
 ### Training Procedure
 | Hyper-parameter | Value |
 |-----------------|-------|
-| Max sequence length | 20 000 |
 | Precision | fp16 / bf16 (auto) |
 | Quantisation | 4-bit QLoRA (`load_in_4bit=True`) |
 | LoRA rank / α | 16 / 16 |
@@ -79,8 +77,8 @@ Open-ended content generation, legal/medical decisions, or any application outsi
 | Field | Value |
 |-------|-------|
 | GPU | 1 × NVIDIA A100 80 GB |
-| Total training hours | **TODO** |
-| Compute region / cluster | **TODO** |
 ### Software Stack
 | Package | Version |
@@ -103,21 +101,13 @@ Open-ended content generation, legal/medical decisions, or any application outsi
 *Evaluation still in progress.*
----
-## Environmental Impact
-| Field | Value |
-|-------|-------|
-| Hardware | 1 × A100-80 GB |
-| Hours | ~120 hours |
-| Cloud/HPC provider | ARM Cumulus HPC |
 ---
 ## Technical Specifications
 ### Architecture & Objective
-LoRA-tuned `Llama-3.1-8B`; causal-LM objective with structured-to-text instruction prompts.
 ---

 Generate schema-compliant metadata text from a JSON/CSV representation of a ScienceBase item.
 ### Downstream Use
+Integrate as a micro-service in data-repository pipelines.
 ### Out-of-Scope
+Open-ended content generation, or any application outside metadata curation.
 ---
 ## Bias, Risks, and Limitations
 * Domain-specific bias toward ScienceBase field names.
 * Possible hallucination of fields when prompts are underspecified.
 ---
 ## Training Details
 ### Training Data
+* ~ 2.3k ScienceBase records with curated metadata.
 ### Training Procedure
 | Hyper-parameter | Value |
 |-----------------|-------|
+| Max sequence length | 100 000 |
 | Precision | fp16 / bf16 (auto) |
 | Quantisation | 4-bit QLoRA (`load_in_4bit=True`) |
 | LoRA rank / α | 16 / 16 |
 | Field | Value |
 |-------|-------|
 | GPU | 1 × NVIDIA A100 80 GB |
+| Total training hours | ~10 hours |
+| Cloud/HPC provider | ARM Cumulus HPC |
 ### Software Stack
 | Package | Version |
 *Evaluation still in progress.*
 ---
 ## Technical Specifications
 ### Architecture & Objective
+QLoRA-tuned `Llama-3.1-8B-Instruct`; causal-LM objective with structured-to-text instruction prompts.
 ---