moelanoby
/

ALM-Qwen-0.5B-testing

Model card Files Files and versions

moelanoby commited on May 24, 2025

Commit

8d7daea

·

verified ·

1 Parent(s): b00ef0f

Update README.md

Files changed (1) hide show

README.md +2 -3

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ This repository contains an Attention-Linked Memory augmented Qwen model (ALM-Qw
 *   **AttentionLinkedMemory (ALM)**: A custom PyTorch module for two-level attention-based retrieval from structured memory. (See `ALM.py`)
 *   **QwenGenerator**: Wraps a Hugging Face Qwen model (e.g., Qwen2.5-0.5B-Instruct or Qwen2.5-7B-Instruct) for text generation.
-*   **ALMQwenModel_HF**: The main class orchestrating the ALM retrieval and Qwen generation. (See `alm_qwen_hf.py`)
 *   **Saved Weights & Config**:
     *   `alm_layer_state_dict.pth`: Trained weights for the ALM layer.
     *   `alm_qwen_hf_config.json`: Configuration for the `ALMQwenModel_HF`, including ALM parameters and paths to the Qwen components.
@@ -30,7 +30,7 @@ This repository contains an Attention-Linked Memory augmented Qwen model (ALM-Qw
 3.  **Load the model in Python**:
     ```python
-    from alm_qwen_hf import ALMQwenModel_HF # Make sure alm_qwen_hf.py and ALM.py are in your PYTHONPATH
     import torch
     # Desired device
@@ -79,4 +79,3 @@ The ALM layer (`alm_layer_state_dict.pth`) might have been trained. The Qwen mod
 *   The `load_model` method in `alm_qwen_hf.py` handles the reconstruction of the composite model.
 ---
-*This README was auto-generated. Please update with more specific details about your model.*

 *   **AttentionLinkedMemory (ALM)**: A custom PyTorch module for two-level attention-based retrieval from structured memory. (See `ALM.py`)
 *   **QwenGenerator**: Wraps a Hugging Face Qwen model (e.g., Qwen2.5-0.5B-Instruct or Qwen2.5-7B-Instruct) for text generation.
+*   **ALMQwenModel_HF**: The main class orchestrating the ALM retrieval and Qwen generation. (See `alm_qwen.py`)
 *   **Saved Weights & Config**:
     *   `alm_layer_state_dict.pth`: Trained weights for the ALM layer.
     *   `alm_qwen_hf_config.json`: Configuration for the `ALMQwenModel_HF`, including ALM parameters and paths to the Qwen components.
 3.  **Load the model in Python**:
     ```python
+    from alm_qwen import ALMQwenModel_HF # Make sure alm_qwen_hf.py and ALM.py are in your PYTHONPATH
     import torch
     # Desired device
 *   The `load_model` method in `alm_qwen_hf.py` handles the reconstruction of the composite model.
 ---