Sadjad Alikhani committed: Update README.md
### 8. **Tokenize and Load the Model**
Before we dive into tokenizing the dataset and loading the model, let's understand how the tokenization process is adapted to the wireless communication context. In this case, **tokenization** refers to segmenting each wireless channel into patches, similar to how Vision Transformers (ViTs) work with images. Each wireless channel is structured as a \(32 \times 32\) matrix, where rows represent antennas and columns represent subcarriers.
The tokenization process involves **dividing the channel matrix into patches**, with each patch containing information from 16 consecutive subcarriers. These patches are then **embedded** into a 64-dimensional space, providing the Transformer with a richer context for each patch. In this process, **positional encodings** are added to preserve the structural relationships within the channel, ensuring the Transformer captures both spatial and frequency dependencies.
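As a concrete illustration of the patching and embedding steps described above, here is a minimal NumPy sketch. This is not the library's actual tokenizer: the random linear projection and the sinusoidal positional encoding below are stand-ins for LWM's learned patch embedding and its own positional-encoding scheme.

```python
import numpy as np

# Toy example: a wireless channel as a 32 x 32 matrix,
# where rows are antennas and columns are subcarriers.
rng = np.random.default_rng(0)
channel = rng.standard_normal((32, 32))

# Segment each row into patches of 16 consecutive subcarriers,
# giving 32 * (32 // 16) = 64 patches of length 16.
patch_size = 16
patches = channel.reshape(32, 32 // patch_size, patch_size).reshape(-1, patch_size)
print(patches.shape)  # (64, 16)

# Embed each 16-dim patch into a 64-dim space. A random projection
# stands in here for the model's learned embedding layer.
embedding = rng.standard_normal((patch_size, 64))
embedded = patches @ embedding
print(embedded.shape)  # (64, 64)

# Add sinusoidal positional encodings so the Transformer can recover
# each patch's position (antenna row / subcarrier group) in the channel.
positions = np.arange(patches.shape[0])[:, None]  # (64, 1)
dims = np.arange(64)[None, :]                     # (1, 64)
angle = positions / np.power(10000, (2 * (dims // 2)) / 64)
pos_enc = np.where(dims % 2 == 0, np.sin(angle), np.cos(angle))
tokens = embedded + pos_enc
print(tokens.shape)  # (64, 64)
```

In practice the projection matrix is trained end-to-end with the Transformer; the sketch only shows the shapes flowing through the pipeline.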
If you choose to apply **Masked Channel Modeling (MCM)** during inference (by setting `gen_raw=False`), LWM will mask certain patches, as it did during pre-training. However, for standard inference, masking isn't necessary unless you want to test LWM's resilience to noisy inputs.
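To make the masking idea concrete, the toy snippet below zeroes out a random subset of patch tokens. LWM's actual MCM procedure (mask ratio, replacement values, which patches are eligible) lives in the library and may differ; this is only a sketch of what "masking patches" means.

```python
import numpy as np

# 64 patch embeddings of dimension 64, as produced by tokenization.
rng = np.random.default_rng(1)
tokens = rng.standard_normal((64, 64))

# Mask a random 15% of the patches (an assumed ratio for illustration).
mask_ratio = 0.15
num_masked = int(mask_ratio * tokens.shape[0])
masked_idx = rng.choice(tokens.shape[0], size=num_masked, replace=False)

# Replace the masked patches with a zero placeholder token.
masked_tokens = tokens.copy()
masked_tokens[masked_idx] = 0.0

print(num_masked)  # 9 patches masked out of 64
```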
Now, let's move on to tokenizing the dataset and loading the pre-trained LWM model.
```python
from input_preprocess import tokenizer
# ... (intermediate lines omitted in this excerpt) ...
print(f"Loading the LWM model on {device}...")
model = lwm.from_pretrained(device=device)
```
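The `device` variable used above is defined earlier in the README; a typical PyTorch selection (a representative sketch, not a quote of the README's exact line) looks like:

```python
import torch

# Prefer a GPU when one is available, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print(f"Loading the LWM model on {device}...")
```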
With this setup, you're ready to pass your tokenized wireless channels through the pre-trained model, extracting rich, context-aware embeddings that are ready for use in downstream tasks.
---
### 9. **Perform Inference**