Initial upload: KnowForge Encoder (131K params)

Browse files

Files changed (7) hide show

LICENSE +21 -0
README.md +102 -0
best_model.safetensors +3 -0
inference.py +173 -0
model_config.json +8 -0
requirements.txt +2 -0
vocab.json +1 -0

LICENSE ADDED Viewed

	@@ -0,0 +1,21 @@

+MIT License
+Copyright (c) 2026 KnowForge
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

README.md ADDED Viewed

	@@ -0,0 +1,102 @@

+---
+language:
+- en
+- vi
+license: mit
+pipeline_tag: text-classification
+tags:
+- text-classification
+- compositional-reasoning
+- knowforge
+- tiny-model
+---
+# KnowForge Encoder
+A tiny (131K parameter) text classifier trained from scratch on the KnowForge dataset.
+Given a natural-language input prompt, it predicts:
+- **`transform_type`** — which reasoning operation is required
+- **`answer_type`** — what kind of answer to expect
+This model is a fast routing component, not a generative model. It is designed to run in milliseconds on CPU, making it suitable for pre-filtering or routing in a KnowForge inference pipeline.
+---
+## Quick Start
+```bash
+pip install -r requirements.txt
+python inference.py "A is taller than B. B is taller than C. Is A taller than C?"
+# Transform: relation_to_graph (99.12%)
+# Answer type: exact_answer (87.34%)
+```
+```python
+from inference import predict
+result = predict("A is taller than B. B is taller than C. Is A taller than C?")
+print(result["transform_type"])       # "relation_to_graph"
+print(result["transform_confidence"]) # 0.9912
+print(result["answer_type"])          # "exact_answer"
+```
+---
+## What It Classifies
+### Transform types (3 classes)
+| Class | Meaning |
+|---|---|
+| `linear_to_cyclic` | Modular arithmetic in cyclic domains (clocks, calendars, wrap-around) |
+| `relation_to_graph` | Transitive relation query over a directed entity graph |
+| `relation_property_check` | Structural property check on a declared relation system |
+### Answer types (4 classes)
+| Class | Meaning |
+|---|---|
+| `exact_answer` | A single definite value follows from the rules |
+| `conditional_answer` | Answer depends on an unstated condition |
+| `need_more_rule` | Insufficient rules to determine the answer |
+| `unresolvable_without_observation` | Answer requires empirical observation not in the rules |
+---
+## Architecture
+Conv1d text classifier trained entirely from scratch — no pretrained embeddings.
+| Component | Detail |
+|---|---|
+| Embedding | 808 × 64 (word-level, learned) |
+| Encoder | 2 × Conv1d(kernel=3) + ReLU, output dim 128 |
+| Pooling | Global max pooling over sequence |
+| Heads | transform (3), answer_type (4), plus auxiliary heads |
+| Parameters | **131,888** |
+| Training time | ~25 min on CPU |
+---
+## Performance
+Evaluated on dev set after 28 epochs (best checkpoint by dev loss):
+| Metric | Score |
+|---|---|
+| **transform_acc (dev)** | **99.55%** |
+| **atype_acc (dev)** | **99.19%** |
+| transform_acc (train) | 99.66% |
+| atype_acc (train) | 99.37% |
+Transform accuracy on the full test pipeline evaluation: **99.64%**.
+---
+## Limitations
+- **Vocabulary size 808** — trained on KnowForge synthetic text only. Out-of-domain vocabulary falls back to `<UNK>`. Accuracy degrades on very different phrasings.
+- **No context.** The model sees only the raw input text, not the rule structure. It classifies by surface patterns learned from training data.
+- **Not a reasoning model.** This classifier routes queries; it does not solve them. Use KnowForge-0.6B for full answer generation.
+- **Synthetic distribution only.** Tested exclusively on procedurally generated KnowForge examples. Behaviour on real-world inputs is not evaluated.

best_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a978b3722eecc47b2dc780214f20b26788408c43fc46e836eb8cc855f3e7698b
+size 528768

inference.py ADDED Viewed

	@@ -0,0 +1,173 @@

+"""
+KnowForge Encoder — standalone inference.
+Predicts transform_type and answer_type from a KnowForge input prompt.
+CLI:   python inference.py "A cao hơn B, B cao hơn C. A có cao hơn C không?"
+API:   from inference import predict; result = predict("A cao hơn B...")
+"""
+import json
+import re
+import sys
+from pathlib import Path
+from typing import Optional
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+_HERE = Path(__file__).parent
+# ── Label maps (must match training) ────────────────────────────────────────
+TRANSFORM_LABELS = ["linear_to_cyclic", "relation_property_check", "relation_to_graph"]
+ATYPE_LABELS     = ["conditional_answer", "exact_answer", "need_more_rule",
+                    "unresolvable_without_observation"]
+# ── Tokenizer ────────────────────────────────────────────────────────────────
+_TOK_RE = re.compile(r"[\w]+|[^\w\s]", re.UNICODE)
+def _tokenize(text: str) -> list:
+    return _TOK_RE.findall(text.lower())
+# ── Model architecture ───────────────────────────────────────────────────────
+class _MultiTaskEncoder(nn.Module):
+    def __init__(self, vocab_size: int, embed_dim: int = 64,
+                 hidden_dim: int = 64, n_layers: int = 2, dropout: float = 0.3):
+        super().__init__()
+        enc_dim = hidden_dim * 2  # 128
+        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
+        self.dropout   = nn.Dropout(dropout)
+        conv_layers = []
+        in_ch = embed_dim
+        for _ in range(n_layers):
+            conv_layers += [nn.Conv1d(in_ch, enc_dim, 3, padding=1), nn.ReLU()]
+            in_ch = enc_dim
+        self.encoder = nn.Sequential(*conv_layers)
+        self.transform_head   = nn.Linear(enc_dim, len(TRANSFORM_LABELS))
+        self.atype_head       = nn.Linear(enc_dim, len(ATYPE_LABELS))
+        # Unused heads included so state_dict keys match exactly
+        self.etype_head       = nn.Linear(enc_dim, 24)
+        self.uncertainty_head = nn.Linear(enc_dim, 5)
+        self.bio_head         = nn.Linear(enc_dim, 12)
+    def forward(self, token_ids: torch.Tensor) -> dict:
+        x   = self.embedding(token_ids)                          # (B, L, E)
+        x   = self.dropout(x)
+        out = self.encoder(x.transpose(1, 2)).transpose(1, 2)   # (B, L, 128)
+        # Global max pooling over sequence dim
+        pooled = out.max(dim=1).values                           # (B, 128)
+        return {
+            "transform": self.transform_head(pooled),
+            "atype":     self.atype_head(pooled),
+        }
+# ── Lazy singleton loader ────────────────────────────────────────────────────
+_encoder: Optional[_MultiTaskEncoder] = None
+_vocab:   Optional[dict] = None
+def _load():
+    global _encoder, _vocab
+    if _encoder is not None:
+        return _encoder, _vocab
+    vocab_path = _HERE / "vocab.json"
+    cfg_path   = _HERE / "model_config.json"
+    sf_path    = _HERE / "best_model.safetensors"
+    pt_path    = _HERE / "best_model.pt"
+    if not vocab_path.exists():
+        raise FileNotFoundError(f"vocab.json not found at {vocab_path}")
+    _vocab = json.load(open(vocab_path))
+    cfg = json.load(open(cfg_path)) if cfg_path.exists() else {}
+    model = _MultiTaskEncoder(
+        vocab_size = cfg.get("vocab_size", len(_vocab)),
+        embed_dim  = cfg.get("embed_dim",  64),
+        hidden_dim = cfg.get("hidden_dim", 64),
+        n_layers   = cfg.get("n_layers",   2),
+        dropout    = cfg.get("dropout",    0.3),
+    )
+    if sf_path.exists():
+        from safetensors.torch import load_file
+        state = load_file(str(sf_path))
+    elif pt_path.exists():
+        state = torch.load(str(pt_path), map_location="cpu", weights_only=True)
+    else:
+        raise FileNotFoundError(f"No model weights found at {sf_path} or {pt_path}")
+    model.load_state_dict(state)
+    model.eval()
+    _encoder = model
+    return _encoder, _vocab
+# ── Public API ───────────────────────────────────────────────────────────────
+def predict(text: str) -> dict:
+    """
+    Predict transform_type and answer_type for a KnowForge input.
+    Args:
+        text: Natural-language input (rules + question or question alone).
+    Returns:
+        {
+            "transform_type":       str   — one of linear_to_cyclic /
+                                            relation_property_check /
+                                            relation_to_graph,
+            "transform_confidence": float — softmax probability [0,1],
+            "answer_type":          str   — one of conditional_answer /
+                                            exact_answer /
+                                            need_more_rule /
+                                            unresolvable_without_observation,
+            "atype_confidence":     float,
+        }
+    """
+    model, vocab = _load()
+    toks = _tokenize(text)
+    ids  = [vocab.get(t, vocab.get("<UNK>", 1)) for t in toks] or [0]
+    x    = torch.tensor([ids], dtype=torch.long)  # (1, L)
+    with torch.no_grad():
+        logits = model(x)
+    t_probs = F.softmax(logits["transform"][0], dim=-1)
+    a_probs = F.softmax(logits["atype"][0],     dim=-1)
+    t_idx = int(t_probs.argmax())
+    a_idx = int(a_probs.argmax())
+    return {
+        "transform_type":       TRANSFORM_LABELS[t_idx],
+        "transform_confidence": round(float(t_probs[t_idx]), 4),
+        "answer_type":          ATYPE_LABELS[a_idx],
+        "atype_confidence":     round(float(a_probs[a_idx]), 4),
+    }
+def _main():
+    if len(sys.argv) < 2:
+        print("Usage: python inference.py \"<input text>\"")
+        sys.exit(1)
+    text   = " ".join(sys.argv[1:])
+    result = predict(text)
+    print(f"Transform: {result['transform_type']} ({result['transform_confidence']:.2%})")
+    print(f"Answer type: {result['answer_type']} ({result['atype_confidence']:.2%})")
+if __name__ == "__main__":
+    _main()

model_config.json ADDED Viewed

	@@ -0,0 +1,8 @@

+{
+  "vocab_size": 808,
+  "embed_dim": 64,
+  "hidden_dim": 64,
+  "n_layers": 2,
+  "dropout": 0.3,
+  "param_count": 131888
+}

requirements.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ torch>=2.0.0
2	+ safetensors>=0.4.0

vocab.json ADDED Viewed

	@@ -0,0 +1 @@

+ {"<PAD>": 0, "<UNK>": 1, "tính": 2, "chất": 3, ":": 4, "nhóm": 5, "1": 6, "liên": 7, "quan": 8, "đến": 9, "2": 10, ".": 11, "3": 12, "theo": 13, "định": 14, "nghĩa": 15, ",": 16, "bắc": 17, "cầu": 18, "áp": 19, "dụng": 20, "có": 21, "không": 22, "?": 23, "chỉ": 24, "cho": 25, "cặp": 26, "trực": 27, "tiếp": 28, "màn": 29, "hình": 30, "6": 31, "kênh": 32, "(": 33, "–": 34, ")": 35, "qua": 36, "về": 37, "đang": 38, "ở": 39, "chuyển": 40, "4": 41, "giá": 42, "trị": 43, "khu": 44, "a": 45, "=": 46, "76": 47, ">": 48, "b": 49, "72": 50, "c": 51, "67": 52, "nhiều": 53, "hơn": 54, "thắng": 55, "trong": 56, "trận": 57, "đấu": 58, "lần": 59, "hai": 60, "độc": 61, "lập": 62, "chắc": 63, "bộ": 64, "đếm": 65, "trạng": 66, "thái": 67, "sau": 68, "là": 69, "thêm": 70, "bước": 71, "đâu": 72, "tăng": 73, "đều": 74, "nhà": 75, "máy": 76, "<": 77, "chiến": 78, "kết": 79, "nối": 80, "→": 81, "biến": 82, "số": 83, "z": 84, ";": 85, "đảm": 86, "bảo": 87, "đợt": 88, "luân": 89, "phiên": 90, "đơn": 91, "vị": 92, "làm": 93, "nào": 94, "thứ": 95, "tự": 96, "xuất": 97, "hiện": 98, "đội": 99, "đầu": 100, "tiên": 101, "đó": 102, "rồi": 103, "lượt": 104, "và": 105, "ban": 106, "so": 107, "hiệu": 108, "quả": 109, "sang": 110, "mức": 111, "tiêu": 112, "thụ": 113, "năng": 114, "lượng": 115, "thước": 116, "đo": 117, "thay": 118, "đổi": 119, "trình": 120, "lặp": 121, "8": 122, "5": 123, "7": 124, "trí": 125, "tiến": 126, "hệ": 127, "tuyến": 128, "phần": 129, "tử": 130, "ngang": 131, "nhau": 132, "mọi": 133, "sánh": 134, "được": 135, "hùng": 136, "hàng": 137, "kiên": 138, "lâm": 139, "phản": 140, "ví": 141, "dụ": 142, "nhưng": 143, "trường": 144, "hợp": 145, "p": 146, "hỏi": 147, "10": 148, "liệu": 149, "phong": 150, "97": 151, "quang": 152, "93": 153, "sơn": 154, "88": 155, "vòng": 156, "vượt": 157, "mỗi": 158, "điều": 159, "kiện": 160, "riêng": 161, "phép": 162, "toán": 163, "-": 164, "+": 165, "mod": 166, "mốc": 167, "thời": 168, "gian": 169, "nam": 170, "trước": 171, "nhất": 172, "minh": 173, "long": 174, "gần": 175, "nằm": 176, "ổn": 177, "cũng": 178, "đã": 179, "xảy": 180, "ra": 181, "chu": 182, "kỳ": 183, "toàn": 184, "tập": 185, "hoa": 186, "lan": 187, "mai": 188, "vực": 189, "phân": 190, "xưởng": 191, "y": 192, "bối": 193, "cảnh": 194, "cục": 195, "tròn": 196, "từ": 197, "đi": 198, "chiều": 199, "thuận": 200, "cao": 201, "thấp": 202, "uy": 203, "dẫn": 204, "vinh": 205, "xuân": 206, "cuối": 207, "bảng": 208, "dùng": 209, "khả": 210, "mở": 211, "rộng": 212, "kinh": 213, "tế": 214, "quán": 215, "q1": 216, "81": 217, "q2": 218, "75": 219, "q3": 220, "70": 221, "điểm": 222, "thống": 223, "kê": 224, "mẫu": 225, "x": 226, "khác": 227, "công": 228, "thức": 229, "mới": 230, "với": 231, "căn": 232, "ven": 233, "sông": 234, "dài": 235, "điện": 236, "thoại": 237, "đời": 238, "cũ": 239, "chiếc": 240, "xe": 241, "màu": 242, "đỏ": 243, "thiết": 244, "bị": 245, "thử": 246, "nghiệm": 247, "thể": 248, "luận": 249, "chưa": 250, "chuỗi": 251, "bình": 252, "an": 253, "đảo": 254, "ngược": 255, "luôn": 256, "giành": 257, "tích": 258, "linh": 259, "hoạt": 260, "độ": 261, "chính": 262, "xác": 263, "ghế": 264, "9": 265, "dịch": 266, "khoảng": 267, "cách": 268, "mùa": 269, "hè": 270, "môi": 271, "nói": 272, "chung": 273, "dãy": 274, "tại": 275, "của": 276, "xét": 277, "rõ": 278, "/": 279, "cộng": 280, "nếu": 281, "thì": 282, "quy": 283, "tắc": 284, "đề": 285, "cập": 286, "ca": 287, "xoay": 288, "tuyên": 289, "bố": 290, "ràng": 291, "tam": 292, "đoạn": 293, "bất": 294, "ưu": 295, "hoặc": 296, "total": 297, "order": 298, "trọng": 299, "tỷ": 300, "lệ": 301, "thành": 302, "biên": 303, "tốc": 304, "chậm": 305, "m1": 306, "m2": 307, "m3": 308, "m4": 309, "lịch": 310, "ghi": 311, "nhận": 312, "thuần": 313, "sát": 314, "suy": 315, "circular": 316, "array": 317, "index": 318, "phía": 319, "xếp": 320, "hạng": 321, "đạt": 322, "phú": 323, "giang": 324, "yếu": 325, "tố": 326, "trung": 327, "m": 328, "top": 329, "nút": 330, "gây": 331, "tầng": 332, "tổng": 333, "họa": 334, "82": 335, "78": 336, "73": 337, "nhỏ": 338, "lớn": 339, "ghép": 340, "lẻ": 341, "sân": 342, "khấu": 343, "chỗ": 344, "ngồi": 345, "84": 346, "khái": 347, "quát": 348, "trội": 349, "địa": 350, "dự": 351, "án": 352, "bắt": 353, "89": 354, "83": 355, "suất": 356, "đồng": 357, "hồ": 358, "quay": 359, "lại": 360, "77": 361, "74": 362, "slot": 363, "chi": 364, "phí": 365, "nguyên": 366, "này": 367, "nhớ": 368, "ô": 369, "con": 370, "trỏ": 371, "cố": 372, "cứ": 373, "item": 374, "80": 375, "68": 376, "tình": 377, "huống": 378, "tác": 379, "động": 380, "gián": 381, "thực": 382, "mô": 383, "lý": 384, "thuyết": 385, "ước": 386, "0": 387, "điệu": 388, "loại": 389, "chênh": 390, "kéo": 391, "xây": 392, "dựng": 393, "lệch": 394, "giữa": 395, "tuy": 396, "nhiên": 397, "automaton": 398, "cạnh": 399, "biển": 400, "sâu": 401, "alpha": 402, "sao": 403, "hay": 404, "gió": 405, "lửa": 406, "núi": 407, "tới": 408, "xử": 409, "kho": 410, "sắp": 411, "ngọc": 412, "đứng": 413, "trên": 414, "phát": 415, "quỳnh": 416, "fsm": 417, "tuần": 418, "hoàn": 419, "giảm": 420, "dần": 421, "sử": 422, "năm": 423, "nữa": 424, "ảnh": 425, "hưởng": 426, "cấp": 427, "thẳng": 428, "thanh": 429, "tràn": 430, "đường": 431, "đua": 432, "dừng": 433, "biết": 434, "gói": 435, "enterprise": 436, "ít": 437, "model": 438, "basic": 439, "pro": 440, "clb": 441, "đất": 442, "rừng": 443, "khi": 444, "giống": 445, "p1": 446, "98": 447, "p2": 448, "95": 449, "p3": 450, "87": 451, "2019": 452, "2011": 453, "2006": 454, "bền": 455, "đây": 456, "phải": 457, "thắm": 458, "uyên": 459, "vân": 460, "thiếu": 461, "khoản": 462, "tú": 463, "ánh": 464, "tất": 465, "cả": 466, "q10": 467, "q6": 468, "vào": 469, "team": 470, "gamma": 471, "beta": 472, "91": 473, "92": 474, "đúng": 475, "giai": 476, "cùng": 477, "mảng": 478, "[": 479, "]": 480, "danh": 481, "sách": 482, "khép": 483, "kín": 484, "mục": 485, "nhảy": 486, "hữu": 487, "hạn": 488, "ký": 489, "nên": 490, "đầy": 491, "đủ": 492, "luật": 493, "lô": 494, "đặc": 495, "liền": 496, "cơ": 497, "chế": 498, "≠": 499, "từng": 500, "đỉnh": 501, "đáy": 502, "2010": 503, "2001": 504, "1994": 505, "dữ": 506, "thô": 507, "thấy": 508, "phụ": 509, "thuộc": 510, "đối": 511, "bằng": 512, "chứng": 513, "vs": 514, "11": 515, "thông": 516, "nhân": 517, "t": 518, "đêm": 519, "2018": 520, "2015": 521, "2007": 522, "gì": 523, "học": 524, "tài": 525, "đánh": 526, "bại": 527, "gặp": 528, "q9": 529, "kế": 530, "thừa": 531, "79": 532, "71": 533, "cần": 534, "thơ": 535, "nặng": 536, "đà": 537, "lạt": 538, "hải": 539, "phòng": 540, "tre": 541, "d": 542, "r": 543, "mây": 544, "nova": 545, "quân": 546, "rẻ": 547, "dũng": 548, "nha": 549, "trang": 550, "buôn": 551, "ma": 552, "thuột": 553, "hà": 554, "nội": 555, "buổi": 556, "sáng": 557, "thí": 558, "85": 559, "đông": 560, "102": 561, "94": 562, "ngoài": 563, "12": 564, "dòng": 565, "2009": 566, "2013": 567, "v": 568, "áo": 569, "len": 570, "xa": 571, "tủ": 572, "đồ": 573, "gỗ": 574, "sồi": 575, "lược": 576, "ii": 577, "option": 578, "hoạch": 579, "phức": 580, "tạp": 581, "vận": 582, "hành": 583, "hạnh": 584, "thảo": 585, "nóng": 586, "bản": 587, "2028": 588, "2020": 589, "2012": 590, "q": 591, "nhanh": 592, "nẵng": 593, "xanh": 594, "q8": 595, "q5": 596, "90": 597, "phương": 598, "truyền": 599, "đệm": 600, "nêu": 601, "2014": 602, "2004": 603, "2002": 604, "max": 605, "version": 606, "a17": 607, "việt": 608, "xứng": 609, "2022": 610, "2003": 611, "2021": 612, "k": 613, "96": 614, "huế": 615, "ngắn": 616, "nhơn": 617, "c1": 618, "c2": 619, "c3": 620, "c4": 621, "thu": 622, "86": 623, "69": 624, "phúc": 625, "ngoại": 626, "khía": 627, "chuẩn": 628, "nano": 629, "bao": 630, "gồm": 631, "đắt": 632, "tiền": 633, "quý": 634, "hiếm": 635, "2017": 636, "2008": 637, "99": 638, "2027": 639, "1997": 640, "64": 641, "q0": 642, "tháng": 643, "chẵn": 644, "ba": 645, "100": 646, "tốt": 647, "v1": 648, "v2": 649, "v3": 650, "v4": 651, "tả": 652, "k1": 653, "k2": 654, "k3": 655, "k4": 656, "66": 657, "13": 658, "2016": 659, "vụ": 660, "doanh": 661, "nghiệp": 662, "hàm": 663, "subset": 664, "lớp": 665, "1996": 666, "2000": 667, "65": 668, "tuổi": 669, "nhi": 670, "giờ": 671, "thường": 672, "biệt": 673, "2024": 674, "quyển": 675, "bìa": 676, "cứng": 677, "1998": 678, "ngày": 679, "muộn": 680, "khởi": 681, "diễn": 682, "bán": 683, "nhẹ": 684, "1995": 685, "dày": 686, "dạn": 687, "ngân": 688, "oanh": 689, "i": 690, "tùng": 691, "101": 692, "sự": 693, "q7": 694, "q4": 695, "g1": 696, "g2": 697, "g3": 698, "g4": 699, "2025": 700, "g": 701, "h": 702, "1990": 703, "1993": 704, "2026": 705, "sớm": 706, "xế": 707, "tối": 708, "r1": 709, "r2": 710, "r3": 711, "r4": 712, "một": 713, "thang": 714, "duy": 715, "'": 716, "quen": 717, "trò": 718, "chơi": 719, "trục": 720, "để": 721, "—": 722, "ngữ": 723, "bài": 724, "tranh": 725, "strict": 726, "partial": 727, "cung": 728, "tin": 729, "transitivity": 730, "kiểm": 731, "soát": 732, "hạ": 733, "matchup": 734, "trễ": 735, "transitive": 736, "thích": 737, "yêu": 738, "các": 739, "hỗ": 740, "trợ": 741, "closure": 742, "thi": 743, "bổ": 744, "sung": 745, "tưởng": 746, "cycle": 747, "quyết": 748, "chốt": 749, "u": 750, "w": 751, "tường": 752, "j": 753, "cụ": 754, "trưa": 755, "thua": 756, "21": 757, "24": 758, "nhiêu": 759, "lùi": 760, "32": 761, "24h": 762, "00": 763, "tiếng": 764, "mấy": 765, "buffer": 766, "15": 767, "bội": 768, "bây": 769, "56": 770, "18": 771, "30": 772, "kim": 773, "vạch": 774, "23": 775, "thúc": 776, "22": 777, "46": 778, "25": 779, "26": 780, "ring": 781, "17": 782, "16": 783, "đèn": 784, "báo": 785, "36": 786, "60": 787, "hướng": 788, "la": 789, "bàn": 790, "°": 791, "kwh": 792, "12h": 793, "14": 794, "20": 795, "bậc": 796, "lên": 797, "nấc": 798, "47": 799, "19": 800, "33": 801, "âm": 802, "tục": 803, "sẽ": 804, "28": 805, "vàng": 806, "tây": 807}