์•„์ฃผ ์‹คํ—˜์ ์ธ ์ƒ๊ฐ.

UZR-Lastest (Luria 3brains Meta Runner)

UZR-Lastest๋Š” GitHub ์ €์žฅ์†Œ 10kseason/uzr์— ์žˆ๋Š”
โ€œ๋ฃจ๋ฆฌ์•„ 3brains ๋ฉ”ํƒ€ ๋Ÿฌ๋„ˆ(UZR)โ€์˜ ์ตœ์‹  PyTorch ์ฒดํฌํฌ์ธํŠธ๋ฅผ ๋ชจ์•„๋‘” ๊ณต๊ฐ„์ž…๋‹ˆ๋‹ค.

โš ๏ธ ์—ฐ๊ตฌ์šฉ ํ”„๋กœํ† ํƒ€์ž…์ž…๋‹ˆ๋‹ค.

  • ์•ˆ์ •์„ฑยท์ผ๋ฐ˜ ์„ฑ๋Šฅยท์ง€์†์ ์ธ ์œ ์ง€๋ณด์ˆ˜๋Š” ๋ณด์žฅ๋˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.
  • pickle ํฌ๋งท์„ ์‚ฌ์šฉํ•˜๊ธฐ ๋•Œ๋ฌธ์—, ์‹ ๋ขฐํ•˜๋Š” ํ™˜๊ฒฝ์—์„œ๋งŒ ๋กœ๋“œํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.

Files

์ด ์ €์žฅ์†Œ์—๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์€ PyTorch ์ฒดํฌํฌ์ธํŠธ๊ฐ€ ํฌํ•จ๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค:โ€‹

  • uzr_3brains_ckpt.pt (~48.7 MB)
  • uzr_3brains_ckpt_best.pt (~107 MB)
  • uzr_3brains_ckpt_last.pt (~107 MB)

๋ชจ๋“  ํŒŒ์ผ์€ torch.load()๋กœ ๋กœ๋“œ๋˜๋Š” pickle ๊ธฐ๋ฐ˜ ์ฒดํฌํฌ์ธํŠธ์ด๋ฉฐ,
๋‚ด๋ถ€์—์„œ ๋‹ค์Œ Python ํƒ€์ž…์„ importํ•ฉ๋‹ˆ๋‹ค:โ€‹

  • uzr.memory.MemoryItem
  • torch.device, torch.FloatStorage, torch._utils._rebuild_tensor_v2
  • collections.OrderedDict

๋”ฐ๋ผ์„œ, ์‚ฌ์šฉ ์‹œ์—๋Š” GitHub ์ €์žฅ์†Œ๋ฅผ ํ•จ๊ป˜ ํด๋ก ํ•˜๊ฑฐ๋‚˜
uzr/ ๋””๋ ‰ํ„ฐ๋ฆฌ๊ฐ€ Python path์— ์˜ฌ๋ผ์™€ ์žˆ์–ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.


What is UZR?

์งง๊ฒŒ ๋งํ•˜๋ฉด, UZR๋Š” โ€œ์ž‘์€ Transformer ์ธ์ฝ”๋”์— 3๊ฐœ์˜ latent ๋ธŒ๋ ˆ์ธ๊ณผ ์••์ถ• ๋ฉ”๋ชจ๋ฆฌ๋ฅผ ๋ถ™์ธ ๋ฉ”ํƒ€ ๋Ÿฌ๋„ˆโ€์ž…๋‹ˆ๋‹ค.

GitHub README ๊ธฐ์ค€์œผ๋กœ, UZR๋Š” ๋‹ค์Œ ์š”์†Œ๋“ค๋กœ ๊ตฌ์„ฑ๋ฉ๋‹ˆ๋‹ค:

  • 3brains latent space

    • ๋น ๋ฅธ ๊ทœ์น™ยท์ง€์‹์šฉ z_rule (inner-step์—์„œ ๋น ๋ฅด๊ฒŒ ์ ์‘)
    • ๋А๋ฆฐ ์–ธ์–ด/๋…ผ๋ฆฌ์šฉ z_slow_lang, z_slow_logic + ๋‘˜์„ ์ž‡๋Š” z_bridge
    • ์‚ฌ๊ณ  ๋ณด์กฐ์šฉ z_think
  • Identity & Intent

    • identity_self / identity_intent ๋ฒกํ„ฐ
    • identity_intent_control()์ด ๋‚ด๋†“๋Š” (bias, toggle)๋กœ
      โ–ธ inner-step ํšŸ์ˆ˜
      โ–ธ top-k / temperature
      โ–ธ ๋ฉ”๋ชจ๋ฆฌ ์“ฐ๊ธฐ ๊ฒŒ์ดํŠธ
      โ–ธ abstain ์—ฌ๋ถ€
      ๋ฅผ ํ•จ๊ป˜ ์ œ์–ดํ•ฉ๋‹ˆ๋‹ค.
  • Self-Eval & Abstain

    • conf / entropy / Brier ์Šค์ฝ”์–ด ๊ธฐ๋ฐ˜ ์ž๊ธฐ ํ‰๊ฐ€ ํ—ค๋“œ
    • โ€œํ™•์‹ ์ด ์—†์œผ๋ฉด ๊ฑฐ๋ถ€ํ•˜๊ฑฐ๋‚˜ ์•ฝํ•˜๊ฒŒ๋งŒ ํ•™์Šตโ€ํ•˜๋„๋ก ์„ค๊ณ„๋œ lossยท๊ฒŒ์ดํŠธ
  • CompressedMemory

    • surprise / entropy / ์ค‘๋ณต๋„ / ๊ทผ์ ‘๋„ / ๋ฒ„ํ‚ท ์ •์ฑ…์œผ๋กœ โ€œ์–ธ์ œ ์“ธ์ง€โ€๋ฅผ ์„ ํƒ
    • shadow bank, tail bucket, rebalance, learner(์˜ˆ์ธก๊ธฐ)๋ฅผ ํฌํ•จํ•œ ์žฅ๊ธฐ ์••์ถ• ๋ฉ”๋ชจ๋ฆฌ ๋‡Œ
  • NPU(QNN) / ORT ์—”์ง„ (์˜ต์…˜)

    • PyTorch ํŒŒ๋ผ๋ฏธํ„ฐ๋Š” ๊ทธ๋Œ€๋กœ ๋‘๊ณ , ONNX(QDQ) INT8 + QNN์œผ๋กœ ์ถ”๋ก ๋งŒ ์˜คํ”„๋กœ๋”ฉ
    • npu/runtime_ort.py, npu/engine.py์—์„œ ์—”์ง„ ํ† ๊ธ€ ๋ฐ ์ปจํ…์ŠคํŠธ ์บ์‹œ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

์ž์„ธํ•œ ๊ตฌ์กฐ์™€ ํ•™์Šต/์ถ”๋ก  ํŒŒ์ดํ”„๋ผ์ธ์€ GitHub README์— ์ •๋ฆฌ๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค.


Intended use

์ด ์ฒดํฌํฌ์ธํŠธ๋Š” ์—ฐ๊ตฌยท๊ฐœ์ธ ์‹คํ—˜ยท์•„์ด๋””์–ด ํ”„๋กœํ† ํƒ€์ดํ•‘์„ ๋ชฉ์ ์œผ๋กœ ํ•ฉ๋‹ˆ๋‹ค.

์˜ˆ์‹œ ์šฉ๋„:

  • ์žฅ๊ธฐ ์„ธ์…˜ ๋™์•ˆ
    • Self-Eval / Abstain ์‹ ํ˜ธ๊ฐ€ ์–ด๋–ป๊ฒŒ ์›€์ง์ด๋Š”์ง€,
    • ๋ฉ”๋ชจ๋ฆฌ ๋ฒ„ํ‚ท(shadow / tail / rebalance)์ด ์–ด๋–ป๊ฒŒ ์ฑ„์›Œ์ง€๋Š”์ง€,
    • identity intent๊ฐ€ ์ถ”๋ก  ๊ณผ์ •์„ ์–ด๋–ป๊ฒŒ ๋ฐ”๊พธ๋Š”์ง€
      ๋ฅผ ๊ด€์ฐฐํ•˜๋Š” ์‹คํ—˜
  • โ€œ์ž‘์€ ๋ชจ๋ธ + ์••์ถ• ๋ฉ”๋ชจ๋ฆฌ + ๋ฉ”ํƒ€ ๋Ÿฌ๋„ˆโ€ ๊ตฌ์กฐ๋ฅผ ์ฐธ๊ณ ํ•˜์—ฌ
    ๋‹ค๋ฅธ ํ”„๋กœ์ ํŠธ์— ์‘์šฉํ•˜๋Š” ์šฉ๋„
  • NPU(QNN) + ONNX Runtime ํ™˜๊ฒฝ์—์„œ ๋ฉ”๋ชจ๋ฆฌ ๋‹ฌ๋ฆฐ ๋Ÿฌ๋„ˆ๋ฅผ ํ…Œ์ŠคํŠธํ•˜๋Š” ์šฉ๋„

๋น„๊ถŒ์žฅ ์‚ฌ์šฉ

  • ์ผ๋ฐ˜ ์‚ฌ์šฉ์ž ๋Œ€์ƒ ํ”„๋กœ๋•์…˜ ์„œ๋น„์Šค
  • ๊ฐ•ํ•œ ์•ˆ์ „/์ •ํ™•๋„๊ฐ€ ์š”๊ตฌ๋˜๋Š” ์‘์šฉ (์˜ˆ: ์˜๋ฃŒ, ๊ธˆ์œต, ๋ฒ•๋ฅ  ์˜์‚ฌ๊ฒฐ์ •)
  • ๋Œ€๊ทœ๋ชจ RLHF๊ฐ€ ๋ถ™์€ ๋ฒ”์šฉ ์ฑ—๋ด‡ ๋Œ€์ฒด ์šฉ๋„

How to load

  1. GitHub ์ €์žฅ์†Œ ํด๋ก :
  2. Python path์— uzr/๊ฐ€ ๋ณด์ด๋„๋ก ์„ค์ •ํ•ฉ๋‹ˆ๋‹ค.
  3. PyTorch์—์„œ torch.load("uzr_3brains_ckpt_*.pt")๋ฅผ ์‚ฌ์šฉํ•ด ์ฒดํฌํฌ์ธํŠธ๋ฅผ ๋กœ๋“œํ•ฉ๋‹ˆ๋‹ค.
    • ๋‚ด๋ถ€์— uzr.model.UZRModel, uzr.memory.MemoryItem ๋“ฑ์ด ๋“ฑ์žฅํ•˜๋ฏ€๋กœ,
      ๋™์ผํ•œ ์ฝ”๋“œ๋ฒ ์ด์Šค๋ฅผ ํ•จ๊ป˜ ๋ถˆ๋Ÿฌ์™€์•ผ ํ•ฉ๋‹ˆ๋‹ค.

๊ตฌ์ฒด์ ์ธ ์‚ฌ์šฉ ์˜ˆ์‹œ๋Š” GitHub ์ชฝ chat_cli.py, infer_longrun_*.py, uzr_live.py ๋“ฑ์„ ์ฐธ๊ณ ํ•˜๋Š” ๊ฒƒ์ด ๊ฐ€์žฅ ์•ˆ์ „ํ•ฉ๋‹ˆ๋‹ค.


License

  • ์ฝ”๋“œ์™€ ์ฒดํฌํฌ์ธํŠธ๋Š” ๋ชจ๋‘ MIT License๋ฅผ ๋”ฐ๋ฆ…๋‹ˆ๋‹ค.
  • KOBERT + KMMLU_KO + TASK.py Codebook.py๋งŒ ์‚ฌ์šฉ๋˜์–ด ํŠธ๋ ˆ์ด๋‹๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
  • ์ž์œ ๋กญ๊ฒŒ fork / ์ˆ˜์ • / ์žฌ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์ง€๋งŒ,
    ์•ˆ์ „ยทํ’ˆ์งˆยท์œ ์ง€๋ณด์ˆ˜๋Š” ์ „์ ์œผ๋กœ ์‚ฌ์šฉ์ž ์ฑ…์ž„์ž…๋‹ˆ๋‹ค.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support