add logitprocessor

#5
by leejunhyeok - opened
Motif Technologies org
No description provided.

Please update the README as well (vllm and parser usage).

Motif Technologies org

Please also fix the yarn settings (scale factor 2, max len 131072) for vllm serve.

Motif Technologies org

Rather than hardcoding constants, it would be better to turn them into named variables so their meaning is clear.
e.g.
ngrams = [tuple(input_ids[i:i+n]) for i in range(0, len(input_ids) - n + 1, 256)]
freq = Counter(ngrams)
return {ng: c for ng, c in freq.items() if c > 7}

256: search_window
7: freq_threshold
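
A minimal sketch of what the named-constant version could look like. The function name `find_repeated_ngrams` and the idea of exposing the two values as keyword arguments are assumptions for illustration, not the PR's actual code; `search_window` and `freq_threshold` are the names suggested above.

```python
from collections import Counter

# Names follow the suggestion above; the values are the ones hardcoded in the snippet.
SEARCH_WINDOW = 256   # step between sampled n-gram start positions
FREQ_THRESHOLD = 7    # an n-gram is reported only if it occurs more than this many times


def find_repeated_ngrams(input_ids, n,
                         search_window=SEARCH_WINDOW,
                         freq_threshold=FREQ_THRESHOLD):
    """Return {ngram: count} for sampled n-grams seen more than freq_threshold times."""
    ngrams = [
        tuple(input_ids[i:i + n])
        for i in range(0, len(input_ids) - n + 1, search_window)
    ]
    freq = Counter(ngrams)
    return {ng: c for ng, c in freq.items() if c > freq_threshold}
```

With the defaults the behavior matches the quoted snippet; callers and tests can pass smaller values without touching the function body.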

In ThinkLogitsProcessor, ratio does not seem to be used anywhere. Is it needed somewhere?

Never mind, the PR was displayed truncated on my end, haha.

Motif Technologies org

logits = torch.full_like(logits, torch.finfo(torch.bfloat16).min)
Even if logits is always bf16, it would be better to take the min of the dtype of logits itself.
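
A short sketch of that change, assuming logits is a floating-point torch tensor. The helper name `mask_all_logits` is hypothetical and only serves the illustration.

```python
import torch


def mask_all_logits(logits: torch.Tensor) -> torch.Tensor:
    """Return a tensor shaped like `logits`, filled with the lowest finite value of its own dtype."""
    # Read the dtype from the tensor instead of assuming torch.bfloat16.
    min_value = torch.finfo(logits.dtype).min
    return torch.full_like(logits, min_value)


# The fill value now follows whatever dtype the logits arrive in.
for dtype in (torch.bfloat16, torch.float16, torch.float32):
    masked = mask_all_logits(torch.zeros(2, 4, dtype=dtype))
    assert masked.dtype == dtype
```

This keeps the current behavior for bf16 and stays valid if the serving dtype changes later.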

Motif Technologies org

What form does past_token_ids arrive in?
If the generated tokens keep getting concatenated onto it, then

ngrams = [tuple(input_ids[i:i+n]) for i in range(0, len(input_ids) - n + 1, WINDOW_SIZE)]
seems to do a lot of redundant checking. I don't think it needs to start from 0 every time.
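
One way this could be done, sketched under the assumption that past_token_ids only ever grows by appending new tokens (which is exactly the open question above). The class `IncrementalNgramCounter` and its attributes are hypothetical names, not part of the PR.

```python
from collections import Counter


class IncrementalNgramCounter:
    """Counts sampled n-grams without rescanning positions already seen.

    Assumes the token list passed to update() is append-only between calls.
    """

    def __init__(self, n, window_size):
        self.n = n
        self.window_size = window_size
        self.counts = Counter()
        self.next_start = 0  # first start position that has not been counted yet

    def update(self, token_ids):
        last_start = len(token_ids) - self.n  # last index where a full n-gram fits
        while self.next_start <= last_start:
            i = self.next_start
            self.counts[tuple(token_ids[i:i + self.n])] += 1
            self.next_start += self.window_size
        return self.counts
```

Each call only touches start positions that became available since the previous call, so the counts end up identical to a full recount from 0 while the per-step cost stays small. One counter instance would be needed per request.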

I'm not sure I understood this correctly, but
ratio and ngram look independent of each other. Is that right?
If no budget remains, think_end should be triggered regardless of the ngram check.
In that case, it seems better to do the ratio check first and then, only if there is remaining budget, run the ngram check when len(past_token_ids) % self.interval == 0.
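
A rough sketch of that ordering. Everything here is an assumption for illustration: the function `should_end_thinking`, its parameters, and in particular the reading of ratio as the fraction of `max_tokens` reserved for thinking. The n-gram check is passed in as a callable so the sketch does not depend on the PR's implementation.

```python
def should_end_thinking(past_token_ids, max_tokens, ratio, interval, has_repeated_ngrams):
    """Decide whether think_end should be forced at this step."""
    # Assumption: ratio caps thinking at a fraction of max_tokens.
    thinking_budget = int(max_tokens * ratio)

    # 1. Budget check first: with no budget left, end thinking regardless of n-grams.
    if len(past_token_ids) >= thinking_budget:
        return True

    # 2. Budget remains: run the n-gram check only every `interval` tokens.
    if len(past_token_ids) % interval == 0:
        return has_repeated_ngrams(past_token_ids)

    return False
```

Written this way the two conditions stay independent: the budget limit always applies, and the repetition check is only an early exit while budget remains.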

Is ratio a concept that is commonly used in logit processors?
If not, it would be good for the README to explain what ratio means.
Even if it is a common concept, it is a variable that can be controlled from outside, so a README explanation still seems worthwhile, haha.

  • A more descriptive variable name such as thinking_ratio would be better than ratio, haha.
Ready to merge
This branch is ready to get merged automatically.
