wl-ko-ner-v5

Korean 6-type NER (PS / LC / OG / DT / TI / QT): a KoELECTRA-base-v3 fine-tune exported to ONNX (fp32 + fp16). It is adapted to the roleplay/narrative domain using distant supervision from multi-bot lorebook canon, a gazetteer of Korean place names, regexes for DT/TI/QT, and a KLUE mix (zero LLM labeling cost).

Performance (entity-level micro-F1, exact span + type)

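  • KLUE-NER dev: 0.821, i.e. zero regression on the base domain.
  • Held-out set from the author's own works, core (PS/OG/LC): 0.987.
  • Generalization to 3 unseen worlds (full pipeline): all 0.797 / core 0.773, within the normal range for cross-domain NER (at the upper end compared with CrossNER's 0.63 to 0.74).
  • Per-type scores come from a small held-out sample (n=67), so treat them as directional indicators only.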
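
The metric named above can be sketched as follows. This is a minimal illustration of entity-level micro-F1 with exact span + type matching, not the evaluation script used for the numbers in this card; the `Span` tuple layout and the `micro_f1` name are assumptions made for the example.

```python
from typing import Iterable, List, Set, Tuple

# Hypothetical span layout: (start, end, type) with character offsets.
Span = Tuple[int, int, str]


def micro_f1(gold: Iterable[List[Span]], pred: Iterable[List[Span]]) -> float:
    """Entity-level micro-F1: a prediction counts only if span and type both match."""
    tp = fp = fn = 0
    for gold_spans, pred_spans in zip(gold, pred):
        g: Set[Span] = set(gold_spans)
        p: Set[Span] = set(pred_spans)
        tp += len(g & p)   # exact matches
        fp += len(p - g)   # predicted but not in gold (wrong span or wrong type)
        fn += len(g - p)   # gold entities that were missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Two sentences: one entity has the right span but the wrong type.
gold = [[(0, 3, "PS"), (5, 7, "LC")], [(2, 4, "OG")]]
pred = [[(0, 3, "PS"), (5, 7, "OG")], [(2, 4, "OG")]]
score = micro_f1(gold, pred)
```

Counts are pooled over all sentences before precision and recall are computed, which is what makes the score "micro"; a right-span, wrong-type prediction costs both a false positive and a false negative.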

Files

  • wl-ko-ner-v5-fp32.onnx (~429MB): server/mobile (fused fp32, quality reference).
  • wl-ko-ner-v5-fp16.onnx (~215MB): desktop (fp16, quality drop ~0).
  • tokenizer / config: KoELECTRA base (monologg/koelectra-base-v3-discriminator).
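
A minimal loading sketch for the files above, assuming `onnxruntime` and `transformers` are installed and the ONNX file has been downloaded locally. The helper names (`pick_model_file`, `run_logits`) are hypothetical, and two things are assumptions rather than documented facts: that the tokenizer can be loaded from the base model id, and that the first graph output holds the token-classification logits.

```python
# File-to-target mapping taken from the Files list; helper names are hypothetical.
MODEL_FILES = {
    "server": "wl-ko-ner-v5-fp32.onnx",
    "mobile": "wl-ko-ner-v5-fp32.onnx",
    "desktop": "wl-ko-ner-v5-fp16.onnx",
}
TOKENIZER_ID = "monologg/koelectra-base-v3-discriminator"


def pick_model_file(target: str) -> str:
    """Return the ONNX file name recommended for a deployment target."""
    return MODEL_FILES[target]


def run_logits(text: str, model_path: str):
    """Tokenize one string and run the ONNX graph on CPU."""
    # Imported lazily so pick_model_file works without these packages.
    import onnxruntime as ort
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(TOKENIZER_ID)
    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    encoded = tokenizer(text, return_tensors="np")
    # Feed only the inputs the exported graph declares; their names are
    # not documented in this card, so they are read from the session.
    wanted = {inp.name for inp in session.get_inputs()}
    feeds = {name: arr for name, arr in encoded.items() if name in wanted}
    # Assumption: the first output is the logits tensor.
    return session.run(None, feeds)[0]
```

Reading the input names from the session avoids hard-coding whether the export expects `token_type_ids`.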

μš©λ„

Korean entity extraction for WygLore Leaf (a RisuAI V3 plugin). Serve the ONNX model from a free CPU server (HF Space) or self-host it. Output: [{text, type, score, start, end}, ...]. Runtime post-processing (trimming parenthesized hanja, applying a common-word stoplist, and dropping hanja-only entities) is performed on the server/plugin side.
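
The three post-processing steps could look roughly like the sketch below. It is an illustration under stated assumptions, not the server or plugin code: the stoplist entries are hypothetical, "hanja" is approximated by the CJK Unified Ideographs block plus Extension A, `start`/`end` are assumed to be character offsets with an exclusive end, and only a trailing parenthesized hanja gloss is trimmed.

```python
import re
from typing import Dict, List

# Hypothetical stoplist; the real list is not given in this card.
STOPLIST = {"그", "그녀", "여기", "오늘"}

# Rough definition of hanja: CJK Unified Ideographs + Extension A.
HANJA = r"\u3400-\u4dbf\u4e00-\u9fff"
# A trailing "(漢字)" gloss, ASCII or full-width parentheses, closing one optional.
TRAILING_HANJA_PAREN = re.compile(rf"\s*[(（][{HANJA}]+[)）]?$")
HANJA_ONLY = re.compile(rf"^[{HANJA}]+$")


def postprocess(entities: List[Dict]) -> List[Dict]:
    """Trim trailing hanja glosses, then drop hanja-only and stoplisted entities."""
    cleaned = []
    for ent in entities:
        text = ent["text"]
        trimmed = TRAILING_HANJA_PAREN.sub("", text)
        if not trimmed or HANJA_ONLY.match(trimmed) or trimmed in STOPLIST:
            continue
        # Only a suffix was removed, so start stays and end shrinks.
        end = ent["end"] - (len(text) - len(trimmed))
        cleaned.append({**ent, "text": trimmed, "end": end})
    return cleaned


raw = [
    {"text": "이순신(李舜臣)", "type": "PS", "score": 0.99, "start": 0, "end": 8},
    {"text": "李舜臣", "type": "PS", "score": 0.90, "start": 10, "end": 13},
    {"text": "오늘", "type": "DT", "score": 0.80, "start": 14, "end": 16},
    {"text": "서울", "type": "LC", "score": 0.95, "start": 17, "end": 19},
]
result = postprocess(raw)
```

Trimming runs before the two drop checks so that an entity reduced to nothing, or to a stoplisted word, is filtered in the same pass.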

License stack

  • Base: monologg/KoELECTRA (Apache-2.0).
  • Training data: KLUE-NER (CC-BY-SA-4.0, Park et al. 2021) + roleplay lorebook canon (operator-owned asset).
  • Weights: CC-BY-SA-4.0 (inherited from KLUE; the most restrictive license takes precedence). Code: Apache-2.0.