bayan-api / src /nlp /punctuation

Commit History

Fix punctuation alignment and hallucinations, add punctuation safety rules, add kana/inna grammar rules
6effc77
Running

youssefreda9 commited on

fix(nlp): remove overly restrictive rules 0b and 0c from punctuation safety guards that blocked valid commas and colons
d791d61

youssefreda9 commited on

refactor: simplify architecture and replace whitelists with CamelTools
a4fd3e7

Bayan AI commited on

UI/UX: Add Quran standalone tab, summarize textarea, floating selection bar, fix clear editor, remove duplicate button
70184c4

youssefreda9 commited on

Phase 14c: Fix FIX-31 whitespace bug + fix misleading log truncation
ca212fd

youssefreda9 commited on

Merge punctuation_rules V1+V2: V2 threshold+fallback, V1 softened exclamation guard for short texts
c1f8f9f

youssefreda9 commited on

Fix: TerminalPunctuationGuard for short phrases and comprehensive Nasb/Jazm grammar rules fixes
31a6db1

Bayan AI commited on

Phase 11.5: Adversarial diagnostic + 4 critical pipeline fixes
57104c1

youssefreda9 commited on

Phase 13: Fix punctuation terminal injection filter (context-aware Rule 0)
c830869

Mohamed Atef commited on

FIX-26: Reject punct substitution when original has punct
175fc9d

youssefreda9 commited on

Phase 12: Fix 6 batches — grammar pipeline bypass, religious/structured protection, punct rearrangement, SV/gender agreement
015c7b7

youssefreda9 commited on

Phase 11: Master Bug Fix — 23 fixes across 8 files
095e270

youssefreda9 commited on

revert: restore backend/model files to cf07939 state, keep UI/UX changes only
01b11d4

youssefreda9 commited on

fix: block grammar كان→كأن + punctuation mid-word split (الدفت.ر)
b5890c0

youssefreda9 commited on

fix: all model bugs — S1 S2 S3 G1 P1 (6 fixes across 4 files)
e68c40c

youssefreda9 commited on

feat: Pipeline Hardening v3.3 — stage ownership, punctuation safety, UUID migration
a9630ec

youssefreda9 commited on

feat: NLP-3 PuncAra-v1 Integration — Local punctuation model - Created src/nlp/punctuation/ package (rules + service) - Extracted from PuncAra.py: preprocessing, postprocessing, chunking logic - PunctuationChecker: lazy-loaded EncoderDecoderModel from bayan10/PuncAra-v1 - Wired into /api/analyze as Step 3 (Spelling -> Grammar -> Punctuation) - Updated /api/punctuation standalone endpoint - Updated /api/health to report punctuation status - Dockerfile pre-downloads PuncAra-v1 weights during build - Increased gunicorn timeout to 300s for full pipeline
8bea99d

youssefreda9 commited on