Spaces:

insight-publica
/

README

Running

insightpublica commited on Feb 17

Commit

96fa118

verified ·

1 Parent(s): 5603d5c

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -6,5 +6,14 @@ colorTo: gray
 sdk: static
 pinned: false
 ---
-Edit this `README.md` markdown file to author your organization card.

 sdk: static
 pinned: false
 ---
+We are a publishing house with 2,000+ titles, dedicated to the digitization and preservation of literary classics and linguistic resources. Our focus is on building high-quality, aligned datasets for Low-Resource Languages, specifically within the Dravidian and Indic language families.
+Currently, we are working on large-scale projects including:
+Parallel Corpora: Multilingual alignment of classic literature (English, Malayalam, Hindi, Kannada, and Tamil).
+Lexical Datasets: Digitizing comprehensive dictionaries like Shabdatharavali for AI training and NLP research.
+Classic Literature Digitization: Converting a vast catalog of public domain titles into AI-ready formats (e-Pub/JSON).
+Our goal is to bridge the gap in Machine Translation and NLU for Indian languages by providing clean, human-verified, and culturally rich data.