5 21

Emmanouil Karystinaios PRO

manoskary

https://emmanouil-karystinaios.github.io/

AI & ML interests

Audio-Based Diffusion, Graph Neural Networks and Music Information Retrieval.

Recent Activity

upvoted a paper 3 days ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

upvoted a paper 3 days ago

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

upvoted a paper 3 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

View all activity

Organizations

upvoted 4 papers 3 days ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 164

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

Paper • 2606.03264 • Published 26 days ago • 23

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 174

Unlimited OCR Works

Paper • 2606.23050 • Published 6 days ago • 37

liked a model 3 days ago

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated 3 days ago • 213k • 1.14k

updated a Space 26 days ago

Neural Morphing

🎛

DAC RVQ palette morphing with beam search

updated a Space 29 days ago

Scoreprompts

🎼

Explain Music Scores through analysis.

published a Space 30 days ago

Neural Morphing

🎛

DAC RVQ palette morphing with beam search

updated a Space about 1 month ago

Stable Audio 3 Controls

🏆

Various Controls for SAO3 generation

published a Space about 1 month ago

Stable Audio 3 Controls

🏆

Various Controls for SAO3 generation

liked 3 models about 1 month ago

updated a Space about 2 months ago

AnalysisGNN Music Analysis

🎵

Inference for the AnalysisGNN score analysis model

liked a dataset about 2 months ago

SyMuPe/PianoCoRe

Viewer • Updated May 8 • 250k • 1.47k • 6

liked a model 2 months ago

mistralai/Voxtral-4B-TTS-2603

Text-to-Speech • Updated Mar 31 • 86.9k • 863

updated 2 Spaces 2 months ago

Woosh DFlow

🔊

Woosh-DFlow text-to-audio sound effect generation

Woosh VFlow

🔊

Generate synchronized sound for uploads and video URLs

updated a bucket 2 months ago

manoskary/Woosh-DFlow-storage

3.69 GB

published a bucket 2 months ago

manoskary/Woosh-DFlow-storage

3.69 GB

Emmanouil Karystinaios PRO

AI & ML interests

Recent Activity

Organizations

manoskary's activity

Neural Morphing

Scoreprompts

Neural Morphing

Stable Audio 3 Controls

Stable Audio 3 Controls

AnalysisGNN Music Analysis

Woosh DFlow

Woosh VFlow

manoskary/Woosh-DFlow-storage

manoskary/Woosh-DFlow-storage