tts Collection by AUSSIE Jan 5, 2024 - myshell-ai/OpenVoice Text-to-Speech • Updated Dec 24, 2024 • 488
LLMs Collection by nehabora Jan 5, 2024 - Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • 2401.02038 • Published Jan 4, 2024 • 65
Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • 2401.02038 • Published Jan 4, 2024 • 65
3d models Collection by ljhwild Jan 5, 2024 - Learning the 3D Fauna of the Web Paper • 2401.02400 • Published Jan 4, 2024 • 11
media Collection by worklt Jan 5, 2024 - Running on T4 290 Demucs Music Source Separation (v4) ⚡ 290 Separate vocals and instrumentals from any music track
Running on T4 290 Demucs Music Source Separation (v4) ⚡ 290 Separate vocals and instrumentals from any music track
Document Intelligence Collection by navakanth-reddy Jan 5, 2024 - DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 191
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 191
Winograd Schema Challenge Datasets related to the original Winograd Schema Challenge (WSC) Collection by coref-data Jan 21, 2024 1 coref-data/davis_wsc_raw Viewer • Updated Jan 19, 2024 • 558 • 31 coref-data/davis_pdp_raw Viewer • Updated Jan 24, 2024 • 60 • 15 coref-data/mwsc_raw Viewer • Updated Jan 19, 2024 • 262 • 231 coref-data/superglue_wsc_raw Viewer • Updated Jan 19, 2024 • 1.61k • 12
Vision Collection by LT34848 Jan 21, 2025 - GPT-4V(ision) is a Generalist Web Agent, if Grounded Paper • 2401.01614 • Published Jan 3, 2024 • 22 Runtime error 11 Multimodal VDR Demo 🦙 11 Multimodal retrieval using llamaindex/vdr-2b-multi-v1
tts Collection by AUSSIE Jan 5, 2024 - myshell-ai/OpenVoice Text-to-Speech • Updated Dec 24, 2024 • 488
LLMs Collection by nehabora Jan 5, 2024 - Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • 2401.02038 • Published Jan 4, 2024 • 65
Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • 2401.02038 • Published Jan 4, 2024 • 65
3d models Collection by ljhwild Jan 5, 2024 - Learning the 3D Fauna of the Web Paper • 2401.02400 • Published Jan 4, 2024 • 11
Document Intelligence Collection by navakanth-reddy Jan 5, 2024 - DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 191
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 191
Winograd Schema Challenge Datasets related to the original Winograd Schema Challenge (WSC) Collection by coref-data Jan 21, 2024 1 coref-data/davis_wsc_raw Viewer • Updated Jan 19, 2024 • 558 • 31 coref-data/davis_pdp_raw Viewer • Updated Jan 24, 2024 • 60 • 15 coref-data/mwsc_raw Viewer • Updated Jan 19, 2024 • 262 • 231 coref-data/superglue_wsc_raw Viewer • Updated Jan 19, 2024 • 1.61k • 12
media Collection by worklt Jan 5, 2024 - Running on T4 290 Demucs Music Source Separation (v4) ⚡ 290 Separate vocals and instrumentals from any music track
Running on T4 290 Demucs Music Source Separation (v4) ⚡ 290 Separate vocals and instrumentals from any music track
Vision Collection by LT34848 Jan 21, 2025 - GPT-4V(ision) is a Generalist Web Agent, if Grounded Paper • 2401.01614 • Published Jan 3, 2024 • 22 Runtime error 11 Multimodal VDR Demo 🦙 11 Multimodal retrieval using llamaindex/vdr-2b-multi-v1