ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality Paper • 2510.22037 • Published Oct 24, 2025 • 22
On the Limitations of Vision-Language Models in Understanding Image Transforms Paper • 2503.09837 • Published Mar 12, 2025 • 10
On the Limitations of Vision-Language Models in Understanding Image Transforms Paper • 2503.09837 • Published Mar 12, 2025 • 10
On the Limitations of Vision-Language Models in Understanding Image Transforms Paper • 2503.09837 • Published Mar 12, 2025 • 10 • 2
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 13
DataProvenanceInitiative/stack-exchange-instruction-2split Viewer • Updated Dec 8, 2024 • 10.8M • 1.93k
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper • 2407.14933 • Published Jul 20, 2024 • 15
DataProvenanceInitiative/Commercial_or_unspecified_licenses_and_terms Viewer • Updated Sep 9, 2024 • 61M • 190
DataProvenanceInitiative/commercial_or_unspecified_licenses Viewer • Updated Sep 9, 2024 • 74.6M • 141
DataProvenanceInitiative/commercial_licenses_and_terms Viewer • Updated Sep 9, 2024 • 25.2M • 359 • 1