Running on CPU Upgrade 551 Visualize Dataset (v2.0+ latest dataset format) ๐ป 551 Visualize LeRobot datasets with interactive charts and tools
view article Article Deploying Open Source Vision Language Models (VLM) on Jetson nvidia โข Feb 24 โข 37
view article Article NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI nvidia โข Jan 5 โข 64
view article Article Introducing NVIDIA Cosmos Policy for Advanced Robot Control nvidia โข Jan 29 โข 48
Running on Zero Agents 183 HunyuanWorld-Mirror ๐ 183 Universal 3D World Reconstruction with Any Prior Prompting
view article Article SmolVLM - small yet mighty Vision Language Model +3 andito, merve, mfarre, eliebak, pcuenq โข Nov 26, 2024 โข 419
view post Post 3930 The new Qwen-2 VL models seem to perform quite well in object detection. You can prompt them to respond with bounding boxes in a reference frame of 1k x 1k pixels and scale those boxes to the original image size.You can try it out with my space maxiw/Qwen2-VL-Detection 6 replies ยท ๐ 14 14 ๐ 5 5 ๐ค 1 1 + Reply
view article Article Welcome PaliGemma 2 โ New vision language models by Google +2 merve, andsteing, pcuenq, ariG23498 โข Dec 5, 2024 โข 166