lapchann Labs

community
Activity Feed

tsungyi
posted an update 4 months ago
🤖 Cosmos Policy just dropped for robotics.

Cutting-edge research turning a world foundation model into a unified robot brain that can see, predict, and act, with no extra action heads and no complicated control stack.

Read our blog on @HuggingFace ➡️ https://huggingface.co/blog/nvidia/cosmos-policy-for-robot-control

Want to get hands-on with Cosmos (Reason, Predict, Policy, Cookbook)?
Join the Cosmos Cookoff, sponsored by @Nebius and @Milestone ➡️ https://luma.com/nvidia-cosmos-cookoff?utm_source=social
tsungyi
posted an update 4 months ago
🎉 Exciting News: NVIDIA Cosmos is celebrating its 1st birthday and has hit 5 MILLION downloads! 🎉

In just one year, the Cosmos ecosystem has grown rapidly:
🧠 Cosmos Reason and Cosmos Predict have surpassed 2 MILLION downloads each on @HuggingFace, topping physical AI leaderboards
🔄 Cosmos Transfer is enabling adaptation across domains and tasks
🔮 Cosmos Cookbook is the go-to hub for recipes from developers and partners like Uber and IntBot.

Thank you to our amazing developer community for making this possible. Here's to pushing the boundaries of world foundation models together!

🧑🏻‍🍳 Read the Cosmos Cookbook: https://nvda.ws/4qevli8
📚 Explore Models & Datasets: https://huggingface.co/collections/nvidia/nvidia-cosmos-2
tsungyi
posted an update 4 months ago
Big news from CES: Cosmos Reason 2 is here, our most advanced reasoning vision-language model for physical AI, now topping the Physical AI Bench leaderboard 🏆 shi-labs/physical-ai-bench-leaderboard

What’s new:
- Enhanced physical reasoning & spatio-temporal understanding
- Flexible deployment with 2B & 8B model sizes
- Long-context understanding (up to 256K tokens)
- Object detection with 2D/3D point localizations and trajectory data
- New Cosmos Cookbook Recipes for faster onboarding

Read the full blog 📖 https://huggingface.co/blog/nvidia/nvidia-cosmos-reason-2-brings-advanced-reasoning
Download Cosmos Reason 2 👉 nvidia/Cosmos-Reason2-8B

On top of Cosmos Reason 2, we also rolled out other new updates, including:
- Cosmos Predict 2.5 – Unified Text2World/Image2World/Video2World model for higher-quality synthetic video worlds
- Cosmos Transfer 2.5-2B – Lightweight, high-fidelity world-to-world translation with stronger physics alignment
- NVIDIA GR00T N1.6 – Open robot foundation model for general-purpose robotic learning and control, integrated with Cosmos Reason

Get Started with the Cosmos Cookbook 🧑🏻‍🍳 https://nvda.ws/4qevli8
tsungyi
posted an update 8 months ago
We’re excited to share that Cosmos Reason has surpassed 1 million downloads on Hugging Face!

Cosmos Reason is an open, customizable, commercial-ready 7B-parameter reasoning vision language model (VLM) designed for physical AI. By combining physics understanding, prior knowledge, and common sense reasoning, Cosmos Reason empowers AI agents and robots to operate intelligently in real-world environments.

Key applications already unlocked include:

✅ Automating large-scale dataset curation and annotation

🤖 Powering robot planning and vision-language action (VLA) decision-making

📊 Driving advanced video analytics and actionable insight generation

We’re proud to see a global community of developers using Cosmos Reason to teach robots to think like humans, and we’re just getting started.

⚡ Get started with Cosmos Reason 1 NIM, an easy-to-use microservice for AI model deployment: https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/cosmos-reason1-7b?version=1

📈 See the leaderboard: facebook/physical_reasoning_leaderboard
tsungyi
posted an update 9 months ago
Cosmos Reason just topped the Physical Reasoning Leaderboard on Hugging Face. 👏🔥

Cosmos Reason is an open, customizable, commercial-ready 7B-parameter reasoning vision language model (VLM) for physical AI and robotics. The VLM empowers robots and vision AI agents to reason like humans, leveraging prior knowledge, physics understanding, and common sense to understand and operate intelligently in the real world.

This model unlocks advanced capabilities for robotics, autonomous vehicles, and real-world operations, from cities to high-tech factories.

Key use cases include:
- Data curation & annotation: Automate high-quality dataset curation and annotation at scale.
- Robot planning & reasoning: Serve as the "brain" for deliberate, methodical decision-making with vision language action (VLA) models.
- Video analytics AI agents: Extract actionable insights and perform root-cause analysis on massive video datasets.

Ready to build the next generation of physical AI? Get started 👉 nvidia/Cosmos-Reason1-7B
Try the preview here: https://build.nvidia.com/nvidia/cosmos-reason1-7b