AI & ML interests

LLM, Finance, Accounting, Spreadsheet

Recent Activity

AdinaY 
posted an update about 5 hours ago
AdinaY 
posted an update about 6 hours ago
view post
Post
67
Kimi K2.5 from Moonshot AI is more than just another large model🤯

https://huggingface.co/collections/moonshotai/kimi-k25

✨ Native multimodality : image + video + language + agents 💥
✨1T MoE / 32B active
✨ 256K context
✨ Modified MIT license
✨ Agent Swarm execution
✨ Open weights + open infra mindset
AdinaY 
posted an update 6 days ago
view post
Post
437
AgentCPM-report 🔥 local DeepResearch agent released from OpenBMB

openbmb/AgentCPM-Report

✨ 8B - Apache 2.0
✨ Gemini-2.5-Pro level DeepResearch report generation
✨ Fully offline, privacy-first local deployment
✨ + GGUF version
  • 1 reply
·
AdinaY 
posted an update 7 days ago
AdinaY 
posted an update 8 days ago
view post
Post
2341
Z.ai just released a powerful lightweight option of GLM 4.7

✨ 30B total/3B active - MoE

zai-org/GLM-4.7-Flash
  • 1 reply
·
AdinaY 
posted an update 8 days ago
view post
Post
228
Another Chinese model fully trained on domestic chips, released by China Telecom 👀

Tele-AI/TeleChat3-36B-Thinking

TeleChat3-36B-Thinking:
✨ Native support for the Ascend + MindSpore ecosystem
✨ Inspired by DeepSeek’s architecture design, bringing training stability and efficiency gains.
  • 2 replies
·
AdinaY 
posted an update 11 days ago
view post
Post
1050
After a VLM, StepFun dropped a new audio model: Step-Audio-R1.1, enabling thinking while speaking 🔥

stepfun-ai/Step-Audio-R1.1

✨ Apache 2.0
✨ Combines dual-brain architecture and acoustic-grounded reasoning to enable real-time dialogue with SOTA-level reasoning
  • 2 replies
·
AdinaY 
posted an update 12 days ago
view post
Post
1734
We have a new heatmap live on huggingface now🔥

woojun-jung/open-source-release-heatmap-ko

Korean community built their own version to track labs that actively publish open work, inspired by Chinese open source heat map!

This is the open source community at its best ♥️
  • 1 reply
·
AdinaY 
posted an update 13 days ago
view post
Post
705
More lightweight multimodal models are coming 👀

StepFun has been focused on multimodal AI from the very beginning. Their latest release a new foundational model: STEP3-VL🔥
https://huggingface.co/collections/stepfun-ai/step3-vl-10b
✨ 10B - Apache2.0
✨ Leads in the 10B class and competes with models 10–20× larger
AdinaY 
posted an update 13 days ago
view post
Post
345
Agentic capability is the new battleground🔥

LongCat-Flash-Thinking-2601, the latest reasoning model from Meituan- LongCat

✨ MoE - 560B total / 27B active
✨ MIT license
✨ Agentic tool use
✨ Multi-environment RL
✨ Parallel + iterative reasoning

meituan-longcat/LongCat-Flash-Thinking-2601
AdinaY 
posted an update 13 days ago
view post
Post
344
GLM-Image from Z.ai is out 🔥

It was fully trained on Ascend Atlas 800T A2 with MindSpore, probably the first SOTA multimodal model fully trained on domestic chips 👀

zai-org/GLM-Image

✨ Hybrid Architecture: combined autoregressive + diffusion design delivers strong semantic alignment with high-fidelity details
✨ Strong performance in long, dense, and multilingual text rendering
✨ MIT licensed (VQ tokenizer & ViT weights under Apache 2.0)
✨ Now live on Hugging Face inference provider 🤗
AdinaY 
posted an update 14 days ago
view post
Post
2652
From ChatGPT Healthcare to Claude for healthcare, AI in medicine is speeding up🚀

Now BaichuanAI joins with Baichuan-M3 🏥 an open medical LLM trained for clinical decision-making

https://huggingface.co/collections/baichuan-inc/baichuan-m3

✨ 235B - Apache2.0
✨ Lower hallucinations via Fact-Aware RL
✨ Built for long medical chats
  • 2 replies
·
AdinaY 
posted an update 15 days ago
view post
Post
2839
AgentCPM-Explore🔥 on device agent foundation model released by OpenBMB
openbmb/AgentCPM-Explore
✨ 4B - Apache2.0
✨ Supports 100+ multi-turn environment interactions with search + verification
✨ Full training/inference stack is openly shared as well
AdinaY 
posted an update 15 days ago
view post
Post
2595
Based on 2025 Chinese AI Timeline, here are some interesting takeaways:

✨ DeepSeek cadence: They shipped almost every month! (except Feb 2025)

✨ Qwen trajectory: Not a single “hit” model, but an expanding product line. VL/Math/Coder/Reranker/Embedding/Omni/Next/Image

✨ Multimodal trend: Steadily rising share, shifting from generation to editing + tooling.

✨ Reasoning as a main track: more engineered, system-level reasoning.

✨ From foundation to components: growth in infra models (embeddings, rerankers, OCR, speech) signals a move toward deployable stacks.

✨ Ecosystem broadening: more players beyond the top labs.

Follow for more updates👉
zh-ai-community

  • 2 replies
·
AdinaY 
posted an update 15 days ago
view post
Post
300
Spirit AI (千寻智能) shared its VLA foundation model Spirit v1.5 on huggingface 🔥

It’s now No. 1 on RoboChallenge’s Table30 leaderboard, beating Pi0.5 🏆
Spirit-AI-robotics/Spirit-v1.5
AdinaY 
posted an update 19 days ago
view post
Post
1524
Wechat AI is shipping!

WeDLM 🔥 A new language model that generates tokens in parallel, making it faster than standard LLMs , with the same Transformer setup!
https://huggingface.co/collections/tencent/wedlm

✨ 7B/8B - Base & Instruct
✨ Apache 2.0
·
AdinaY 
posted an update 19 days ago
AdinaY 
posted an update 20 days ago
AdinaY 
posted an update 20 days ago
view post
Post
303
Daily Papers just got an AI reading assistant 🔥

You can ask any question you want: clarify a paragraph, get a short summary...all without leaving the page!

✨ Powered by HuggingChat + Hugging Face MCP server