BadCat

Foresta

3 11 9

Aegis1863

AI & ML interests

LLMs Deep learning Reinforcement learning

Recent Activity

upvoted a paper about 18 hours ago

From Proprietary to Open-Source: Bridging the Distribution Gap via Multi-Agent Protocol Distillation in Agentic Search

upvoted a paper 6 days ago

Beyond Relevance-Centric Retrieval: Rubric-Oriented Document Set Selection and Ranking

new activity 25 days ago

Chunjiang-Intelligence/DeepSeek-v4-Fable:🚩 Report: Spam

View all activity

Organizations

None yet

upvoted a paper about 18 hours ago

From Proprietary to Open-Source: Bridging the Distribution Gap via Multi-Agent Protocol Distillation in Agentic Search

Paper • 2607.24280 • Published 2 days ago • 64

upvoted a paper 6 days ago

Beyond Relevance-Centric Retrieval: Rubric-Oriented Document Set Selection and Ranking

Paper • 2607.19747 • Published 7 days ago • 31

New activity in Chunjiang-Intelligence/DeepSeek-v4-Fable 25 days ago

🚩 Report: Spam

#10 opened 25 days ago by

Foresta

upvoted a paper about 2 months ago

On the Geometry of On-Policy Distillation

Paper • 2606.07082 • Published Jun 5 • 75

upvoted a paper 3 months ago

A^2TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

Paper • 2605.06200 • Published May 7 • 15

liked a Space 4 months ago

TorchCode

🔥

Run and edit interactive notebooks in your browser

upvoted a paper 5 months ago

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 28 days ago • 34

upvoted an article 5 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

NormalUhr

•

Aug 9, 2025

• 133

upvoted 2 papers 7 months ago

AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

Paper • 2601.04767 • Published Jan 8 • 28

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published Dec 29, 2025 • 28

upvoted a paper 10 months ago

Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?

Paper • 2510.06036 • Published Oct 7, 2025 • 7

upvoted a paper 12 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12, 2025 • 34

liked 2 datasets over 1 year ago

PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 19.1k • 192

hendrydong/gpqa_diamond

Viewer • Updated Jan 3, 2025 • 198 • 1.91k • 10

liked a model over 1 year ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1, 2025 • 11.9k • 3.64k

New activity in SicariusSicariiStuff/LLAMA-3_8B_Unaligned_BETA over 1 year ago

How to deploy model?

#3 opened over 1 year ago by

Foresta

liked 4 models over 1 year ago

SicariusSicariiStuff/LLAMA-3_8B_Unaligned_BETA

8B • Updated Nov 5, 2025 • 62 • 52

sentence-transformers/all-MiniLM-L6-v2

cooperleong00/Meta-Llama-3-8B-Instruct-Jailbroken

8B • Updated Dec 23, 2024 • 92 • 6

unitary/toxic-bert

Text Classification • 0.1B • Updated Mar 13, 2024 • 192k • • 225

BadCat

AI & ML interests

Recent Activity

Organizations

Foresta's activity

🚩 Report: Spam

TorchCode

From GRPO to DAPO and GSPO: What, Why, and How

How to deploy model?