alpayariyak commited on
Commit
f99ecdb
·
verified ·
1 Parent(s): 9a5709c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -9
README.md CHANGED
@@ -10,16 +10,23 @@ base_model:
10
  pipeline_tag: text-generation
11
  ---
12
 
13
-
14
- <div align="center">
15
- <span style="font-family: default; font-size: 2em; display:block; margin-bottom:0.1em;">DeepSWE-Preview</span>
16
- <a href="https://github.com/agentica-project/rllm"
17
- style="display:inline-flex;align-items:center;gap:8px;height:36px;padding:0 18px;background:linear-gradient(155deg,#2c3e50 0%,#34516a 100%);color:#fff;font:600 14px/1 -apple-system,BlinkMacSystemFont,'Segoe UI',sans-serif;line-height:1;border-radius:9999px;text-decoration:none;box-shadow:0 0 10px rgba(90,180,255,.55),0 3px 9px rgba(0,0,0,.18);white-space:nowrap;vertical-align:middle;">
18
- <img src="https://hebbkx1anhila5yf.public.blob.vercel-storage.com/IMG_3783-N75vmFhDaJtJkLR4d8pdBymos68DPo.png" alt="rLLM logo" style="width:18px;height:18px">
19
- Built&nbsp;with&nbsp;rLLM
20
- </a>
21
- <span style="font-family: default; font-size: 1em; display:block; margin-top:0.6em; margin-bottom:0.6em;">🚀 Democratizing Reinforcement Learning for LLM Agents (RLLM) 🌟</span>
 
 
 
 
 
 
22
  </div>
 
23
  <div align="center" style="line-height: 1;">
24
  <a href="https://github.com/agentica-project/rllm" style="margin: 2px;">
25
  <img alt="Code" src="https://img.shields.io/badge/rLLM-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white" style="display: inline-block; vertical-align: middle;"/>
@@ -40,6 +47,7 @@ pipeline_tag: text-generation
40
  </div>
41
  </div>
42
 
 
43
  ## DeepSWE Overview
44
  DeepSWE-Preview is a fully open-sourced, state-of-the-art coding agent trained with only reinforcement learning (RL) to excel at software engineering (SWE) tasks. DeepSWE-Preview demonstrates strong reasoning capabilities in navigating complex codebases and viewing/editing multiple files, and it serves as a foundational model for future coding agents. The model achieves an impressive **59.0%** on SWE-Bench-Verified, which is currently #1 in the open-weights category.
45
 
 
10
  pipeline_tag: text-generation
11
  ---
12
 
13
+ <!-- Header Section -->
14
+ <div align="center" style="background: linear-gradient(135deg, #1c1c1e 0%, #2c2c2e 100%); backdrop-filter: blur(1.25em); border-radius: 1.25em; padding: 1.5em 1.5em; box-shadow: 0 0.5em 2em rgba(0,0,0,0.3); margin-bottom: 1em; border: 1px solid #3a3a3c;">
15
+
16
+ <!-- Logo and Title -->
17
+ <div style="display: flex; align-items: center; justify-content: center; gap: 1em; margin-bottom: 1em;">
18
+ <div style="width: 3.5em; height: 3.5em; border-radius: 1em; display: flex; align-items: center; justify-content: center;">
19
+ <img src="https://hebbkx1anhila5yf.public.blob.vercel-storage.com/IMG_3783-N75vmFhDaJtJkLR4d8pdBymos68DPo.png" alt="DeepSWE logo" style="width: 2.5em; height: 2.5em;">
20
+ </div>
21
+ <h1 style="font-size: 3em; font-weight: 150; color: #ffffff; margin: 0; font-family: -apple-system, BlinkMacSystemFont, 'SF Pro Display', sans-serif; letter-spacing: -0.02em;">DeepSWE-Preview</h1>
22
+ </div>
23
+
24
+ <!-- Subtitle -->
25
+ <p style="font-size: 1.2em; color: #a1a1aa; margin: 0 0 0 0; font-family: -apple-system, BlinkMacSystemFont, 'SF Pro Text', sans-serif; line-height: 1.5; font-weight: 400;">
26
+ Democratizing Reinforcement Learning for LLM Agents
27
+ </p>
28
  </div>
29
+
30
  <div align="center" style="line-height: 1;">
31
  <a href="https://github.com/agentica-project/rllm" style="margin: 2px;">
32
  <img alt="Code" src="https://img.shields.io/badge/rLLM-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white" style="display: inline-block; vertical-align: middle;"/>
 
47
  </div>
48
  </div>
49
 
50
+
51
  ## DeepSWE Overview
52
  DeepSWE-Preview is a fully open-sourced, state-of-the-art coding agent trained with only reinforcement learning (RL) to excel at software engineering (SWE) tasks. DeepSWE-Preview demonstrates strong reasoning capabilities in navigating complex codebases and viewing/editing multiple files, and it serves as a foundational model for future coding agents. The model achieves an impressive **59.0%** on SWE-Bench-Verified, which is currently #1 in the open-weights category.
53