Improve citation in Light-IF-32B model card

#2
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +15 -96
README.md CHANGED
@@ -1,10 +1,11 @@
1
  ---
2
- license: apache-2.0
3
  base_model:
4
  - Qwen/Qwen3-32B
5
- pipeline_tag: text-generation
6
  library_name: transformers
 
 
7
  ---
 
8
  <!-- markdownlint-disable first-line-h1 -->
9
  <!-- markdownlint-disable html -->
10
  <!-- markdownlint-disable no-duplicate-header -->
@@ -34,7 +35,7 @@ library_name: transformers
34
  |Deepseek-v3-250324|0.306|0.859|0.833|0.405|
35
  |Doubao-1.5-pro-32k-250115|0.285|0.889|0.797|0.375|
36
  |Kimi-K2|0.227|0.921|0.820|0.395|
37
- |GLM-4.5|0.395|0.893|0.833|0.466|
38
  | [**Light-IF-32B (ours)** 🤗](https://huggingface.co/qihoo360/Light-IF-32B) |**0.575**|**0.938**|**0.85**|**0.575**|
39
 
40
 
@@ -98,8 +99,10 @@ try:
98
  except ValueError:
99
  index = 0
100
 
101
- thinking_content = tokenizer.decode(output_ids[:index], skip_special_tokens=True).strip("\n")
102
- content = tokenizer.decode(output_ids[index:], skip_special_tokens=True).strip("\n")
 
 
103
 
104
  print("thinking content:", thinking_content)
105
  print("content:", content)
@@ -130,7 +133,7 @@ waiting for the world to breathe.
130
 
131
  Wait, that's 4 lines (subtitle plus 3 lines). Wait, no: the user said each stanza begins with a highlighted subtitle. Does the subtitle count as a line? The original request says "total of 15 lines" and "4 stanzas, each beginning with a highlighted subtitle." So probably, the subtitles are part of the lines, so each stanza's first line is the subtitle, then the rest. Wait, but the example: if the first stanza is 4 lines, then subtitle (1) + 3 lines. Let's check: 4 stanzas, total lines: 15. Let's split: 4 stanzas: maybe stanza 1: 4 lines, stanza 2: 4, stanza 3: 4, stanza 4: 3? Wait, 4+4+4+3=15. That's a possible split.
132
 
133
- But need to make sure the flow is natural. Let's adjust the number of lines per stanza: for example, stanza 1: 4 lines (subtitle + 3), stanza 2: 4 (subtitle + 3), stanza 3: 4, stanza 4: 3.
134
 
135
  Let's try drafting stanza 1: *Beneath the Soil*
136
  a heartbeat of green,
@@ -233,88 +236,7 @@ Now, check the rhythm. Each line is roughly similar in length, maybe iambic tetr
233
 
234
  Another revision: maybe make some lines more concise or impactful. For example, stanza 1: "waiting for the world to breathe" is good.
235
 
236
- Stanza 2: "toward the faint" could be "toward faint light" to be more specific. So: tip tilts toward faint light, (5 words). Then stanza 2 word count: 3+5+5+5=18.
237
-
238
- Stanza 3: "when wind screams like rage" is 5 words. Maybe "when winds roar like rage" for more intensity. So: 5 words. Then stanza 3 word count: 4+5+5+5=19.
239
-
240
- Stanza 4: "each one asking to be seen" is 6 words. Maybe change to "each begging to be seen" for more impact. So: 5 words. Then stanza 4 word count: 3+4+6+1=14.
241
-
242
- Total word count: 17+18+19+14=68. Still fine.
243
-
244
- Now, make sure each stanza's subtitle is highlighted. The user said "highlighted subtitle," so in the response, we'll use italics.
245
-
246
- Another revision for flow: Maybe change the order of stanzas to make the growth more natural. Wait, the order is: beneath soil, sprout, bend, bloom. That's logical.
247
-
248
- Another check: 15 lines total. Let's count:
249
-
250
- Stanza 1: 1 (subtitle) +3=4 lines.
251
-
252
- Stanza 2: 5+3=4? Wait, no: each stanza is subtitle (1 line) plus number of lines. Wait, no: the user said "4 stanzas, each beginning with a highlighted subtitle." So the total number of lines includes the subtitles. So if each stanza has 4 lines, then 4 stanzas would be 16 lines. But the user wants 15. So one stanza has 3 lines.
253
-
254
- As per our split: 4,4,4,3. Let's confirm:
255
-
256
- Stanza 1: 4 lines (1 subtitle + 3 lines).
257
-
258
- Stanza 2: 4 (1+3).
259
-
260
- Stanza 3: 4 (1+3).
261
-
262
- Stanza 4: 3 lines (1 subtitle +2 lines).
263
-
264
- Yes, that's 4+4+4+3=15 lines.
265
-
266
- Wait, earlier I had stanza 4: 3 lines (1 subtitle +2). So let's adjust stanza 4 to have 3 lines. For example:
267
-
268
- *Bloom Into Sun*
269
- petals open like hearts,
270
- each begging to be seen.
271
-
272
- That's 3 lines (1 subtitle +2). Then the previous stanza (stanza 3) is 4 lines: subtitle +3.
273
-
274
- But wait, no: let's do this again. The total lines are 15. So if stanza 1 has 4 lines (1+3), stanza 2 has 4 (1+3), stanza 3 has 4 (1+3), stanza 4 has 3 (1+2). That's 4+4+4+3=15.
275
-
276
- Perfect. So stanza 4 is 3 lines: subtitle +2.
277
-
278
- So let's revise stanza 4: *Bloom Into Sun*
279
- petals open like hearts,
280
- each begging to be seen.
281
-
282
- That's 3 lines. Then total lines: 15.
283
-
284
- Now, check again:
285
-
286
- Stanza 1: 4 lines.
287
-
288
- 1. *Beneath the Soil*
289
- 2. a heartbeat of green,
290
- 3. wrapped tight in dark,
291
- 4. waiting for the world to breathe.
292
-
293
- Stanza 2: 4 lines.
294
-
295
- 5. *Sprout Through Cracks*
296
- 6. tip tilts toward faint light,
297
- 7. pushing through stone and time,
298
- 8. a fragile stem holding sky.
299
-
300
- Stanza 3: 4 lines.
301
-
302
- 9. *Bend, But Not Break*
303
- 10. when winds roar like rage,
304
- 11. I arch, then rise again,
305
- 12. stronger where I was torn.
306
-
307
- Stanza 4: 3 lines.
308
-
309
- 13. *Bloom Into Sun*
310
- 14. petals open like hearts,
311
- 15. each begging to be seen.
312
-
313
- Yes, exactly 15 lines.
314
-
315
- Now, check the flow: Each stanza builds on the growth theme.
316
-
317
- Another revision for better imagery: Maybe change "faint light" to "faint light through cracks" to tie back to the subtitle. But that adds words. Since we have to keep word count low, maybe keep it as is.
318
 
319
  Stanza 2: "tip tilts toward faint light,"—good, shows struggle.
320
 
@@ -383,13 +305,10 @@ For deployment, you can use `sglang>=0.4.6.post1` or `vllm>=0.8.5` or to create
383
 
384
  ## Citation
385
  ```
386
- @misc{lightifproj,
387
- title={Light-IF: Endowing LLMs with Generalizable Reasoning via Preview and Self-Checking for Complex Instruction Following.},
388
- author={Light-IF Team},
389
- year={2025},
390
- eprint={},
391
- archivePrefix={},
392
- primaryClass={},
393
- url={https://huggingface.co/qihoo360/Light-IF-32B},
394
  }
395
  ```
 
1
  ---
 
2
  base_model:
3
  - Qwen/Qwen3-32B
 
4
  library_name: transformers
5
+ license: apache-2.0
6
+ pipeline_tag: text-generation
7
  ---
8
+
9
  <!-- markdownlint-disable first-line-h1 -->
10
  <!-- markdownlint-disable html -->
11
  <!-- markdownlint-disable no-duplicate-header -->
 
35
  |Deepseek-v3-250324|0.306|0.859|0.833|0.405|
36
  |Doubao-1.5-pro-32k-250115|0.285|0.889|0.797|0.375|
37
  |Kimi-K2|0.227|0.921|0.820|0.395|
38
+ |GLM-4.5|0.395|0.893|0.833|0.466|\
39
  | [**Light-IF-32B (ours)** 🤗](https://huggingface.co/qihoo360/Light-IF-32B) |**0.575**|**0.938**|**0.85**|**0.575**|
40
 
41
 
 
99
  except ValueError:
100
  index = 0
101
 
102
+ thinking_content = tokenizer.decode(output_ids[:index], skip_special_tokens=True).strip("
103
+ ")
104
+ content = tokenizer.decode(output_ids[index:], skip_special_tokens=True).strip("
105
+ ")
106
 
107
  print("thinking content:", thinking_content)
108
  print("content:", content)
 
133
 
134
  Wait, that's 4 lines (subtitle plus 3 lines). Wait, no: the user said each stanza begins with a highlighted subtitle. Does the subtitle count as a line? The original request says "total of 15 lines" and "4 stanzas, each beginning with a highlighted subtitle." So probably, the subtitles are part of the lines, so each stanza's first line is the subtitle, then the rest. Wait, but the example: if the first stanza is 4 lines, then subtitle (1) + 3 lines. Let's check: 4 stanzas, total lines: 15. Let's split: 4 stanzas: maybe stanza 1: 4 lines, stanza 2: 4, stanza 3: 4, stanza 4: 3? Wait, 4+4+4+3=15. That's a possible split.
135
 
136
+ But need to make sure the flow is natural. Let's adjust the number of lines per stanza: for example, stanza 1: 4 lines, stanza 2: 4, stanza 3: 4, stanza 4: 3.
137
 
138
  Let's try drafting stanza 1: *Beneath the Soil*
139
  a heartbeat of green,
 
236
 
237
  Another revision: maybe make some lines more concise or impactful. For example, stanza 1: "waiting for the world to breathe" is good.
238
 
239
+ Stanza 2: "toward the faint" could be "toward faint light" to be more specific. But that adds words. Since we have to keep word count low, maybe keep it as is.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
240
 
241
  Stanza 2: "tip tilts toward faint light,"—good, shows struggle.
242
 
 
305
 
306
  ## Citation
307
  ```
308
+ @article{sun2025lightif,
309
+ title={Light-IF: Endowing LLMs with Generalizable Reasoning via Preview and Self-Checking for Complex Instruction Following},
310
+ author={Qiushi Sun and Zichen Ding and Zhiyong Wu and Yian Wang and Songli Gong and Kun Kuang and Yexin Cao and Fei Huang and Jian Li and Junjie Yan and Guohong Zeng and Xin Lin},
311
+ journal={arXiv preprint arXiv:2508.03178},
312
+ year={2025}
 
 
 
313
  }
314
  ```