daniicruzz commited on
Commit
957fa93
·
verified ·
1 Parent(s): 2f0a41b

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -245,6 +245,8 @@ Reported numbers use the methodology described above.
245
 
246
  ![Intelligence Thinking](assets/littlelamb-tc-intelligence-family.png)
247
 
 
 
248
  ### Quantitative Results (Inference Performance)
249
 
250
  #### Metrics reported
 
245
 
246
  ![Intelligence Thinking](assets/littlelamb-tc-intelligence-family.png)
247
 
248
+ BFCL V4 is the de facto industry standard for evaluating function-calling (tool-use) capability. It tests whether models can correctly generate structured function calls in response to user queries, across simple single-call scenarios, parallel calls, multi-turn conversations, and complex agentic workflows.
249
+
250
  ### Quantitative Results (Inference Performance)
251
 
252
  #### Metrics reported