Spaces:

jeanbaptdzd
/

open-finance-llm-8b

Paused

App Files Files Community

open-finance-llm-8b / docs /openai_api_verification.md

jeanbaptdzd

Reorganize tests and clean up documentation

6d3bf74 27 days ago

preview code

raw

history blame

6.96 kB

OpenAI API Compatibility Verification

Overview

This document verifies that our OpenAI API wrapper implementation correctly follows the OpenAI API specification and properly connects to the Qwen fine-tuned model.

Connection Flow

OpenAI-compatible Client
    ↓ (OpenAI API requests)
Hugging Face Space API (simple-llm-pro-finance)
    ↓ (FastAPI router)
TransformersProvider
    ↓ (Hugging Face Transformers)
Qwen-Open-Finance-R-8B Model

OpenAI API Specification Compliance

1. Chat Completions Endpoint: `/v1/chat/completions`

✅ Request Parameters (All Supported)

Parameter	Type	Status	Notes
`model`	string	✅	Required, defaults to configured model
`messages`	array	✅	Required, validated
`temperature`	number	✅	Optional, default 0.7, validated (0-2)
`max_tokens`	integer	✅	Optional, validated (≥1)
`stream`	boolean	✅	Optional, default false
`top_p`	number	✅	Optional, default 1.0
`tools`	array	✅	Optional, tool definitions
`tool_choice`	string/object	✅	Optional, supports "none", "auto", "required"
`response_format`	object	✅	Optional, supports {"type": "json_object"}

✅ Response Format

Field	Type	Status	Notes
`id`	string	✅	Generated chat completion ID
`object`	string	✅	"chat.completion"
`created`	integer	✅	Unix timestamp
`model`	string	✅	Model name
`choices`	array	✅	Array of Choice objects
`usage`	object	✅	Token usage statistics

✅ Choice Object

Field	Type	Status	Notes
`index`	integer	✅	Choice index
`message`	object	✅	Message object
`finish_reason`	string	✅	"stop", "length", "tool_calls"

✅ Message Object

Field	Type	Status	Notes
`role`	string	✅	"assistant"
`content`	string/null	✅	Message content
`tool_calls`	array/null	✅	Array of ToolCall objects

✅ ToolCall Object

Field	Type	Status	Notes
`id`	string	✅	Tool call ID
`type`	string	✅	"function"
`function`	object	✅	FunctionCall object

✅ FunctionCall Object

Field	Type	Status	Notes
`name`	string	✅	Function name
`arguments`	string	✅	JSON string of arguments

2. Tool Choice Handling

✅ Supported Values

"none": Model will not call any tools
"auto": Model can choose to call tools (default)
"required": Model must call a tool (converted to "auto" for text-based models)
{"type": "function", "function": {"name": "..."}}: Force specific tool

Implementation Note: Since Qwen is a text-based model (not native function calling), we convert "required" to "auto" and handle tool calls via text parsing.

3. Response Format Handling

✅ JSON Object Mode

When response_format={"type": "json_object"} is provided:

✅ System prompt is enhanced with JSON output instructions
✅ Response is parsed to extract JSON from markdown code blocks
✅ Clean JSON is returned for validation

Implementation: Since Qwen doesn't have native JSON mode, we enforce it via prompt engineering and post-processing.

Client Integration

✅ Supported Parameters

The API accepts standard OpenAI API parameters:

{
    "model": "dragon-llm-open-finance",
    "messages": [...],
    "temperature": 0.7,
    "max_tokens": 3000,
    "response_format": {"type": "json_object"},  # ✅ Supported
    "tool_choice": "required",  # ✅ Accepted (converted to "auto")
    "tools": [...]  # ✅ Tool definitions supported
}

✅ Implementation Details

✅ tool_choice="required" → Accepted and converted to "auto"
✅ response_format={"type": "json_object"} → JSON instructions added to prompt
✅ tools array → Formatted and added to system prompt
✅ Tool calls in response → Parsed from text and returned in OpenAI format

Qwen Model Integration

✅ Model Connection

Model Loading: ✅ Uses Hugging Face Transformers
- Model: DragonLLM/Qwen-Open-Finance-R-8B
- Tokenizer: Auto-loaded with model
- Device: Auto (CUDA if available)
Prompt Formatting: ✅ Uses Qwen chat template
- System prompts properly formatted
- Tools added to system prompt
- JSON instructions added when needed
Response Processing: ✅
- Text generation via Transformers
- Tool call parsing from text
- JSON extraction from markdown

✅ Qwen-Specific Considerations

Text-Based Tool Calls: Qwen doesn't have native function calling, so we:
- Format tools in system prompt
- Parse <tool_call>...</tool_call> blocks from response
- Convert to OpenAI-compatible format
JSON Output: Qwen doesn't have native JSON mode, so we:
- Add JSON instructions to system prompt
- Extract JSON from markdown code blocks
- Validate and return clean JSON

Verification Checklist

API Compatibility

All required OpenAI API parameters supported
Response format matches OpenAI specification
Error handling follows OpenAI error format
Streaming support implemented
Tool calls properly formatted

Client Compatibility

tool_choice="required" accepted
response_format supported
Structured output requests handled correctly
Tool definitions passed through
Structured outputs extracted

Qwen Model Integration

Model loads correctly from Hugging Face
Chat template applied correctly
Tools formatted for Qwen prompt style
Tool calls parsed from Qwen text format
JSON extracted from Qwen responses

Testing Recommendations

Basic Chat: Verify simple chat completions work
Tool Calls: Test with tools defined, verify parsing
Structured Outputs: Test with response_format, verify JSON extraction
Error Handling: Test invalid requests return proper errors
Streaming: Test streaming responses work correctly

Known Limitations

Native Function Calling: Qwen doesn't support native function calling, so we use text-based parsing
JSON Mode: Qwen doesn't have native JSON mode, so we enforce via prompts
Tool Choice "required": Converted to "auto" since we can't force tool calls in text-based models

Conclusion

✅ Our OpenAI API wrapper is correctly implemented and properly connected to the Qwen fine-tuned model.

The implementation:

Follows OpenAI API specification
Handles OpenAI-compatible parameters correctly
Properly integrates with Qwen model via Transformers
Provides fallbacks for features not natively supported by Qwen

OpenAI API Compatibility Verification

Overview

Connection Flow

OpenAI API Specification Compliance

1. Chat Completions Endpoint: /v1/chat/completions

✅ Request Parameters (All Supported)

✅ Response Format

✅ Choice Object

✅ Message Object

✅ ToolCall Object

✅ FunctionCall Object

2. Tool Choice Handling

✅ Supported Values

3. Response Format Handling

✅ JSON Object Mode

Client Integration

✅ Supported Parameters

✅ Implementation Details

Qwen Model Integration

✅ Model Connection

✅ Qwen-Specific Considerations

Verification Checklist

API Compatibility

Client Compatibility

Qwen Model Integration

Testing Recommendations

Known Limitations

Conclusion

1. Chat Completions Endpoint: `/v1/chat/completions`