Commit History

Revert "Replace use of transformers library with vLLM for high-speed inference"
74aae71

timadair committed on

Revert "Diagnosing why device not found"
eabb6a7

timadair committed on

Diagnosing why device not found
d642f29

timadair committed on

Replace use of transformers library with vLLM for high-speed inference
56c4132

timadair committed on

Change duration, reasoning level, max_new_tokens to ensure a smooth demo
7cea9da

timadair committed on

Split out system prompt to use gpt-oss-20b's low reasoning mode, max_new_tokens->1500, comment cleanup
593fd81

timadair committed on

Update readme with awareness of ZeroGPU limitations for visitors
5e8e9f3

timadair committed on

One last time. Duration of 45s, which should allow almost all requests to succeed. Visitors can't generate if the duration exceeds their remaining daily allotment.
aca3670

timadair committed on

Change duration to 120 (likely not a visitor setting)
9512057

timadair committed on

Set higher ZeroGPU limit for visitors.
762465c

timadair committed on

Create public link on launch
dd03218

timadair committed on

Address long method code smell in UI code
b57dcf0

timadair committed on

Populate the README, fix 'inference not run' formatting error
c4da447

timadair committed on

Some final cleanup of error messages when generation fails
0f911b4

timadair committed on

Add feedback in the results section
df5f56b

timadair committed on

Try to have the quiz_json box visible while the quiz is being generated.
246c399

timadair committed on

That works functionally but the layout is bad. Have questions be inline with answer radio button groups.
463477c

timadair committed on

Another UI rebuild. Use a fixed number of question slots that are populated instead of trying to create UI elements dynamically.
1142597

timadair committed on

Rework how quiz UI elements are populated, add A-D
b5744e8

timadair committed on

Add some debugging prints for building the quiz
ba36476

timadair committed on

Additional cleanup and commenting of quiz generation code
ac34d65

timadair committed on

Add empty topic check to avoid useless inference
b3100e3

timadair committed on

Isolate inference from response extraction.
dcb2ece

timadair committed on

Extract out inference code to a separate file
4c20656

timadair committed on

Change to grabbing only text after the final instance of the start of the JSON.
da4ab71

timadair committed on

Ignore 'Final Answer:' appearing during reasoning
74ec963

timadair committed on

Avoid returning CoT by finding the final answer.
9588cda

timadair committed on

Refine prompt some.
bcf1221

timadair committed on

Be more explicit with instructions to remove code-generating behavior
c131aa6

timadair committed on

Use roleplaying to try to influence the LLM away from writing code that generates a quiz
eb22e5c

timadair committed on

Fix f-string issue with JSON literal
a36dbd8

timadair committed on

Use an LLM (gpt-oss-20B) to generate quiz questions
a21a9b6

timadair committed on

Initial Gradio app w/ stock json
6bba6fb

timadair committed on

initial commit
628a399
verified

timadair committed on