Spaces:

ashish-sarvam
/

data-gen

Runtime error

App Files Files Community

data-gen / todo.md

ashish-sarvam

Upload folder using huggingface_hub

fc1a684 verified 7 months ago

preview code

raw

history blame contribute delete

4.2 kB

A newer version of the Gradio SDK is available: 6.13.0

Upgrade

GOAL: Generate 10K diverse conversations | 1 M SFT Pairs in sets of 100

[X] add state machine remove hindin first messaage refinement [] memory [] triage agent [] add more tools to choose from - in global user memory flag

%% tools are not even getting attached / %% unneccassry states / %% transitioning with pre text / %% tool calling with question / %% AGENT in tool message label / %% Agent role and prompt not coming in gradio %% User role not coming in gradio / %% let me check but still no tool call

state does not exist --> added prompt and available states
tool does not exist --> added prompt
wrong params in tool --> added schemas
tool results were different from actual generated variables for bot and user --> sending bot and user prompt in tool simulation

[] tool call is not waiting for user repy it is just doing tool call even when it is asking question and user has not replied [] remove raw outs [] add public sector and other domains [] add environment - chat vs phone [] org --> companies graph -- right now it is just private sector --> trust dedup [] compute diversity

4th Oct [X] clean up todos [] make sure i am collating all user releated tools and passing to the bot - give all tools to bot for a user type and let it figure out. [] output and input schema for all steps [] Prompt generation below

[] after pre-text do not let user message [] tool call repeatitions agent messages [] generate convos in readable format quickly [] fix memory generation, usage and checker - no PII [] use case dedup 80% - use cases can be even better [] 1 word replies [] state machine - multi-agent [] bot prompt and user prompt path [] make sure generations are only in english [] tools in xml [] checking hf and nemo datasets for user personas [] improve on the checker - rubrick - tool use performance, overall difficulty, quality consistency over extended dialogues. did not go off-policy and commited something which was not true - specifically detects such deceptive claims. Safety [] define diversity and how are we getting it [] generate a data diversity summary [] states to triage - triage customer support - 100s of agent which agent should i select - umang [] adding skills like deep recall, language transition, memory [] context summarisation [] add more public sector use cases --> maybe start with all industries/companies --> make them granular by divisions --> generate uniques somehow?? [] skills in checker - with message number {[skill: "", message_number:]} [] add thinking layer/ reasoning tokens in some traces - minimal thinking - what are such complex cases where we can use thinking - https://huggingface.co/datasets/interstellarninja/tool-use-multiturn-reasoning/viewer/default/train?views%5B%5D=train&row=10 [] parallel and serial multi tools before feedback

5th Oct [] quality of code - read post training data practices [] user diversity - some graph approaches? [] understanding seeds everywhere [] when generating user proxies some goals can be generic like wrong number these goals can have lesser weight [] add kb with numeric data - quality of retrievals [] add reasoning traces for customer support chat [] --> train qwen 30 [] --> special thinking tokens for deep research ##thinking## approach of american model is better than chinese ? | thinking tokens are most important. [] voice based payment [] - economic times data? [] --> create benchmark of improvement [] - keep appening overall domains - like salon, healthcare, and stuff... [] Multiple conversations per user* with memory enabled. [] adding user context [] --> create mcp environment --> 500/600 mcp tools to expand [] dedup layer on generated conversations

BOT PROMPT GENERATION STYLES:

add concept of global and state prompts
add prompts with instructions

User Persona

personality description from user card
goal from user card
knobs from sampled data
metadata - company, agent and user roles, use case

AUDIO ISSUES:

generally audio issues are clustered - that is if they happen then they continue for 2-3 turns until user moves in a different environment

i can add thinking tag for rag also