# readCtrl_lambda/code/RL_model/verl/Search-R1/requirements.txt
# Initial commit of readCtrl code without large models (mshahidul, 030876e)
accelerate
codetiming
datasets
dill
flash-attn
hydra-core
numpy
pandas
pybind11
ray
tensordict<0.6
transformers<4.48
vllm<=0.6.3
wandb
IPython
matplotlib