researchbot / src /data

Commit History

Create Globally Convergent Offline Reinforcement Learning with Smoothed Bellman Residual Minimization
0c31b92
verified

ehwkang commited on

Create Reasonably reasoning agents can avoid game-theoretic failures in zero-shot, provably.txt
c089bad
verified

ehwkang commited on

Create LLM Personas as a Substitute for Field Experiments in Method Benchmarking
b564b48
verified

ehwkang commited on

Rename src/data/MRRC.txt to src/data/Learning NP-Hard Multi-Agent Assignment Planning using GNN_Inference on a Random Graph and Provable Auction-Fitted Q-learning.txt
3d5390e
verified

ehwkang commited on

Rename src/data/Bounded_SC.txt to src/data/Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle.txt
ed9c763
verified

ehwkang commited on

Upload Is O(log N) practical_Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL.txt
8023657
verified

ehwkang commited on

Rename src/data/ERMIRL.txt to src/data/Empirical risk minimization for Inverse RL and Dynamic Discrete Choice models.txt
b2dd68e
verified

ehwkang commited on

Upload Stability and Generalization for Bellman Residuals.txt
2d7d599
verified

ehwkang commited on

Rename src/data/BO_language.txt to src/data/Bayesian optimization in language space: An eval-efficient AI self-improvement framework.txt
345fa94
verified

ehwkang commited on

Rename src/MRRC.txt to src/data/MRRC.txt
edcda58
verified

ehwkang commited on

Rename src/ERMIRL.txt to src/data/ERMIRL.txt
64805e9
verified

ehwkang commited on

Rename src/Bounded_SC.txt to src/data/Bounded_SC.txt
664ef20
verified

ehwkang commited on

Rename src/BO_language.txt to src/data/BO_language.txt
a879e27
verified

ehwkang commited on