Create Globally Convergent Offline Reinforcement Learning with Smoothed Bellman Residual Minimization 0c31b92 verified ehwkang committed on Feb 16
Create Reasonably reasoning agents can avoid game-theoretic failures in zero-shot, provably.txt c089bad verified ehwkang committed on Feb 16
Create LLM Personas as a Substitute for Field Experiments in Method Benchmarking b564b48 verified ehwkang committed on Dec 25, 2025
Rename src/data/MRRC.txt to src/data/Learning NP-Hard Multi-Agent Assignment Planning using GNN_Inference on a Random Graph and Provable Auction-Fitted Q-learning.txt 3d5390e verified ehwkang committed on Dec 14, 2025
Rename src/data/Bounded_SC.txt to src/data/Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle.txt ed9c763 verified ehwkang committed on Dec 14, 2025
Upload Is O(log N) practical_Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL.txt 8023657 verified ehwkang committed on Dec 14, 2025
Rename src/data/ERMIRL.txt to src/data/Empirical risk minimization for Inverse RL and Dynamic Discrete Choice models.txt b2dd68e verified ehwkang committed on Dec 14, 2025
Upload Stability and Generalization for Bellman Residuals.txt 2d7d599 verified ehwkang committed on Dec 14, 2025
Rename src/data/BO_language.txt to src/data/Bayesian optimization in language space: An eval-efficient AI self-improvement framework.txt 345fa94 verified ehwkang committed on Dec 14, 2025
Rename src/Bounded_SC.txt to src/data/Bounded_SC.txt 664ef20 verified ehwkang committed on Dec 14, 2025
Rename src/BO_language.txt to src/data/BO_language.txt a879e27 verified ehwkang committed on Dec 14, 2025