Hugging Face
Spaces: ehwkang/researchbot
researchbot/src/data (main, 1.06 MB)
2 contributors
History: 13 commits
ehwkang: Create Globally Convergent Offline Reinforcement Learning with Smoothed Bellman Residual Minimization (0c31b92, verified, 3 months ago)
Bayesian optimization in language space: An eval-efficient AI self-improvement framework.txt
Safe
195 kB
Rename src/data/BO_language.txt to src/data/Bayesian optimization in language space: An eval-efficient AI self-improvement framework.txt
5 months ago
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle.txt
Safe
63.9 kB
Rename src/data/Bounded_SC.txt to src/data/Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle.txt
5 months ago
Empirical risk minimization for Inverse RL and Dynamic Discrete Choice models.txt
Safe
244 kB
Rename src/data/ERMIRL.txt to src/data/Empirical risk minimization for Inverse RL and Dynamic Discrete Choice models.txt
5 months ago
Globally Convergent Offline Reinforcement Learning with Smoothed Bellman Residual Minimization
Safe
62.2 kB
Create Globally Convergent Offline Reinforcement Learning with Smoothed Bellman Residual Minimization
3 months ago
Is O(log N) practical_Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL.txt
Safe
90.8 kB
Upload Is O(log N) practical_Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL.txt
5 months ago
LLM Personas as a Substitute for Field Experiments in Method Benchmarking
Safe
81.2 kB
Create LLM Personas as a Substitute for Field Experiments in Method Benchmarking
5 months ago
Learning NP-Hard Multi-Agent Assignment Planning using GNN_Inference on a Random Graph and Provable Auction-Fitted Q-learning.txt
Safe
110 kB
Rename src/data/MRRC.txt to src/data/Learning NP-Hard Multi-Agent Assignment Planning using GNN_Inference on a Random Graph and Provable Auction-Fitted Q-learning.txt
5 months ago
Reasonably reasoning agents can avoid game-theoretic failures in zero-shot, provably.txt
Safe
103 kB
Create Reasonably reasoning agents can avoid game-theoretic failures in zero-shot, provably.txt
3 months ago
Stability and Generalization for Bellman Residuals.txt
Safe
104 kB
Upload Stability and Generalization for Bellman Residuals.txt
5 months ago