Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -27,10 +27,13 @@ Paris Noah's Ark Lab consists of 3 research teams that cover the following topic
|
|
| 27 |
### Preprints
|
| 28 |
|
| 29 |
- [TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning](https://huggingface.co/papers/2502.15425): distributed multi-agent hierarchical reinforcement learning framework.
|
| 30 |
-
- [Zero-shot Model-based Reinforcement Learning using Large Language Models](https://huggingface.co/papers/2410.11711): disentangled in-context learning for multivariate time series forecasting and model-based RL.
|
| 31 |
- [Large Language Models as Markov Chains](https://huggingface.co/papers/2410.02724): theoretical insights on their generalization and convergence properties.
|
| 32 |
- [A Systematic Study Comparing Hyperparameter Optimization Engines on Tabular Data](https://balazskegl.medium.com/navigating-the-maze-of-hyperparameter-optimization-insights-from-a-systematic-study-6019675ea96c): insights to navigate the maze of hyperopt techniques.
|
| 33 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 34 |
### 2024
|
| 35 |
|
| 36 |
- *(NeurIPS'24)* [MANO: Unsupervised Accuracy Estimation Under Distribution Shifts](https://huggingface.co/papers/2405.18979): when logits are enough to estimate generalization of a pre-trained model.
|
|
|
|
| 27 |
### Preprints
|
| 28 |
|
| 29 |
- [TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning](https://huggingface.co/papers/2502.15425): distributed multi-agent hierarchical reinforcement learning framework.
|
|
|
|
| 30 |
- [Large Language Models as Markov Chains](https://huggingface.co/papers/2410.02724): theoretical insights on their generalization and convergence properties.
|
| 31 |
- [A Systematic Study Comparing Hyperparameter Optimization Engines on Tabular Data](https://balazskegl.medium.com/navigating-the-maze-of-hyperparameter-optimization-insights-from-a-systematic-study-6019675ea96c): insights to navigate the maze of hyperopt techniques.
|
| 32 |
|
| 33 |
+
### 2025
|
| 34 |
+
|
| 35 |
+
- *(ICLR'25)* - [Zero-shot Model-based Reinforcement Learning using Large Language Models](https://huggingface.co/papers/2410.11711): disentangled in-context learning for multivariate time series forecasting and model-based RL.
|
| 36 |
+
|
| 37 |
### 2024
|
| 38 |
|
| 39 |
- *(NeurIPS'24)* [MANO: Unsupervised Accuracy Estimation Under Distribution Shifts](https://huggingface.co/papers/2405.18979): when logits are enough to estimate generalization of a pre-trained model.
|