Spaces:

paris-noah
/

README

Running

vasilii-feofanov commited on Mar 3, 2025

Commit

94afca3

verified ·

1 Parent(s): 5ed850b

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -27,10 +27,13 @@ Paris Noah's Ark Lab consists of 3 research teams that cover the following topic
 ### Preprints
    - [TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning](https://huggingface.co/papers/2502.15425): distributed multi-agent hierarchical reinforcement learning framework.
-   - [Zero-shot Model-based Reinforcement Learning using Large Language Models](https://huggingface.co/papers/2410.11711): disentangled in-context learning for multivariate time series forecasting and model-based RL.
    - [Large Language Models as Markov Chains](https://huggingface.co/papers/2410.02724):  theoretical insights on their generalization and convergence properties.
    - [A Systematic Study Comparing Hyperparameter Optimization Engines on Tabular Data](https://balazskegl.medium.com/navigating-the-maze-of-hyperparameter-optimization-insights-from-a-systematic-study-6019675ea96c): insights to navigate the maze of hyperopt techniques.
 ### 2024
    - *(NeurIPS'24)* [MANO: Unsupervised Accuracy Estimation Under Distribution Shifts](https://huggingface.co/papers/2405.18979): when logits are enough to estimate generalization of a pre-trained model.

 ### Preprints
    - [TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning](https://huggingface.co/papers/2502.15425): distributed multi-agent hierarchical reinforcement learning framework.
    - [Large Language Models as Markov Chains](https://huggingface.co/papers/2410.02724):  theoretical insights on their generalization and convergence properties.
    - [A Systematic Study Comparing Hyperparameter Optimization Engines on Tabular Data](https://balazskegl.medium.com/navigating-the-maze-of-hyperparameter-optimization-insights-from-a-systematic-study-6019675ea96c): insights to navigate the maze of hyperopt techniques.
+### 2025
+   - *(ICLR'25)* - [Zero-shot Model-based Reinforcement Learning using Large Language Models](https://huggingface.co/papers/2410.11711): disentangled in-context learning for multivariate time series forecasting and model-based RL.
 ### 2024
    - *(NeurIPS'24)* [MANO: Unsupervised Accuracy Estimation Under Distribution Shifts](https://huggingface.co/papers/2405.18979): when logits are enough to estimate generalization of a pre-trained model.