hopenjin
/

CE-CoLLM_Cloud_LLM_Partition

hopenjin commited on Nov 17, 2025

Commit

e33bcdf

1 Parent(s): 3934e01

Update model card with CE-CoLLM paper info

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,13 +1,25 @@
 ## Citation
-If you find this model useful, please cite the following works:
 ```bibtex
-@misc{jin2024cecollmefficientadaptivelarge,
-      title={CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration},
-      author={Hongpeng Jin and Yanzhao Wu},
-      year={2024},
-      eprint={2411.02829},
-      archivePrefix={arXiv},
-      primaryClass={cs.DC},
-      url={https://arxiv.org/abs/2411.02829},
-}

+# CE-CoLLM Cloud LLM Partition
+This repository hosts the cloud-side partition of the CE-CoLLM model used in our IEEE ICWS 2025 paper "CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration".
+- Codebase: https://github.com/mlsysx/CE-CoLLM
+- Edge-side partition: https://huggingface.co/hopenjin/CE-CoLLM_Edge_LLM_Partition
+- Paper (IEEE Xplore): https://ieeexplore.ieee.org/abstract/document/11169709
 ## Citation
+Please cite the following paper when using CE-CoLLM:
 ```bibtex
+@INPROCEEDINGS{jin2024cecollmefficientadaptivelarge,
+  author={Jin, Hongpeng and Wu, Yanzhao},
+  booktitle={2025 IEEE International Conference on Web Services (ICWS)},
+  title={CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration},
+  year={2025},
+  pages={316-323},
+  keywords={Cloud computing;Accuracy;Web services;Large language models;Collaboration;Benchmark testing;Reliability engineering;Low latency communication;Edge computing;Software development management;Large Language Model;LLM Deployment;Cloud-Edge Collaboration;Cloud Services;Adaptive LLM Inference;Edge AI},
+  doi={10.1109/ICWS67624.2025.00046},
+  ISSN={2836-3868},
+  month={July},
+}
+```