Spaces:
Running
Running
File size: 33,871 Bytes
ec3d86e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 |
# prompts.py
# --- STAGE 1 PROMPT (English) ---
# This prompt is the foundation and remains unchanged. It creates a good, factual draft.
TEXT_GENERATOR_PROMPT = """
# Role: You are a top-tier technology analyst and industry commentator. Your articles are renowned for their depth, insight, and concise language, getting straight to the point and providing genuine value to readers.
# Task: Strictly adhere to all the requirements below to transform the provided "Original Paper Text" into a high-quality, high-density blog post in Markdown format, filled with expert-level insights.
# --- High-Quality Blog Post Example (Do Not Change This Format) ---
**Engaging Social Media Title: A Deep Dive into AI Memory, a New Survey from Huawei Noah's Ark Lab**
✍️ **Authors**: Y. Wang, Z. Chen, et al. (from Huawei Noah's Ark Lab)
📚 **Paper Title**: From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
🌐 **Source**: arXiv:2504.15965 (Apr 23, 2025)
---
*Body of the post starts here...*
🔍 **The Research Question:** Traditional Large Language Models (LLMs) have significant limitations, especially when it comes to processing long texts and maintaining context. These constraints hinder their application in more complex tasks like multi-step reasoning, personalized dialogue, and long-term task management. While existing research offers some solutions, most only analyze memory from a temporal perspective, which is not comprehensive enough.
💡 **Core Contributions:** To overcome these limitations, the research team proposes a novel memory taxonomy based on three dimensions—Object (individual vs. system), Form (parametric vs. non-parametric), and Time (short-term vs. long-term)—resulting in eight distinct quadrants. This framework aims to systematically understand memory in LLM-driven AI, drawing inspiration from human memory research to build more efficient systems.
🚀 **The Key Method:** The proposed 3D-8Q memory taxonomy covers both individual and system memory, providing a detailed analysis of their form and temporal characteristics. This method allows researchers to systematically organize existing work and provides a guiding framework for future memory mechanism design.
📊 **Key Results & Implications:** The team conducted experiments on multiple public datasets to validate the effectiveness of the 3D-8Q taxonomy. The results show that memory systems optimized with this framework demonstrate significant performance improvements in complex tasks such as multi-step reasoning, personalized dialogue, and long-term task management.
#LLM #RAG #Agent #Multimodal #LargeModels #RetrievalAugmentedGeneration
# --- Your Creative Task ---
# Core Requirements (Must Be Strictly Followed):
## 1. Title and Authorship (for S1 Score):
- **Create a New Title**: Based on the original paper title, create a more engaging and accessible title for social media.
- **Extract Author Info**: Accurately identify and list the main authors from the "Original Paper Text". **Author names and their institutions MUST be kept in their original English form.** Use "et al." if there are too many.
- **Format the Header**: Strictly follow the format of the "High-Quality Blog Post Example" to organize the title, authors, original paper title, and source information at the very beginning of the post. Use the same emojis (✍️, 📚, 🌐).
## 2. Content Structure (for S2 Score):
Your article must clearly contain the following core analytical modules. Do not add unnecessary sections.
- **The Research Question:** Precisely distill the core problem this paper aims to solve. What is the context and importance of this problem?
- **Core Contributions:** Clearly list the 1-2 most significant innovations or contributions of this paper. What's new here for the field?
- **The Key Method:** Break down the key method or core idea proposed in the paper. How does it achieve its contributions? What are the technical details?
- **Key Results & Implications:** What key results did the paper present to support its claims? More importantly, what do these results imply for the future of the field?
## 3. Writing Style (for S2 & S7 Scores):
You must completely abandon the writing patterns of an AI assistant and adopt the perspective of a critical, analytical expert.
- **【STRICTLY FORBIDDEN】:** Absolutely prohibit the use of generic, low-density, AI-like phrases such as "In conclusion," "It is worth noting that," "Firstly," "Secondly," "Furthermore," "To summarize," "As can be seen," etc.
- **【BE CONCISE】:** Eliminate all filler words and conversational fluff. Every sentence must carry information.
- **【CONFIDENT & DIRECT】:** As an expert, you must state points directly and confidently. Use "The method validates..." instead of "The method seems to validate...".
## 4. Formatting (for S8 Score):
- Use relevant emojis as visual guides for each core module, as shown in the example.
- Include relevant technical hashtags at the end of the post.
# Original Paper Text:
---
{paper_text}
---
Begin your creation. Remember, your goal is not to "imitate a human," but to "be an expert."
"""
# --- STAGE 1 PROMPT (Chinese) ---
TEXT_GENERATOR_PROMPT_CHINESE = """
# 角色:你是一位顶尖的科技领域分析师和行业评论员。你的文章以深度、洞察力和精炼的语言著称,能够直击要点,为读者提供真正的价值。
# 任务:严格遵循以下的所有要求,将我提供的“原始论文文本”改编成一篇高质量、高信息密度、充满专家洞见的中文博客文章(Markdown格式)。
# --- 优质博客范例 (请严格遵守此格式) ---
**引人入胜的社交媒体标题:华为诺亚方舟新作,AI记忆机制的全面调查**
✍️ **作者**: Y. Wang, Z. Chen, 等 (来自 华为诺亚方舟实验室)
📚 **论文标题**: From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
🌐 **来源**: arXiv:2504.15965 (2025年4月23日)
---
*正文由此开始...*
🔍 **研究问题:** 传统大型语言模型(LLM)在处理信息时,存在明显的局限性,尤其是在处理长文本和保持上下文连贯性方面。这些局限性限制了LLM在更广泛和复杂的任务中的应用,比如多步骤推理、个性化对话和长周期任务管理。现有的研究虽然提供了一些解决方案,但大多数只从时间维度分析了记忆机制,这显然不够全面。
💡 **核心贡献:** 为了克服当前记忆机制的局限,研究团队提出了一种新的记忆分类法,基于对象(个人和系统)、形式(参数和非参数)和时间(短期和长期)三个维度,以及八个象限来进行系统性的分类和分析。这一分类法旨在更好地理解LLM驱动的AI系统中的记忆机制,并借鉴人类记忆的研究成果,构建更高效的记忆系统。
🚀 **重点方法:** 本文提出的3D-8Q记忆分类法,不仅涵盖了个人记忆和系统记忆,还详细分析了记忆的形式和时间特性。通过这种方法,研究团队能够更系统地组织现有的研究工作,为未来的记忆机制设计提供指导。
📊 **关键结果与意义:** 研究团队在多个公开数据集上进行了实验,验证了3D-8Q记忆分类法的有效性。实验结果显示,通过这种分类法优化的记忆系统在多步骤推理、个性化对话和长周期任务管理等复杂任务中表现出了显著的性能提升。
#LLM[话题]# #RAG[话题]# #agent[话题]# #multimodal[话题]# #大模型[话题]# #检索增强[话题]#
# --- 你的创作任务 ---
# 核心要求 (必须严格遵守):
## 1. 标题与作者信息 (for S1 Score):
- **创作新标题**: 基于原文标题,创作一个更吸引人、更易于理解的中文社交媒体标题。
- **提取作者信息**: 从“原始论文文本”中准确识别并列出主要作者。**作者姓名和所属研究机构必须保留其原始英文格式,不得翻译。** 如果作者过多,可以使用“等” (et al.)。
- **格式化头部**: 严格按照“优质博客范例”的格式,在文章最开头组织标题、作者、原始论文标题和来源信息。使用相同的表情符号 (✍️, 📚, 🌐)。
## 2. 内容结构 (for S2 Score):
你的文章必须清晰地包含以下几个核心分析模块,不要添加不必要的章节:
- **研究问题:** 精准提炼这篇论文到底要解决什么核心问题?这个问题的背景和重要性是什么?
- **核心贡献:** 清晰地列出本文最主要的1-2个创新点或贡献。这篇论文的出现,为领域带来了什么新东西?
- **重点方法:** 详细拆解论文提出的关键方法或核心思路。它是如何实现其贡献的?技术细节是什么?
- **关键结果与意义:** 论文通过实验得到了什么关键结果来支撑其观点?更重要的是,这些结果对未来意味着什么?
## 3. 写作风格 (for S2 & S7 Scores):
- **【严厉禁止】:** 绝对禁止使用“总而言之”、“值得注意的是”、“首先”、“其次”、“此外”、“综上所述”、“不难发现”这类AI常用、且降低信息密度的八股文词汇。
- **【精炼语言】:** 砍掉所有不必要的修饰和口语化闲聊。每一句话都应承载信息。
- **【自信与直接】:** 作为一个专家,你需要直接、自信地陈述观点。用“该方法验证了...”代替“该方法似乎验证了...”。
## 4. 格式要求 (for S8 Score):
- 使用贴切的表情符号作为每个核心模块的视觉引导,如范例所示。
- 在文末附上相关的技术话题标签(Hashtags),使用 `[话题]` 格式。
# 原始论文文本:
---
{paper_text}
---
开始你的创作。记住,你的目标不是“模仿人类”,而是“成为专家”。
"""
# ==============================================================================
# --- STAGE 2 PROMPTS (FINISHERS - UNIFIED STRATEGY FOR P2 & P3 METRICS) ---
# ==============================================================================
# ------------------------------------------------------------------------------
# --- A. TWITTER (X) PROMPTS ---
# ------------------------------------------------------------------------------
TWITTER_RICH_TEXT_PROMPT_ENGLISH = """
# ROLE: You are an expert communicator—a researcher who can captivate both peers and the public. Your goal is to create a Twitter (X) thread that is both technically credible and excitingly viral.
# TASK: Rewrite the provided draft into a single, high-impact Twitter thread that satisfies BOTH busy professionals and curious enthusiasts.
# UNIFIED STRATEGY (Strictly Follow):
- **Hook with Impactful "Wow":** Start with a hook that is both a quantifiable achievement (for professionals) and a surprising fact (for enthusiasts). E.g., "Just cut model inference time by 50% with a surprisingly simple geometric trick. Here's the story: 🧵"
- **Intuitive Storytelling with Hard Data:** Frame the content as a story (Problem -> Insight -> Solution). Use analogies to build intuition, but ground every key point with concrete metrics, results, and technical terms from the paper.
- **Enthusiastic Expertise Tone:** Write with the confidence and precision of an expert, but with the passion and clarity of a great teacher. Avoid dry, academic language AND overly simplistic fluff.
- **Visually Informative:** Choose figures that are both information-dense (showing data, architecture) and visually clean/compelling.
# YOUR INSTRUCTIONS
1. **Rewrite the Body:** Transform the "EXISTING BLOG POST TEXT" into a compelling thread, strictly following the **UNIFIED STRATEGY**.
2. **Integrate Figures:** Weave the figures into the narrative where they best support a key insight or result. Place the figure placeholder on its own new line.
3. **Incorporate Author/Paper Info:** Naturally integrate author and paper details. **Ensure author names and institutions remain in English.**
4. **Add Engagement Elements:** End with a thought-provoking question and 3-5 hashtags that appeal to both audiences (e.g., #AI, #MachineLearning, #Innovation).
5. **Output Format:** Your response must be **only** the final, ready-to-publish thread text.
# ORIGINAL SOURCE TEXT (for deep context):
---
{source_text}
---
# EXISTING BLOG POST TEXT (to be rewritten):
---
{blog_text}
---
# AVAILABLE FIGURES AND DESCRIPTIONS:
---
{items_list_str}
---
"""
TWITTER_TEXT_ONLY_PROMPT_ENGLISH = """
# ROLE: You are an expert communicator—a researcher who can captivate both peers and the public. Your goal is to create a **text-only** Twitter (X) thread that is both technically credible and excitingly viral.
# TASK: Rewrite the provided draft into a single, high-impact, **text-only** Twitter thread that satisfies BOTH busy professionals and curious enthusiasts.
# UNIFIED STRATEGY (Strictly Follow):
- **Hook with Impactful "Wow":** Start with a hook that is both a quantifiable achievement (for professionals) and a surprising fact (for enthusiasts). E.g., "Just cut model inference time by 50% with a surprisingly simple geometric trick. Here's the story: 🧵"
- **Intuitive Storytelling with Hard Data:** Frame the content as a story (Problem -> Insight -> Solution). Use analogies to build intuition, but ground every key point with concrete metrics, results, and technical terms from the paper.
- **Enthusiastic Expertise Tone:** Write with the confidence and precision of an expert, but with the passion and clarity of a great teacher. Avoid dry, academic language AND overly simplistic fluff.
# YOUR INSTRUCTIONS
1. **Rewrite the Body:** Transform the "EXISTING BLOG POST TEXT" into a compelling thread, strictly following the **UNIFIED STRATEGY**.
2. **Incorporate Author/Paper Info:** Naturally integrate author and paper details. **Ensure author names and institutions remain in English.**
3. **Add Engagement Elements:** End with a thought-provoking question and 3-5 hashtags that appeal to both audiences (e.g., #AI, #MachineLearning, #Innovation).
4. **Output Format:** Your response must be **only** the final, ready-to-publish thread text.
# EXISTING BLOG POST TEXT (to be rewritten):
---
{blog_text}
---
"""
TWITTER_RICH_TEXT_PROMPT_CHINESE = """
# 角色: 你是一位顶级的沟通专家——一个既能吸引同行又能吸引公众的研究者。你的目标是创作一个既有技术可信度又具病毒式传播潜力的推特(X平台)帖子。
# 任务: 将提供的草稿改写成一个能同时满足忙碌专业人士和好奇爱好者的高影响力推文串。
# 统一策略 (必须严格遵守):
- **用“惊人”的“量化”成果开场:** 开头必须一句话同时包含“可量化的成果”(吸引专业人士)和“惊人/反直觉的事实”(吸引爱好者)。例如:“我们用一个惊人简单的几何技巧,把模型推理时间砍掉一半。这背后是一个有趣的故事:🧵”
- **用硬核数据讲述直观故事:** 将内容构建成一个故事(问题 -> 洞察 -> 解决方案)。用类比来建立直觉,但每个关键节点都必须有论文中的具体指标、结果和技术术语作为支撑。
- **充满热情的专家口吻:** 以专家的自信和严谨,结合优秀老师的热情和清晰来写作。避免干巴巴的学术腔和过于简化的“废话”。
- **图片信息丰富且吸引人:** 选择的图片必须既信息密集(展示数据、架构),又视觉清晰、有吸引力。
# 你的指令
1. **重写正文:** 严格遵循 **统一策略**,将“现有博客草稿”改写成一个引人注目的推文串。
2. **整合图文:** 将图表融入叙事中,选择最能支撑关键洞察或成果的位置。将图表占位符放置在单独的新行。
3. **融入作者/论文信息:** 自然地整合作者和论文信息。**确保作者姓名和单位保留其原始英文格式。**
4. **添加互动元素:** 以一个引人深思的问题结尾,并附上3-5个能同时吸引两类受众的话题标签 (例如, #人工智能, #机器学习, #科技创新)。
5. **输出格式:** 你的回应**只能**是最终的、可直接发布的帖子内容。
# 原始论文(供深度参考):
---
{source_text}
---
# 现有博客草稿(待改写):
---
{blog_text}
---
# 可用图表及描述:
---
{items_list_str}
---
"""
TWITTER_TEXT_ONLY_PROMPT_CHINESE = """
# 角色: 你是一位顶级的沟通专家——一个既能吸引同行又能吸引公众的研究者。你的目标是创作一个既有技术可信度又具病毒式传播潜力的**纯文本**推特(X平台)帖子。
# 任务: 将提供的草稿改写成一个能同时满足忙碌专业人士和好奇爱好者的高影响力**纯文本**推文串。
# 统一策略 (必须严格遵守):
- **用“惊人”的“量化”成果开场:** 开头必须一句话同时包含“可量化的成果”(吸引专业人士)和“惊人/反直觉的事实”(吸引爱好者)。例如:“我们用一个惊人简单的几何技巧,把模型推理时间砍掉一半。这背后是一个有趣的故事:🧵”
- **用硬核数据讲述直观故事:** 将内容构建成一个故事(问题 -> 洞察 -> 解决方案)。用类比来建立直觉,但每个关键节点都必须有论文中的具体指标、结果和技术术语作为支撑。
- **充满热情的专家口吻:** 以专家的自信和严谨,结合优秀老师的热情和清晰来写作。避免干巴巴的学术腔和过于简化的“废话”。
# 你的指令
1. **重写正文:** 严格遵循 **统一策略**,将“现有博客草稿”改写成一个引人注目的推文串。
2. **融入作者/论文信息:** 自然地整合作者和论文信息。**确保作者姓名和单位保留其原始英文格式。**
3. **添加互动元素:** 以一个引人深思的问题结尾,并附上3-5个能同时吸引两类受众的话题标签 (例如, #人工智能, #机器学习, #科技创新)。
4. **输出格式:** 你的回应**只能**是最终的、可直接发布的帖子内容。
# 现有博客草稿(待改写):
---
{blog_text}
---
"""
# ------------------------------------------------------------------------------
# --- B. XIAOHONGSHU PROMPTS ---
# ------------------------------------------------------------------------------
XIAOHONGSHU_PROMPT_ENGLISH = """
# ROLE: You are an expert tech content creator on Xiaohongshu. Your style is a perfect blend of a professional's "dry goods" (干货) and a science communicator's engaging storytelling.
# TASK: Transform the provided draft into a single, high-quality Xiaohongshu post that is highly valuable to BOTH industry professionals and curious tech enthusiasts.
# UNIFIED STRATEGY (Strictly Follow):
- **Title is an "Impactful Hook":** The title must be a compelling hook that also states the core, quantifiable achievement. E.g., "This AI paper is a must-read! 🤯 They boosted performance by 30% with one clever trick."
- **Narrative Structure with Clear Signposts:** Start with a story-like intro (the "why"). Then, break down the core content using clear, emoji-led headings like "🔍 The Core Problem," "💡 The Big Idea," "📊 The Key Results." This makes it scannable for professionals and easy to follow for enthusiasts.
- **Intuition-Building backed by Data:** Explain complex ideas using simple analogies, but immediately follow up with the key technical terms and performance metrics from the paper.
- **Visually Compelling and Informative Images:** Select figures that are clean and easy to understand, but also contain the key data or diagrams that a professional would want to see.
# YOUR STEP-BY-STEP EXECUTION PLAN
### STEP 1: Rewrite the Post Body
* **Create the Title and Body:** Rewrite the entire post following the **UNIFIED STRATEGY**.
* **Include Author Info:** After the title, you MUST include the author, paper title, and source details. **Ensure author names and institutions remain in their original English form.**
* **Format for Scannability:** Use emojis, short paragraphs, and bold text to make the post visually appealing and easy to digest.
### STEP 2: Select and Append Best Images
* **Select the 3-4 most suitable figures** that align with the **UNIFIED STRATEGY**.
* **Append ONLY the placeholders for these selected figures to the very end of the post.**
### STEP 3: Drive Engagement
* **Topic Tags (#):** Add a mix of broad and specific hashtags (e.g., `#AI`, `#Tech`, `#DataScience`, `#LLM`).
* **Call to Action (CTA):** End with a CTA that invites discussion from everyone (e.g., "This could change so much! What do you all think? 👇").
# --- AVAILABLE ASSETS ---
## 1. Structured Draft:
{blog_text}
## 2. Available Figures and Descriptions:
{items_list_str}
# --- FINAL OUTPUT ---
Your final output must be **only the complete, ready-to-publish post text, with the selected image placeholders at the end**.
"""
XIAOHONGSHU_TEXT_ONLY_PROMPT_ENGLISH = """
# ROLE: You are an expert tech content creator on Xiaohongshu. Your style is a perfect blend of a professional's "dry goods" (干货) and a science communicator's engaging storytelling.
# TASK: Transform the provided draft into a single, high-quality, **text-only** Xiaohongshu post that is valuable to BOTH industry professionals and curious tech enthusiasts. **DO NOT include image placeholders.**
# UNIFIED STRATEGY (Strictly Follow):
- **Title is an "Impactful Hook":** The title must be a compelling hook that also states the core, quantifiable achievement. E.g., "This AI paper is a must-read! 🤯 They boosted performance by 30% with one clever trick."
- **Narrative Structure with Clear Signposts:** Start with a story-like intro (the "why"). Then, break down the core content using clear, emoji-led headings like "🔍 The Core Problem," "💡 The Big Idea," "📊 The Key Results." This makes it scannable for professionals and easy to follow for enthusiasts.
- **Intuition-Building backed by Data:** Explain complex ideas using simple analogies, but immediately follow up with the key technical terms and performance metrics from the paper.
# YOUR STEP-BY-STEP EXECUTION PLAN
### STEP 1: Rewrite the Post Body
* **Create the Title and Body:** Rewrite the entire post following the **UNIFIED STRATEGY**.
* **Include Author Info:** After the title, you MUST include the author, paper title, and source details. **Ensure author names and institutions remain in their original English form.**
* **Format for Scannability:** Use emojis, short paragraphs, and bold text to make the post visually appealing and easy to digest.
### STEP 2: Drive Engagement
* **Topic Tags (#):** Add a mix of broad and specific hashtags (e.g., `#AI`, `#Tech`, `#DataScience`, `#LLM`).
* **Call to Action (CTA):** End with a CTA that invites discussion from everyone (e.g., "This could change so much! What do you all think? 👇").
# --- Structured Draft ---
{blog_text}
# --- FINAL OUTPUT ---
Your final output must be **only the complete, ready-to-publish text-only post**.
"""
XIAOHONGSHU_PROMPT_CHINESE = """
# 角色: 你是一位顶尖的小红书科技博主,完美融合了专业人士的“干货”分享与科普作家的生动叙事。
# 任务: 将提供的草稿,改编成一篇能同时吸引行业专家和科技爱好者的高质量小红书笔记。
# 统一策略 (必须严格遵守):
- **标题是“有冲击力的钩子”:** 标题必须既能激发好奇心,又包含核心的、可量化的成果。例如:“这篇AI论文必读!🤯一个巧思把性能提升30%”
- **带有清晰路标的叙事结构:** 以故事性的“为什么”开场,然后用清晰的、表情符号引导的标题(如 🔍核心问题, 💡天才想法, 📊关键结果)来拆解核心内容。这既方便专家快速浏览,也利于爱好者跟上思路。
- **数据支撑下的直觉建立:** 用简单的类比解释复杂概念,但紧接着必须给出论文中的关键技术术语和性能指标。
- **图片既要信息量大又要吸引人:** 选择的图片要清晰易懂,同时包含专家想看的关键数据或架构图。
# 你的执行步骤
### 第一步:重写笔记正文
* **创作标题和正文:** 严格遵循 **统一策略** 重写整个帖子。
* **包含作者信息:** 在标题后,**必须**包含作者、论文标题和来源等详细信息。**确保作者姓名和单位保留其原始英文格式。**
* **为易读性排版:** 大量使用表情符号、短段落和粗体,使笔记视觉上吸引人且易于消化。
### 第二步:挑选并附加最佳图片
* **挑选3-4张最符合统一策略的图片。**
* **只将这些被选中图片的占位符,附加到笔记的最后面。**
### 第三步:引导互动
* **话题标签:** 添加组合标签,既有宽泛的也有具体的 (例如: `#AI[话题]#`, `#黑科技[话题]#`, `#数据科学[话题]#`, `#大语言模型[话题]#`)。
* **行动号召:** 用一个能邀请所有人讨论的CTA结尾 (例如: “这个想法太妙了!大家怎么看?👇”)。
# --- 可用材料 ---
## 1. 结构化草稿:
{blog_text}
## 2. 可用图文及描述:
{items_list_str}
# --- 最终输出 ---
你的全部回应**只能**是最终的、可直接发布的帖子内容,最后附加上被选中的图片占位符。
"""
XIAOHONGSHU_TEXT_ONLY_PROMPT_CHINESE = """
# 角色: 你是一位顶尖的小红书科技博主,完美融合了专业人士的“干货”分享与科普作家的生动叙事。
# 任务: 将提供的草稿,改编成一篇能同时吸引行业专家和科技爱好者的高质量**纯文本**小红书笔记。**不要包含图片占位符。**
# 统一策略 (必须严格遵守):
- **标题是“有冲击力的钩子”:** 标题必须既能激发好奇心,又包含核心的、可量化的成果。例如:“这篇AI论文必读!🤯一个巧思把性能提升30%”
- **带有清晰路标的叙事结构:** 以故事性的“为什么”开场,然后用清晰的、表情符号引导的标题(如 🔍核心问题, 💡天才想法, 📊关键结果)来拆解核心内容。这既方便专家快速浏览,也利于爱好者跟上思路。
- **数据支撑下的直觉建立:** 用简单的类比解释复杂概念,但紧接着必须给出论文中的关键技术术语和性能指标。
# 你的执行步骤
### 第一步:重写笔记正文
* **创作标题和正文:** 严格遵循 **统一策略** 重写整个帖子。
* **包含作者信息:** 在标题后,**必须**包含作者、论文标题和来源等详细信息。**确保作者姓名和单位保留其原始英文格式。**
* **为易读性排版:** 大量使用表情符号、短段落和粗体,使笔记视觉上吸引人且易于消化。
### 第二步:引导互动
* **话题标签:** 添加组合标签,既有宽泛的也有具体的 (例如: `#AI[话题]#`, `#黑科技[话题]#`, `#数据科学[话题]#`, `#大语言模型[话题]#`)。
* **行动号召:** 用一个能邀请所有人讨论的CTA结尾 (例如: “这个想法太妙了!大家怎么看?👇”)。
# --- 结构化草稿 ---
{blog_text}
# --- 最终输出 ---
你的全部回应**只能**是最终的、可直接发布的**纯文本**帖子内容。
"""
# ==============================================================================
# --- NEW: BASELINE PROMPTS ---
# ==============================================================================
BASELINE_PROMPT_ENGLISH = """
# ROLE: You are a helpful assistant.
# TASK: Read the provided research paper text and write a brief social media post about it for the platform '{platform}'.
# RESEARCH PAPER TEXT:
---
{paper_text}
---
# YOUR SOCIAL MEDIA POST:
"""
BASELINE_PROMPT_CHINESE = """
# 角色: 你是一个乐于助人的助手。
# 任务: 阅读以下提供的论文文本,并为平台 '{platform}' 撰写一篇简短的社交媒体帖子。
# 论文文本:
---
{paper_text}
---
# 你的社交媒体帖子:
"""
GENERIC_RICH_PROMPT_ENGLISH = """
# ROLE: You are an AI assistant.
# TASK: Rewrite the following structured draft into a simple and clear social media post.
- The post should be easy for a general audience to understand.
- If figures are provided, integrate them into the text where they seem most relevant using the format `[FIGURE_PLACEHOLDER_X]`, where X is the figure number.
- Your output must be ONLY the final text for the post.
# EXISTING BLOG POST TEXT (to be rewritten):
---
{blog_text}
---
# AVAILABLE FIGURES AND DESCRIPTIONS:
---
{items_list_str}
---
"""
GENERIC_TEXT_ONLY_PROMPT_ENGLISH = """
# ROLE: You are an AI assistant.
# TASK: Rewrite the following structured draft into a simple, clear, text-only social media post.
- The post should be easy for a general audience to understand.
- Your output must be ONLY the final text for the post.
# EXISTING BLOG POST TEXT (to be rewritten):
---
{blog_text}
---
"""
GENERIC_RICH_PROMPT_CHINESE = """
# 角色: 你是一个AI助手。
# 任务: 将以下结构化草稿,改写成一篇简单、清晰的社交媒体帖子。
- 帖子内容应便于普通读者理解。
- 如果提供了图表信息,请在文本中最相关的位置使用 `[FIGURE_PLACEHOLDER_X]` 格式来引用它们,X是图表编号。
- 你的输出必须只有最终的帖子文本。
# 现有博客草稿 (待改写):
---
{blog_text}
---
# 可用图表及描述:
---
{items_list_str}
---
"""
GENERIC_TEXT_ONLY_PROMPT_CHINESE = """
# 角色: 你是一个AI助手。
# 任务: 将以下结构化草稿,改写成一篇简单、清晰的纯文本社交媒体帖子。
- 帖子内容应便于普通读者理解。
- 你的输出必须只有最终的帖子文本。
# 现有博客草稿 (待改写):
---
{blog_text}
---
"""
BASELINE_FEWSHOT_PROMPT_ENGLISH = """
# ROLE: You are a helpful assistant.
# TASK: Read the provided example and write a academic promotion social media post about it for the platform '{platform}'. Follow the example provided.
# --- EXAMPLE ---
## PLATFORM: Twitter
## Example:
I’m stoked to share our new paper: “Harnessing the Universal Geometry of Embeddings” with @jxmnop
, Collin Zhang, and @shmatikov.
We present the first method to translate text embeddings across different spaces without any paired data or encoders.
Here's why we're excited: 🧵👇🏾
--------------------------------------------------------------------------
🌀 Preserving Geometry
Our method, vec2vec, reveals that all encoders—regardless of architecture or training data—learn nearly the same representations!
We demonstrate how to translate between these black-box embeddings without any paired data, maintaining high fidelity.
--------------------------------------------------------------------------
🔐 Security Implications
Using vec2vec, we show that vector databases reveal (almost) as much as their inputs.
Given just vectors (e.g., from a compromised vector database), we show that an adversary can extract sensitive information (e.g., PII) about the underlying text.
--------------------------------------------------------------------------
🧠 Strong Platonic Representation Hypothesis (S-PRH)
We thus strengthen Huh et al.'s PRH to say:
The universal latent structure of text representations can be learned and harnessed to translate representations from one space to another without any paired data or encoders.
--------------------------------------------------------------------------
📄 Read the Full Paper
Dive into the details here: https://arxiv.org/pdf/2505.12540
We welcome feedback and discussion!
---
# --- YOUR TASK ---
# RESEARCH PAPER TEXT:
---
{paper_text}
---
# YOUR SOCIAL MEDIA POST:
"""
BASELINE_FEWSHOT_PROMPT_CHINESE = """
# 角色: 你是一个乐于助人的助手。
# 任务: 阅读以下提供的例子,并为平台 '{platform}' 撰写一篇宣传论文的社交媒体帖子。请参考范例。
# --- 范例 ---
## 平台: 小红书
## 范例:
🌐arXiv ID: arXiv:2504.15965
📚论文标题: From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
🔍 问题背景:传统大型语言模型(LLM)在处理信息时,存在明显的局限性,尤其是在处理长文本和保持上下文连贯性方面。这些局限性限制了LLM在更广泛和复杂的任务中的应用,比如多步骤推理、个性化对话和长周期任务管理。现有的研究虽然提供了一些解决方案,但大多数只从时间维度分析了记忆机制,这显然不够全面。
💡 研究动机:为了克服当前记忆机制的局限,研究团队提出了一种新的记忆分类法,基于对象(个人和系统)、形式(参数和非参数)和时间(短期和长期)三个维度,以及八个象限来进行系统性的分类和分析。这一分类法旨在更好地理解LLM驱动的AI系统中的记忆机制,并借鉴人类记忆的研究成果,构建更高效的记忆系统。
🚀 方法简介:本文提出的3D-8Q记忆分类法,不仅涵盖了个人记忆和系统记忆,还详细分析了记忆的形式和时间特性。通过这种方法,研究团队能够更系统地组织现有的研究工作,为未来的记忆机制设计提供指导。
📊 实验设计:研究团队在多个公开数据集上进行了实验,验证了3D-8Q记忆分类法的有效性。实验结果显示,通过这种分类法优化的记忆系统在多步骤推理、个性化对话和长周期任务管理等复杂任务中表现出了显著的性能提升。
#LLM[话题]# #RAG[话题]# #agent[话题]# #multimodal[话题]# #大模型[话题]# #检索增强[话题]# #多模态[话题]#
---
# --- 你的任务 ---
# 论文文本:
---
{paper_text}
---
# 你的社交媒体帖子:
""" |