Space: yellowtown/easybib

yellowtown committed on Oct 7, 2024

Commit

d13100b

1 Parent(s): 5b8cd1f

🎉 init(v0.2):

Files changed (6)

.gitignore +2 -0
README.md +17 -1
app.py +64 -0
assets/example.jpg +0 -0
requirements.txt +2 -0
test.py +9 -0

.gitignore ADDED

@@ -0,0 +1,2 @@
+flagged
+**/__pycache__/

README.md CHANGED

@@ -11,4 +11,20 @@ license: apache-2.0
 short_description: paper title => bib tex
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

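+# EasyBib
+A simple tool that converts paper titles into BibTeX entries.
+![Example run](assets/example.jpg)
+Running it starts a Gradio app. Enter one paper title per line; the output contains the BibTeX entry for each title, with a blank line between the entries for different papers.
+## Changelog
+### v0.2
+* Added a retry mechanism. Semantic Scholar limits all anonymous API requests to a combined 1,000 requests per second, so requests can fail during use. Retrying works around this for now, at the cost of longer run times.
+### v0.1
+* Support looking up multiple papers at once, one title per line.
+## TODO
+- [ ] Support a personal API key
+- [ ] Automatically abbreviate conference names, e.g. 'International Conference on Learning Representations' ==> 'ICLR'
+- [ ] Speed up multi-paper lookups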
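
Note on the conference-name TODO above: one possible shape for it is a plain lookup table applied to the returned BibTeX string. This is only a sketch and not part of the commit. The names `VENUE_ABBREVIATIONS` and `abbreviate_venues` are hypothetical, and the table holds only the single ICLR example the README gives.

```python
import re

# Hypothetical mapping; the README names only the ICLR example.
VENUE_ABBREVIATIONS = {
    "International Conference on Learning Representations": "ICLR",
}


def abbreviate_venues(bibtex, mapping=VENUE_ABBREVIATIONS):
    """Replace full venue names in a BibTeX string with their abbreviations."""
    for full_name, short_name in mapping.items():
        # Literal, case-insensitive match. The lambda keeps the replacement
        # text from being interpreted as a regex template.
        pattern = re.compile(re.escape(full_name), flags=re.IGNORECASE)
        bibtex = pattern.sub(lambda _match, s=short_name: s, bibtex)
    return bibtex


if __name__ == "__main__":
    entry = "@inproceedings{x, booktitle={International Conference on Learning Representations}}"
    print(abbreviate_venues(entry))
```

A table like this would have to be maintained by hand; entries with unlisted venues pass through unchanged.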

app.py ADDED

@@ -0,0 +1,64 @@

+#!/usr/bin/env python3
+# -*- coding: utf-8 -*-
+import requests
+import gradio as gr
+def get_bibtext_from_title(title, retry_times=100):
+    # URL-encode the query parameter
+    encoded_query = requests.utils.quote(title)
+    print(f'query: {encoded_query}')
+    # Semantic Scholar Graph API paper search endpoint
+    url = f"https://api.semanticscholar.org/graph/v1/paper/search?query={encoded_query}&offset=0&limit=3&fields=title,citationStyles"
+    for i in range(retry_times):
+        # Send the GET request
+        response = requests.get(url)
+        # Check whether the request succeeded
+        if response.status_code == 200:
+            # On success, return the BibTeX of the top search result
+            try:
+                data = response.json()['data']
+                bibtex = data[0]['citationStyles']['bibtex']
+                return bibtex
+            except (KeyError, IndexError, TypeError, ValueError):
+                # No results, or a response without the expected fields
+                msg = f"Failed to parse response: {response.text}"
+                return msg
+        elif response.status_code == 429:
+            # Rate limited: try again
+            print(f'rate limited, retry {i + 1}/{retry_times}')
+            continue
+        else:
+            # Any other status: return an error message
+            msg = f"Failed to retrieve data: {response.text}"
+            return msg
+    # Every attempt was rate limited; process_text treats None as a failure
+    return None
+def process_text(input_text):
+    titles = input_text.split('\n')
+    # Look up each non-empty line as a paper title and collect the results.
+    output = []
+    for title in titles:
+        title = title.strip()
+        if not title:
+            continue
+        bibtex = get_bibtext_from_title(title)
+        print(bibtex)
+        if bibtex is not None:
+            output.append(bibtex)
+        else:
+            output.append("Failed to process: " + title + '\n')
+    return '\n'.join(output)
+# Build the Gradio interface
+iface = gr.Interface(
+    fn=process_text,                 # handler function to call
+    inputs=gr.Textbox(lines=10, placeholder="Enter paper titles here..."),  # input component: multi-line text box
+    outputs=gr.Textbox(lines=20, placeholder="The BibTeX output will appear here..."), # output component: multi-line text box
+)
+if __name__ == "__main__":
+    # Launch the interface
+    iface.launch()
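
Note on the retry loop in `get_bibtext_from_title`: the commit retries immediately on HTTP 429. A common alternative is to wait a little longer after each rate-limited attempt (exponential backoff). The sketch below is not the commit's method. `retry_on_rate_limit`, the `fetch` callable, and the delay values are all assumptions chosen for illustration, and the HTTP call is injected so the logic can run without network access.

```python
import time


def retry_on_rate_limit(fetch, max_retries=5, base_delay=0.5, max_delay=8.0,
                        sleep=time.sleep):
    """Call fetch() until it stops returning HTTP 429, waiting longer each time.

    fetch is a hypothetical callable returning (status_code, body).
    Returns the body of the first non-429 response, or None if every
    attempt was rate limited.
    """
    for attempt in range(max_retries):
        status, body = fetch()
        if status != 429:
            return body
        if attempt < max_retries - 1:
            # Exponential backoff: 0.5s, 1s, 2s, ... capped at max_delay.
            sleep(min(base_delay * (2 ** attempt), max_delay))
    return None


if __name__ == "__main__":
    # Simulated responses: two rate-limited attempts, then a success.
    responses = iter([(429, ""), (429, ""), (200, "@article{demo}")])
    print(retry_on_rate_limit(lambda: next(responses), sleep=lambda s: None))
```

In `app.py`, `fetch` could wrap `requests.get(url)` and return `(response.status_code, response.text)`; the default of 100 immediate retries would then likely need to be lowered.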

assets/example.jpg ADDED

requirements.txt ADDED

@@ -0,0 +1,2 @@
+gradio
+requests

test.py ADDED

@@ -0,0 +1,9 @@

+#!/usr/bin/env python3
+# -*- coding: utf-8 -*-
+from app import process_text
+if __name__ == "__main__":
+    output = process_text("Language Models are Few-Shot Learners")
+    print(output)
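
Note on `test.py`: as committed, it calls the live Semantic Scholar API, so its result depends on network access and rate limits. A possible offline check of the line-splitting and joining behavior is sketched below. `process_titles` is a hypothetical variant of `process_text` that takes the lookup function as a parameter, and `fake_lookup` with its two entries is a made-up stub, not real API output.

```python
def process_titles(input_text, lookup):
    """Hypothetical variant of process_text with the lookup passed in."""
    output = []
    for title in input_text.splitlines():
        title = title.strip()
        if not title:
            # Skip blank lines between titles.
            continue
        bibtex = lookup(title)
        if bibtex is not None:
            output.append(bibtex)
        else:
            output.append("Failed to process: " + title + "\n")
    return "\n".join(output)


def fake_lookup(title):
    """Stub standing in for the Semantic Scholar request (made-up entries)."""
    known = {"Paper A": "@article{a}\n", "Paper B": "@article{b}\n"}
    return known.get(title)


if __name__ == "__main__":
    print(process_titles("Paper A\n\nPaper B\nUnknown", fake_lookup))
```

Passing the lookup in keeps the text handling testable on its own, while the Gradio app could still supply `get_bibtext_from_title` as the real lookup.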