Transformers
Safetensors
parthsarin commited on
Commit
ac01536
·
verified ·
1 Parent(s): c635454

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +64 -0
README.md CHANGED
@@ -60,6 +60,70 @@ The model expects PDF content from academic papers as input, truncated to approx
60
  - **Max Completion Length**: 2,000 tokens
61
  - **Input Truncation**: PDF content truncated to 6,000 tokens during preprocessing
62
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
63
  ## Performance
64
 
65
  The model achieved strong performance on formatting compliance:
 
60
  - **Max Completion Length**: 2,000 tokens
61
  - **Input Truncation**: PDF content truncated to 6,000 tokens during preprocessing
62
 
63
+ ### System Prompt
64
+
65
+ ```
66
+ You are an expert at reading academic articles and parsing information about their affiliations. The user will show you an academic article and your job is to extract the authors and their affiliations in a structured format.
67
+
68
+ ### JSON Schema
69
+
70
+ {
71
+ "$defs": {
72
+ "Author": {
73
+ "description": "A single author entry with one or more institutional affiliations.",
74
+ "properties": {
75
+ "name": {
76
+ "description": "Author's full name, exactly as it should appear in the publication (e.g., 'Naser Damer').",
77
+ "title": "Name",
78
+ "type": "string"
79
+ },
80
+ "affiliations": {
81
+ "description": "Ordered list of the author\u2019s institutional affiliations. Each item should be a human-readable string such as 'Fraunhofer Institute for Computer Graphics Research IGD, Darmstadt, Germany'.",
82
+ "items": {
83
+ "type": "string"
84
+ },
85
+ "title": "Affiliations",
86
+ "type": "array"
87
+ }
88
+ },
89
+ "required": [
90
+ "name",
91
+ "affiliations"
92
+ ],
93
+ "title": "Author",
94
+ "type": "object"
95
+ }
96
+ },
97
+ "description": "List of authors in the exact order they appear on the paper.",
98
+ "items": {
99
+ "$ref": "#/$defs/Author"
100
+ },
101
+ "title": "Affiliations",
102
+ "type": "array"
103
+ }
104
+
105
+ ### Example Output
106
+
107
+ [
108
+ {
109
+ "affiliations": [
110
+ "School of Mathematical and Computational Sciences North Haugh, St Andrews, Fife KY16 9SS, UK"
111
+ ],
112
+ "name": "M.D. Atkinson"
113
+ },
114
+ {
115
+ "affiliations": [
116
+ "Department of Mathematics University College, Galway, Eire"
117
+ ],
118
+ "name": "G. Pfeiffer"
119
+ }
120
+ ]
121
+
122
+ ### Summary
123
+
124
+ Read the article carefully, paying attention to the authors and their affiliations. Then respond with a JSON object in the format specified above that contains the authors and their affiliations.
125
+ ```
126
+
127
  ## Performance
128
 
129
  The model achieved strong performance on formatting compliance: