<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>Research Paper Metadata Database</title>
    <script src="https://cdn.tailwindcss.com"></script>
    <style>
        .gradient-bg {
            background: linear-gradient(135deg, #3b82f6 0%, #6366f1 100%);
        }
        .card-hover {
            transition: transform 0.3s ease, box-shadow 0.3s ease;
        }
        .card-hover:hover {
            transform: translateY(-4px);
            box-shadow: 0 20px 40px rgba(0,0,0,0.15);
        }
    </style>
</head>
<body class="bg-gray-50">
    <!-- Header -->
    <header class="gradient-bg text-white">
        <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-16">
            <div class="text-center">
                <div class="text-6xl mb-4">📚</div>
                <h1 class="text-5xl font-bold mb-4">Research Paper Metadata Database</h1>
                <p class="text-xl opacity-90 mb-6">Centralized Metadata Repository for Scientific Research</p>
                <p class="text-lg opacity-75 max-w-3xl mx-auto mb-6">
                    A structured metadata repository designed to support AI-powered visualization and analysis
                    of the structure of scientific research, with the goal of expanding research in interesting, useful, and practical ways.
                </p>
                <div class="mt-6">
                    <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/papers-database-table.html" 
                       target="_blank" 
                       rel="noopener noreferrer"
                       class="inline-block bg-white text-blue-600 font-semibold px-8 py-4 rounded-lg hover:bg-blue-50 transition-colors shadow-lg hover:shadow-xl">
                        🚀 View Working Prototype →
                    </a>
                </div>
            </div>
        </div>
    </header>

    <!-- Prior Work & Research Contributions -->
    <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
        <div class="bg-gradient-to-r from-blue-50 to-indigo-50 rounded-xl shadow-lg p-8 mb-8">
            <h2 class="text-3xl font-bold text-gray-900 mb-6">📚 Prior Work & Research Contributions</h2>
            
            <div class="bg-white rounded-lg p-6 mb-6">
                <h3 class="text-xl font-semibold text-gray-900 mb-4">Overview</h3>
                <p class="text-gray-700 mb-4">
                    The Research Paper Metadata Database is <strong>prior work</strong> demonstrating a structured
                    metadata repository for scientific research papers. It lays a foundation for using AI tools to visualize
                    and analyze the structure of scientific research, enabling systematic exploration of research patterns, citation networks,
                    and interdisciplinary connections.
                </p>
            </div>

            <div class="grid md:grid-cols-2 gap-6 mb-6">
                <div class="bg-white rounded-lg p-6">
                    <h3 class="text-lg font-semibold text-gray-900 mb-3">🔬 Research Contributions</h3>
                    <ul class="text-sm text-gray-700 space-y-2">
                        <li>• <strong>Structured Metadata Repository:</strong> Centralized database of research paper metadata</li>
                        <li>• <strong>AI-Powered Preprocessing:</strong> LLM-based entity extraction and annotation</li>
                        <li>• <strong>Citation Network Analysis:</strong> Cross-reference linking and relationship mapping</li>
                        <li>• <strong>Integration Framework:</strong> Designed for CopernicusAI Knowledge Engine integration</li>
                    </ul>
                </div>
                
                <div class="bg-white rounded-lg p-6">
                    <h3 class="text-lg font-semibold text-gray-900 mb-3">⚙️ Technical Achievements</h3>
                    <ul class="text-sm text-gray-700 space-y-2">
                        <li>• <strong>JSON-Based Storage:</strong> Structured metadata format for programmatic access</li>
                        <li>• <strong>Entity Extraction:</strong> Automated extraction of genes, proteins, compounds, equations</li>
                        <li>• <strong>Quality Assessment:</strong> Automated quality scoring and relevance metrics</li>
                        <li>• <strong>API Architecture:</strong> RESTful API design for external access</li>
                    </ul>
                </div>
            </div>

            <div class="bg-white rounded-lg p-6">
                <h3 class="text-lg font-semibold text-gray-900 mb-3">🎯 Position Within CopernicusAI Knowledge Engine</h3>
                <p class="text-gray-700 mb-3">
                    The Research Paper Metadata Database serves as a <strong>core data infrastructure component</strong> of the CopernicusAI Knowledge Engine, providing:
                </p>
                <div class="grid md:grid-cols-2 gap-4 text-sm mb-3">
                    <ul class="text-gray-700 space-y-1">
                        <li>• Foundation for knowledge graph construction</li>
                        <li>• Integration with AI podcast generation</li>
                        <li>• Support for GLMP source references</li>
                    </ul>
                    <ul class="text-gray-700 space-y-1">
                        <li>• Science Video Database integration</li>
                        <li>• Programming Framework support</li>
                    </ul>
                </div>
                <p class="text-gray-600 text-sm italic">
                    This work establishes a proof-of-concept for AI-assisted research metadata management, demonstrating how structured data 
                    can enable systematic analysis and visualization of scientific research patterns.
                </p>
            </div>
        </div>
    </section>

    <!-- Project Goals -->
    <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
        <div class="bg-white rounded-xl shadow-lg p-8">
            <h2 class="text-3xl font-bold text-gray-900 mb-6">🎯 Project Goals</h2>
            <p class="text-gray-700 mb-4">
                This project creates a database of scientific research paper metadata for the purpose of:
            </p>
            <ul class="text-gray-700 space-y-2">
                <li>• Using AI tools to visualize and analyze the structure of scientific research</li>
                <li>• Expanding research in interesting, useful, and practical ways</li>
                <li>• Enabling systematic exploration of research patterns and connections</li>
                <li>• Supporting knowledge graph construction and semantic search</li>
            </ul>
        </div>
    </section>

    <!-- Technical Architecture -->
    <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
        <div class="bg-white rounded-xl shadow-lg p-8">
            <h2 class="text-3xl font-bold text-gray-900 mb-6">🔧 Technical Architecture</h2>
            
            <div class="grid md:grid-cols-3 gap-6">
                <div>
                    <h3 class="text-lg font-semibold text-gray-800 mb-3">Metadata Structure</h3>
                    <ul class="text-sm text-gray-600 space-y-1">
                        <li>• DOI, arXiv ID, publication info</li>
                        <li>• Abstracts & key findings</li>
                        <li>• Extracted entities</li>
                        <li>• Citation networks</li>
                        <li>• Paradigm shift indicators</li>
                        <li>• Quality scores</li>
                    </ul>
                </div>
                
                <div>
                    <h3 class="text-lg font-semibold text-gray-800 mb-3">AI-Powered Preprocessing</h3>
                    <ul class="text-sm text-gray-600 space-y-1">
                        <li>• LLM-based entity extraction</li>
                        <li>• Automatic categorization</li>
                        <li>• Keyword extraction</li>
                        <li>• Citation tracking</li>
                        <li>• Quality assessment</li>
                    </ul>
                </div>
                
                <div>
                    <h3 class="text-lg font-semibold text-gray-800 mb-3">Integration Features</h3>
                    <ul class="text-sm text-gray-600 space-y-1">
                        <li>• DOI/arXiv ID resolution</li>
                        <li>• Cross-reference linking</li>
                        <li>• Podcast-to-paper tracking</li>
                        <li>• Search & query capabilities</li>
                        <li>• API access</li>
                    </ul>
                </div>
            </div>
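
            <div class="mt-8 bg-gray-50 rounded-lg p-6">
                <h3 class="text-lg font-semibold text-gray-800 mb-3">Illustrative Record Shape</h3>
                <p class="text-sm text-gray-600 mb-3">
                    As a rough illustration, the metadata fields listed above could be modeled as a single JSON record per paper.
                    The TypeScript declarations below are a minimal sketch under that assumption, not the project's actual schema:
                    every type name, field name, and value range shown is hypothetical and chosen only for illustration.
                </p>
                <pre class="text-sm text-gray-800 overflow-x-auto"><code>// Hypothetical shape of one metadata record (all names are assumptions).
interface PaperMetadata {
  doi?: string;              // DOI, when the paper has one
  arxivId?: string;          // arXiv ID, when the paper has one
  title: string;
  authors: string[];
  publishedDate?: string;    // ISO 8601 date string
  abstract: string;
  keyFindings: string[];
  entities: ExtractedEntity[];
  cites: string[];           // identifiers (DOI or arXiv ID) of cited papers
  paradigmShiftIndicators: string[];
  qualityScore?: number;     // assumed here to lie in [0, 1]
}

// Entity kinds taken from the entity types named on this page.
type EntityKind = "gene" | "protein" | "compound" | "equation";

interface ExtractedEntity {
  kind: EntityKind;
  text: string;              // the entity as it appears in the paper
}</code></pre>
                <p class="text-sm text-gray-600 mt-3">
                    Both identifiers are optional in this sketch because a paper may carry a DOI, an arXiv ID, or both;
                    a real schema would need a rule for which one serves as the primary key.
                </p>
            </div>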
        </div>
    </section>

    <!-- Related Projects -->
    <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
        <h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">🔗 Related Projects</h2>
        
        <div class="grid md:grid-cols-2 gap-6">
            <div class="bg-white rounded-lg shadow-md p-6 card-hover">
                <h3 class="text-xl font-semibold text-gray-900 mb-3">🔬 CopernicusAI</h3>
                <p class="text-gray-600 mb-4">
                    Main knowledge engine integrating metadata with AI podcasts and research synthesis.
                </p>
                <a href="https://huggingface.co/spaces/garywelz/copernicusai" 
                   class="text-blue-600 hover:text-blue-700 font-semibold"
                   target="_blank" rel="noopener noreferrer">
                    Visit CopernicusAI →
                </a>
            </div>

            <div class="bg-white rounded-lg shadow-md p-6 card-hover">
                <h3 class="text-xl font-semibold text-gray-900 mb-3">🧬 GLMP</h3>
                <p class="text-gray-600 mb-4">
                    Genome Logic Modeling Project using metadata for source paper references.
                </p>
                <a href="https://huggingface.co/spaces/garywelz/glmp" 
                   class="text-blue-600 hover:text-blue-700 font-semibold"
                   target="_blank" rel="noopener noreferrer">
                    Explore GLMP →
                </a>
            </div>

            <div class="bg-white rounded-lg shadow-md p-6 card-hover">
                <h3 class="text-xl font-semibold text-gray-900 mb-3">🛠️ Programming Framework</h3>
                <p class="text-gray-600 mb-4">
                    Universal process analysis tool that can utilize metadata for research analysis.
                </p>
                <a href="https://huggingface.co/spaces/garywelz/programming_framework" 
                   class="text-blue-600 hover:text-blue-700 font-semibold"
                   target="_blank" rel="noopener noreferrer">
                    Explore Framework →
                </a>
            </div>

            <div class="bg-white rounded-lg shadow-md p-6 card-hover">
                <h3 class="text-xl font-semibold text-gray-900 mb-3">🎬 Science Video Database</h3>
                <p class="text-gray-600 mb-4">
                    Video content management with potential metadata linking.
                </p>
                <a href="https://huggingface.co/spaces/garywelz/sciencevideodb" 
                   class="text-blue-600 hover:text-blue-700 font-semibold"
                   target="_blank" rel="noopener noreferrer">
                    Visit Video Database →
                </a>
            </div>
        </div>
    </section>

    <!-- How to Cite This Work -->
    <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
        <div class="bg-white rounded-xl shadow-lg p-8">
            <h2 class="text-3xl font-bold text-gray-900 mb-6">How to Cite This Work</h2>
            <div class="bg-gray-50 rounded-lg p-6 mb-4">
                <p class="text-gray-800 font-mono text-lg leading-relaxed mb-4">
                    Welz, G. (2024–2025). <em>Research Paper Metadata Database</em>.<br>
                    Hugging Face Spaces. https://huggingface.co/spaces/garywelz/metadata_database
                </p>
            </div>
            <div class="bg-blue-50 rounded-lg p-4">
                <p class="text-gray-700 mb-2">
                    This project serves as infrastructure for AI-assisted research analysis, enabling systematic visualization and exploration of scientific research patterns through structured metadata management.
                </p>
                <p class="text-gray-700 font-semibold">
                    Within the CopernicusAI Knowledge Engine, it provides the foundational data layer for knowledge graph construction and semantic search.
                </p>
            </div>
        </div>
    </section>

    <!-- Footer -->
    <footer class="gradient-bg text-white py-8 mt-12">
        <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 text-center">
            <p class="text-lg font-semibold mb-2">Research Paper Metadata Database</p>
            <p class="text-sm opacity-75">Part of the CopernicusAI Knowledge Engine</p>
            <p class="text-xs opacity-50 mt-4">&copy; 2025 Gary Welz. All rights reserved.</p>
        </div>
    </footer>
</body>
</html>