fengxb30 commited on
Commit
56db8f3
·
verified ·
1 Parent(s): 877acbb

Delete README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -199
README.md DELETED
@@ -1,199 +0,0 @@
1
- ## FinAI Contest Task 2 - FinGPT-Powered Compliance Agents
2
-
3
- ### 🧠 Task Overview
4
-
5
- This task focuses on developing FinGPT agents for financial compliance and regulatory scenarios. Participants will build LLM-based agents capable of handling SEC filings analysis, regulatory compliance, sentiment analysis, antitrust reasoning, patent analysis, and financial audio processing. We encourage participants to use LoRA and reinforcement fine-tuning.
6
-
7
- - **SEC Filings Analysis**: Analyze and extract insights from SEC filings, including XBRL data processing, financial statement Q&A, and mathematical reasoning.
8
- - **Regulatory Compliance**: Handle real-time financial data retrieval, sentiment analysis, antitrust and copyright reasoning, patent and IP protection analysis.
9
- - **Multimodal Processing**: Process various data types including text, structured data, and audio for comprehensive financial analysis.
10
-
11
- #### 🎯 Objective
12
-
13
- Your primary goal is to fine-tune or train a language model for financial compliance and regulatory tasks, covering SEC filings analysis, regulatory compliance, sentiment analysis, antitrust reasoning, patent analysis, and financial audio processing. You may also enhance your agent by integrating external tools, such as a retrieval-augmented knowledge base (RAG), to improve its analytical and question-answering capabilities.
14
-
15
- #### 💡 What You Need To Do
16
-
17
- 1. **Collect and Prepare Your Raw Training Data**
18
- Participants need to collect raw data given the sources provided below.
19
-
20
- 2. **Develop FinGPT Compliance Agents**
21
- Use your collected data to fine-tune your own LLM for financial compliance and regulatory tasks. You can use FinGPT framework to fine-tune your model. We encourage participants to use LoRA and reinforcement fine-tuning. You can also enhance your agent by integrating external tools, such as RAG. You can view [**FinLoRA documentation**](https://finlora-docs.readthedocs.io/en/latest/index.html) to learn more about LoRA and some financial tasks.
22
-
23
- 3. **Submit Your Agent**
24
- Submit your agent following the competition guidelines. Make sure your model is:
25
-
26
- - Capable of analyzing SEC filings and extracting structured information.
27
- - Proficient in regulatory compliance reasoning and analysis.
28
- - Robust in interpreting multimodal financial data and reasoning over it.
29
-
30
- 4. **Benchmarking Phase**
31
- After submission, we will use our question sets to evaluate your model's performance across SEC filings analysis and regulatory compliance tasks.
32
-
33
- ---
34
-
35
- ### 📊 Question Set Overview
36
-
37
- These question sets contain question-answer pairs collected and organized for evaluating model capabilities across SEC filings analysis and regulatory compliance tasks. These question sets are sampled from the test split of the datasets, which are used to benchmark your agent's performance. You **SHOULD NOT** use it or the entire test split for fine-tuning or training.
38
-
39
- #### SEC Filings Analysis
40
-
41
- | **Task** | **Dataset** | **Size** | **Metrics** | **Description** | **Source** |
42
- | -------- | ----------- | -------- | ----------- | --------------- | ----------|
43
- | Financial Q&A | FinanceBench | 150 | BERTScore | Open-book financial Q&A on company filings based on OCR-processed annual reports. | https://huggingface.co/datasets/PatronusAI/financebench |
44
- | XBRL Tag Extraction | XBRL Analysis | 1k | Accuracy, F1-Score | Extract specific XBRL tags from raw XBRL text segments given natural language descriptions. | https://huggingface.co/datasets/wangd12/XBRL_analysis |
45
- | XBRL Value Extraction | XBRL Analysis | 12k | Accuracy, F1-Score | Extract numeric values from XBRL text segments given natural language descriptions. | https://huggingface.co/datasets/wangd12/XBRL_analysis |
46
- | XBRL Formula Construction | XBRL Analysis | 1k | Accuracy, F1-Score | Select relevant facts and tags from XBRL data and construct standard financial formulas. | https://huggingface.co/datasets/wangd12/XBRL_analysis |
47
- | XBRL Formula Calculation | XBRL Analysis | 1k | Accuracy, F1-Score | Substitute actual values into constructed formulas and compute final results. | https://huggingface.co/datasets/wangd12/XBRL_analysis |
48
- | General Mathematics | Math Problems | 1k | Accuracy | Solve general mathematical problems related to ratio calculation and algebra. | https://github.com/Open-Finance-Lab/SecureFinAI_Contest_2025/tree/main/Task_2_FinGPT_Powered_Compliance_Agents |
49
- | Numerical Entity Identification | FinNI | TBD | Accuracy, F1-Score | Identify and extract numerical entities from 10K annual reports for financial analysis. | https://github.com/The-FinAI/FinTagging |
50
- | Concept Mapping/Tagging | FinCL | TBD | Accuracy, F1-Score | Map financial concepts to appropriate tags using retrieval+rerank or classification approaches on 10K annual reports. | https://github.com/The-FinAI/FinTagging |
51
- | Semantic Inconsistency Detection | FinSM | TBD | Precision@K, Recall@K | Detect semantic inconsistencies in XBRL filings through information retrieval methods. | https://github.com/The-FinAI/FinAuditing |
52
- | Relation Inconsistency Detection | FinRE | TBD | Accuracy, F1-Score | Classify relation error types (Reversal, Inappropriateness, CombinationErr) in XBRL filings. | https://github.com/The-FinAI/FinAuditing |
53
- | Mathematical Reasoning | FinMR | TBD | Accuracy | Extract XBRL report element values and calculate true values through mathematical reasoning. | https://github.com/The-FinAI/FinAuditing |
54
-
55
- #### Regulatory Compliance
56
-
57
- | **Task** | **Dataset** | **Size** | **Metrics** | **Description** | **Source** |
58
- | -------- | ----------- | -------- | ----------- | --------------- |-----------------------------|
59
- | Financial Data Retrieval | Real-time Data | 331 | Accuracy, F1-Score | Real-time retrieval from active web pages and open-domain search on company financials. | Yahoo Finance and Bloomberg |
60
- | Sentiment Analysis | Financial Sentiment | 4.8k | Accuracy, F1-Score | Aspect-specific sentiment classification for financial texts (news, social media, transcripts, ESG, macro). | BloombergGPT FPB, FiQA SA |
61
- | Financial Audio | FinAudio | 5k | Word Error Rate | Automatic speech recognition for financial audio content. | SPGISpeech [Download Link] [SPGISpeech - Download](https://drive.google.com/file/d/1yP4RB8ThVuBmpz-oaV1KyonUFZD71ls5/view?usp=sharing) |
62
-
63
- We will sample questions from the test split for each dataset for our evaluation.
64
-
65
- ---
66
-
67
- ### 📁 Data Sources and Collection
68
-
69
- #### 📥 1. SEC Filings Data
70
-
71
- You can manually retrieve XBRL filings for individual companies via the U.S. Securities and Exchange Commission (SEC):
72
-
73
- 1. Visit the [SEC EDGAR Company Search](https://www.sec.gov/edgar/searchedgar/companysearch).
74
- 2. Search by company name or ticker symbol.
75
- 3. Filter by filing types such as 10-K, 10-Q, etc.
76
- 4. Click on a specific filing.
77
- 5. Look for files with extensions like:
78
- - `.xml`
79
- - `.xsd`
80
- - `.xbrl`
81
- - or links labeled "Interactive Data".
82
- 6. Download the corresponding XBRL instance and taxonomy files.
83
-
84
- > 💡 This method is ideal for collecting filings from specific companies or filing types in a controlled manner.
85
-
86
- #### ⚙️ 2. XBRL Terminology & Standards
87
-
88
- As a starting point, you may also use the provided web crawling script to automate the retrieval of XBRL-related documents from the [XBRL International Glossary](https://www.xbrl.org/guidance/xbrl-glossary/). This source provides standardized definitions and explanations of XBRL terms.
89
-
90
- - 📎 Provided Code: [xbrl_webcrawl.ipynb](./xbrl_webcrawl.ipynb)
91
-
92
- This script offers a basic template to:
93
-
94
- - Scrape and parse glossary terms.
95
- - Crawl linked resources or downloadable attachments related to XBRL filings.
96
- - Extend it further for large-scale automated crawling from additional sources.
97
-
98
- > 💡 This data helps build XBRL term comprehension tasks, enabling models to understand and explain technical terms used in filings.
99
-
100
- #### 📊 3. Financial Data APIs
101
-
102
- For real-time financial data retrieval tasks, participants can utilize various financial data APIs and web sources:
103
-
104
- - Yahoo Finance API
105
- - Alpha Vantage API
106
- - Financial news websites
107
- - Company investor relations pages
108
-
109
- #### 🎵 4. Financial Audio Data
110
-
111
- For the FinAudio task, participants could collect financial audio content such as:
112
-
113
- - Earnings call recordings
114
- - Financial news broadcasts
115
- - Investor presentations
116
- - Financial podcasts
117
-
118
- #### 📊 5. FinTagging Data
119
-
120
- For the FinTagging tasks (FinNI and FinCL), participants can access the datasets from:
121
-
122
- - **Data Source**: 10K Annual Reports
123
- - **Repository**: https://github.com/The-FinAI/FinTagging
124
- - **Training Data**: Use the trainset for model training
125
- - **Test Data**: Use the subset for challenge evaluation
126
- - **Tasks**:
127
- - **FinNI**: Numerical entity identification from financial documents
128
- - **FinCL**: Concept mapping/tagging with retrieval+rerank or classification approaches
129
- - **Taxonomy**: US-GAAP taxonomy provided for concept mapping tasks
130
-
131
- #### 🔍 6. FinAuditing Data
132
-
133
- For the FinAuditing tasks (FinSM, FinRE, FinMR), participants can access the datasets from:
134
-
135
- - **Data Source**: XBRL Filing documents
136
- - **Repository**: https://github.com/The-FinAI/FinAuditing
137
- - **Tasks**:
138
- - **FinSM**: Semantic inconsistency detection using retrieval methods
139
- - **FinRE**: Relation inconsistency detection with three-class classification (Reversal, Inappropriateness, CombinationErr)
140
- - **FinMR**: Mathematical reasoning for XBRL element value extraction and calculation
141
- - **Subsets**: Each task provides dedicated subsets for ICAIF 2025 challenge evaluation
142
-
143
- ---
144
-
145
- ### 🔧 Training and Fine-tuning
146
-
147
- - 📎 Provided Code: [task_2_finetune.ipynb](./task_2_finetune.ipynb)
148
-
149
- This script offers a basic template for fine-tuning:
150
-
151
- - The notebook is simplified for quick start.
152
- - For more detailed instructions, please check the tutorials under the FinLoRA docs here: https://finlora-docs.readthedocs.io/en/latest/index.html.
153
- - The full process for a **simplified** Buffett Agent model we created can be found here: https://finlora-docs.readthedocs.io/en/latest/tutorials/buffett_agent.html.
154
-
155
- Note: We will additionally test on subsets of various financial datasets. Please use the batched versions provided in this folder for fine-tuning to avoid overfitting.
156
-
157
- ---
158
-
159
- ### 📥 Submission Requirement
160
-
161
- Submit a Hugging Face repository with model weights, scripts, and all necessary files for inference. Make sure your submission includes:
162
-
163
- - Model weights and configuration files
164
- - Inference scripts for all subtasks
165
- - Requirements.txt or environment.yml
166
- - Clear documentation on how to run inference
167
- - Any custom libraries or preprocessing code
168
-
169
- ### 📊 Metrics
170
-
171
- The model evaluation in each domain is the average score of all tasks.
172
-
173
- #### 📘 Note for Participants
174
-
175
- Participants are encouraged to use the above sources as a starting point to construct their own training/fine-tuning datasets. Your model's performance will strongly depend on the quality and comprehensiveness of your self-collected training data. These sources can help you build a rich and task-aligned dataset for model training, ensuring better performance on regulatory reasoning and question answering.
176
-
177
- To ensure fair comparison and practical deployment, it is recommended that the model size should not exceed 8B parameters.
178
-
179
- ---
180
-
181
- ### 📚 References
182
-
183
- [1] Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, Mark Dredze, Sebastian Gehrmann, Prabhanjan Kambadur, David Rosenberg, Gideon Mann. BloombergGPT: A Large Language Model for Finance. arXiv: 2303.17564, 2023.
184
-
185
- [2] Xiao-Yang Liu, Guoxuan Wang, Hongyang Yang, Daochen Zha. FinGPT: Democratizing internet-scale data for financial large language models. Workshop on Instruction Tuning and Instruction Following, NeurIPS, 2023.
186
-
187
- [3] Felix Tian, Ajay Byadgi, Daniel S Kim, Daochen Zha, Matt White, Kairong Xiao, Xiao-Yang Liu. Customized FinGPT search agents using foundation models. Proceedings of the 5th ACM International Conference on AI in Finance, pages 469--477, 2024.
188
-
189
- [4] Yinheng Li, Shaofei Wang, Han Ding, Hang Chen. Large Language Models in Finance: A Survey. ACM International Conference on AI in Finance, pages 374–382, 2023.
190
-
191
- [5] Yuqi Nie, Yaxuan Kong, Xiaowen Dong, John M. Mulvey, H. Vincent Poor, Qingsong Wen, Stefan Zohren. A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges. arXiv preprint arXiv:2406.11903, 2024.
192
-
193
- [6] Shengyuan Colin Lin, Felix Tian, Keyi Wang, Xingjian Zhao, Jimin Huang, Qianqian Xie, Luca Borella, Matt White, Christina Dan Wang, Kairong Xiao, Xiao-Yang Liu Yanglet, Li Deng. Open FinLLM Leaderboard: Towards Financial AI Readiness. International Workshop on Multimodal Financial Foundation Models (MFFMs) at 5th ACM International Conference on AI in Finance, 2024.
194
-
195
- [7] Haochen Sun, Jason Li, Hongyang Zhang. zkLLM: Zero Knowledge Proofs for Large Language Models. ACM SIGSAC Conference on Computer and Communications Security, 2024.
196
-
197
- [8] Xiao-Yang Liu, Ziyi Xia, Hongyang Yang, Jiechao Gao, Daochen Zha, Ming Zhu, Christina Dan Wang, Zhaoran Wang, Jian Guo. Dynamic datasets and market environments for financial reinforcement learning. Machine Learning - Nature, 2024.
198
-
199
- [9] Cao, Yupeng, et al. "FinAudio: A Benchmark for Audio Large Language Models in Financial Applications." arXiv preprint arXiv:2503.20990 (2025).