Update README.md
Browse files
README.md
CHANGED
|
@@ -11,10 +11,14 @@ LLMs have the potential to support representative democracy by providing constit
|
|
| 11 |
### Training Data
|
| 12 |
|
| 13 |
The primary dataset used to train and evaluate this model is EZ-STANCE. This dataset contains labeled stances on a variate of topics across politics and pop culture. This dataset contains the following four fields which are relevant for this project.
|
| 14 |
-
|
| 15 |
-
|
| 16 |
-
|
| 17 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
|
| 19 |
### Training Method
|
| 20 |
The base model used for this project was Qwen2.5-7B-Instruct-1M . I chose this model because it could handle large context windows, was instruction tuned, and its relatively low number of parameters would make it more efficient to train. The final model was trained on the stance classification task using the LoRA method of PEFT. Then, few-shot chain-of-thought prompting was used to ask the final model for reasoning behind the stances it generated. When reviewing the output of the model on my task, I observed that few-shot prompting alone went a very long way in improving the output of the model when having it explain its reasoning, which is why I only trained the model on the stance classification component of the task. I used PEFT over full fine-tuning because I did not want to drastically change my model since it was already performing well on the reasoning task. Also, since I am using a 7B parameter model and my desired model output is open-ended, I had concerns around the efficiency of full-fine tuning. My aim was to take a targeted training approach to assist the model on its classification task.
|
|
@@ -51,19 +55,33 @@ The primary limitation encountered was improving stance classification accuracy
|
|
| 51 |
MARKDOWN RESULTS TABLE PLACEHOLDER
|
| 52 |
Another limitation of this approach is that I was only training on the stance classification task. Given that the minimally invasive approach of PEFT did not have much impact, it could be worth exploring more impactful approaches like full fine-tuning. In that case, there is a risk that training the entire model will damage its ability to reason logically.
|
| 53 |
Great Example of Why “None” is hard:
|
|
|
|
| 54 |
Statement: This had the effect of taking the fringe religious doctrine of NI and making it much more common and American evangelical teaching but also taught a lot of churches that private education is a massive cash cow money laundry tax dodge on top of being a church
|
|
|
|
| 55 |
Topic or Claim: Private education institutions could face scrutiny in the future for potential financial irregularities, leading to investigations into money laundering and tax evasion practices.
|
|
|
|
| 56 |
Stance: FAVOR
|
|
|
|
| 57 |
Response: The author supports the claim by linking private education institutions to financial irregularities, including money laundering and tax evasion, suggesting that these institutions could indeed face future scrutiny and investigations. This aligns with the claim that private education may be at risk due to such practices.
|
|
|
|
| 58 |
Example of the model perhaps confusing sentiment and stance, and maybe struggling with slang:
|
|
|
|
| 59 |
Statement: Let s aim to recycle and reuse as much as we can. . . . . noplastic plasticfree plasticpollution environment environmentalawareness environmentalfriendly savetheplanet plasticpollution noplasticbags noplasticwaste makeachange makeachangetoday
|
|
|
|
| 60 |
Topic or Claim: plasticpollution
|
|
|
|
| 61 |
Stance: FAVOR
|
|
|
|
| 62 |
Response: The author is in favor of reducing plastic pollution by promoting recycling, reusing, and minimizing plastic use. Phrases like "make a change today" and "save the planet" indicate a strong commitment to addressing environmental issues related to plastic pollution.
|
|
|
|
| 63 |
Example of getting the right answer for the wrong reason:
|
|
|
|
| 64 |
Statement: Two young Russian men decided to flee military mobilization and took the boat across the Bering Strait to a small American island off the coast of Alaska. They have now applied for asylum as refugees. Will the U.S. see a new wave of migrants in the north? 99 582 2,367
|
|
|
|
| 65 |
Topic or Claim: Two Russian men have fled military mobilization and taken a boat across the Bering Strait to Alaska, seeking asylum as refugees.
|
|
|
|
| 66 |
Stance: FAVOR
|
|
|
|
| 67 |
Response: The author supports the claim by highlighting the Russian men's actions and the potential for increased migration, suggesting a favorable view of their situation and the broader implications. This indicates an overall positive stance toward the topic.
|
| 68 |
|
| 69 |
## Sources and References
|
|
|
|
| 11 |
### Training Data
|
| 12 |
|
| 13 |
The primary dataset used to train and evaluate this model is EZ-STANCE. This dataset contains labeled stances on a variate of topics across politics and pop culture. This dataset contains the following four fields which are relevant for this project.
|
| 14 |
+
|
| 15 |
+
Text: The source Tweet that stances will be generated from.
|
| 16 |
+
|
| 17 |
+
Target: A modification of the source Tweet which indicates a favorable, neutral, or unfavorable stance. The creators of the datasets used human annotators to generate these values.
|
| 18 |
+
|
| 19 |
+
Stance: The stance label for the Target text (Favorable, Unfavorable, or Neutral).
|
| 20 |
+
|
| 21 |
+
Using this dataset, I was able to provide the model with the source text and ask it to determine whether the author of that text would have a favorable, unfavorable, or no stance towards the target topic or claim. I did not make any modifications to those fields in the training dataset, other than adding structure around the data in the prompt to clarify to the model what I wanted it to provide. The second component of the task was to have the model provide step-by-step reasoning behind the stance it provided. This reasoning was not included in the training dataset, but I thought it was important to have the model generate this reasoning because an explanation for the user to see is important when considering that the original reason behind this model is to help build trust.
|
| 22 |
|
| 23 |
### Training Method
|
| 24 |
The base model used for this project was Qwen2.5-7B-Instruct-1M . I chose this model because it could handle large context windows, was instruction tuned, and its relatively low number of parameters would make it more efficient to train. The final model was trained on the stance classification task using the LoRA method of PEFT. Then, few-shot chain-of-thought prompting was used to ask the final model for reasoning behind the stances it generated. When reviewing the output of the model on my task, I observed that few-shot prompting alone went a very long way in improving the output of the model when having it explain its reasoning, which is why I only trained the model on the stance classification component of the task. I used PEFT over full fine-tuning because I did not want to drastically change my model since it was already performing well on the reasoning task. Also, since I am using a 7B parameter model and my desired model output is open-ended, I had concerns around the efficiency of full-fine tuning. My aim was to take a targeted training approach to assist the model on its classification task.
|
|
|
|
| 55 |
MARKDOWN RESULTS TABLE PLACEHOLDER
|
| 56 |
Another limitation of this approach is that I was only training on the stance classification task. Given that the minimally invasive approach of PEFT did not have much impact, it could be worth exploring more impactful approaches like full fine-tuning. In that case, there is a risk that training the entire model will damage its ability to reason logically.
|
| 57 |
Great Example of Why “None” is hard:
|
| 58 |
+
|
| 59 |
Statement: This had the effect of taking the fringe religious doctrine of NI and making it much more common and American evangelical teaching but also taught a lot of churches that private education is a massive cash cow money laundry tax dodge on top of being a church
|
| 60 |
+
|
| 61 |
Topic or Claim: Private education institutions could face scrutiny in the future for potential financial irregularities, leading to investigations into money laundering and tax evasion practices.
|
| 62 |
+
|
| 63 |
Stance: FAVOR
|
| 64 |
+
|
| 65 |
Response: The author supports the claim by linking private education institutions to financial irregularities, including money laundering and tax evasion, suggesting that these institutions could indeed face future scrutiny and investigations. This aligns with the claim that private education may be at risk due to such practices.
|
| 66 |
+
|
| 67 |
Example of the model perhaps confusing sentiment and stance, and maybe struggling with slang:
|
| 68 |
+
|
| 69 |
Statement: Let s aim to recycle and reuse as much as we can. . . . . noplastic plasticfree plasticpollution environment environmentalawareness environmentalfriendly savetheplanet plasticpollution noplasticbags noplasticwaste makeachange makeachangetoday
|
| 70 |
+
|
| 71 |
Topic or Claim: plasticpollution
|
| 72 |
+
|
| 73 |
Stance: FAVOR
|
| 74 |
+
|
| 75 |
Response: The author is in favor of reducing plastic pollution by promoting recycling, reusing, and minimizing plastic use. Phrases like "make a change today" and "save the planet" indicate a strong commitment to addressing environmental issues related to plastic pollution.
|
| 76 |
+
|
| 77 |
Example of getting the right answer for the wrong reason:
|
| 78 |
+
|
| 79 |
Statement: Two young Russian men decided to flee military mobilization and took the boat across the Bering Strait to a small American island off the coast of Alaska. They have now applied for asylum as refugees. Will the U.S. see a new wave of migrants in the north? 99 582 2,367
|
| 80 |
+
|
| 81 |
Topic or Claim: Two Russian men have fled military mobilization and taken a boat across the Bering Strait to Alaska, seeking asylum as refugees.
|
| 82 |
+
|
| 83 |
Stance: FAVOR
|
| 84 |
+
|
| 85 |
Response: The author supports the claim by highlighting the Russian men's actions and the potential for increased migration, suggesting a favorable view of their situation and the broader implications. This indicates an overall positive stance toward the topic.
|
| 86 |
|
| 87 |
## Sources and References
|