| | --- |
| | language: |
| | - en |
| | - pt |
| | library_name: peft |
| | model-index: |
| | - name: ZeroShot-Multilanguage-Zephyr-7B |
| | results: |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: ENEM Challenge (No Images) |
| | type: eduagarcia/enem_challenge |
| | split: train |
| | args: |
| | num_few_shot: 3 |
| | metrics: |
| | - type: acc |
| | value: 50.73 |
| | name: accuracy |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: BLUEX (No Images) |
| | type: eduagarcia-temp/BLUEX_without_images |
| | split: train |
| | args: |
| | num_few_shot: 3 |
| | metrics: |
| | - type: acc |
| | value: 45.62 |
| | name: accuracy |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: OAB Exams |
| | type: eduagarcia/oab_exams |
| | split: train |
| | args: |
| | num_few_shot: 3 |
| | metrics: |
| | - type: acc |
| | value: 39.32 |
| | name: accuracy |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: Assin2 RTE |
| | type: assin2 |
| | split: test |
| | args: |
| | num_few_shot: 15 |
| | metrics: |
| | - type: f1_macro |
| | value: 89.25 |
| | name: f1-macro |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: Assin2 STS |
| | type: eduagarcia/portuguese_benchmark |
| | split: test |
| | args: |
| | num_few_shot: 15 |
| | metrics: |
| | - type: pearson |
| | value: 67.74 |
| | name: pearson |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: FaQuAD NLI |
| | type: ruanchaves/faquad-nli |
| | split: test |
| | args: |
| | num_few_shot: 15 |
| | metrics: |
| | - type: f1_macro |
| | value: 71.37 |
| | name: f1-macro |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: HateBR Binary |
| | type: ruanchaves/hatebr |
| | split: test |
| | args: |
| | num_few_shot: 25 |
| | metrics: |
| | - type: f1_macro |
| | value: 80.18 |
| | name: f1-macro |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: PT Hate Speech Binary |
| | type: hate_speech_portuguese |
| | split: test |
| | args: |
| | num_few_shot: 25 |
| | metrics: |
| | - type: f1_macro |
| | value: 65.65 |
| | name: f1-macro |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | - task: |
| | type: text-generation |
| | name: Text Generation |
| | dataset: |
| | name: tweetSentBR |
| | type: eduagarcia/tweetsentbr_fewshot |
| | split: test |
| | args: |
| | num_few_shot: 25 |
| | metrics: |
| | - type: f1_macro |
| | value: 62.85 |
| | name: f1-macro |
| | source: |
| | url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=Weni/ZeroShot-Multilanguage-Zephyr-7B |
| | name: Open Portuguese LLM Leaderboard |
| | --- |
| | ## Training procedure |
| |
|
| |
|
| | The following `bitsandbytes` quantization config was used during training: |
| | - quant_method: bitsandbytes |
| | - load_in_8bit: False |
| | - load_in_4bit: True |
| | - llm_int8_threshold: 6.0 |
| | - llm_int8_skip_modules: None |
| | - llm_int8_enable_fp32_cpu_offload: False |
| | - llm_int8_has_fp16_weight: False |
| | - bnb_4bit_quant_type: nf4 |
| | - bnb_4bit_use_double_quant: True |
| | - bnb_4bit_compute_dtype: bfloat16 |
| | ### Framework versions |
| | |
| | |
| | - PEFT 0.4.0 |
| | |
| | # Open Portuguese LLM Leaderboard Evaluation Results |
| | |
| | Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/Weni/ZeroShot-Multilanguage-Zephyr-7B) and on the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard) |
| | |
| | | Metric | Value | |
| | |--------------------------|---------| |
| | |Average |**63.63**| |
| | |ENEM Challenge (No Images)| 50.73| |
| | |BLUEX (No Images) | 45.62| |
| | |OAB Exams | 39.32| |
| | |Assin2 RTE | 89.25| |
| | |Assin2 STS | 67.74| |
| | |FaQuAD NLI | 71.37| |
| | |HateBR Binary | 80.18| |
| | |PT Hate Speech Binary | 65.65| |
| | |tweetSentBR | 62.85| |
| | |
| | |