histlearn commited on May 24

Commit

4e70841

verified ·

1 Parent(s): b662189

Initial release: 5 LoRA fold adapters + souped + model card

Browse files

Files changed (17) hide show

README.md +159 -0
adapter_fold_1/adapter_config.json +45 -0
adapter_fold_1/adapter_model.safetensors +3 -0
adapter_fold_2/adapter_config.json +45 -0
adapter_fold_2/adapter_model.safetensors +3 -0
adapter_fold_3/adapter_config.json +45 -0
adapter_fold_3/adapter_model.safetensors +3 -0
adapter_fold_4/adapter_config.json +45 -0
adapter_fold_4/adapter_model.safetensors +3 -0
adapter_fold_5/adapter_config.json +45 -0
adapter_fold_5/adapter_model.safetensors +3 -0
adapter_souped/adapter_config.json +45 -0
adapter_souped/adapter_model.safetensors +3 -0
examples/inference_ensemble.py +38 -0
examples/inference_single_fold.py +29 -0
examples/inference_souped.py +29 -0
manifesto.json +28 -0

README.md ADDED Viewed

	@@ -0,0 +1,159 @@

+---
+license: apache-2.0
+base_model: Qwen/Qwen3-Reranker-0.6B
+library_name: peft
+language:
+- pt
+tags:
+- text-classification
+- community-notes
+- portuguese
+- reranker
+- lora
+- peft
+- misinformation
+pipeline_tag: text-classification
+---
+# Community Notes Reranker (PT-BR) — Qwen3-Reranker fine-tunado
+Cross-encoder fine-tunado com **LoRA** sobre `Qwen/Qwen3-Reranker-0.6B` para classificar a **utilidade** de notas da comunidade do X (antigo Twitter) em português brasileiro.
+> Dado um par (tweet, nota), o modelo devolve a probabilidade de que a comunidade marcaria a nota como **útil** (`CURRENTLY_RATED_HELPFUL` / CRH) versus **não-útil** (`CURRENTLY_RATED_NOT_HELPFUL` / CRNH).
+## Como usar
+```python
+import json, torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+from peft import PeftModel
+from huggingface_hub import snapshot_download
+REPO = "histlearn/community-notes-reranker-ptbr"
+path = snapshot_download(REPO)
+m    = json.load(open(f"{path}/manifesto.json"))
+tok = AutoTokenizer.from_pretrained(m["base_model"], padding_side="left")
+base = AutoModelForCausalLM.from_pretrained(m["base_model"], torch_dtype=torch.float16)
+model = PeftModel.from_pretrained(base, f"{path}/adapter_souped").cuda().eval()
+def util_prob(tweet: str, nota: str) -> float:
+    text = (m["prompt_prefixo"] + "<Instruct>: " + m["instrucao"] +
+            "\n<Query>: " + tweet + "\n<Document>: " + nota + m["prompt_sufixo"])
+    enc = tok(text, return_tensors="pt", truncation=True, max_length=m["max_length"]).to(model.device)
+    with torch.no_grad():
+        l = model(**enc).logits[:, -1, :]
+    return float(torch.sigmoid(l[:, m["id_yes"]] - l[:, m["id_no"]]).item())
+print(util_prob(
+    "Bolsonaro disse que a Terra e plana",
+    "Bolsonaro nunca afirmou isso; checagem em https://exemplo.org/checagem"
+))
+```
+Exemplos completos em `examples/`:
+- `inference_single_fold.py` — usa só o `adapter_fold_1` (rápido)
+- `inference_ensemble.py` — média das probas dos 5 folds (**reproduz exatamente o número reportado** abaixo)
+- `inference_souped.py` — usa o soup pré-computado (rápido, qualidade próxima)
+## Resultados
+Avaliação **out-of-fold** sob `StratifiedGroupKFold(5)` agrupado por `tweetId` em 13.525 notas
+(hidratação cobrindo 70% das notas estritas em PT-BR; 71,67% positivos).
+| Modelo | macro-F1 | ROC-AUC | MCC | PR-AUC (minoritária) |
+|---|---|---|---|---|
+| **Ensemble de probas (5 folds)** | **0.7920** | **0.8932** | **0.5905** | **0.8293** |
+| Soup-OOF (1 forward) | 0.9097 | 0.9714 | 0.8209 | 0.9505 |
+Este modelo é o **D3** da escada experimental do projeto Community Notes BR. Para contexto:
+| Baseline | macro-F1 | ROC-AUC | usa tweet? |
+|---|---|---|---|
+| Dummy (classe majoritária) | 0.4175 | 0.5000 | — |
+| TF-IDF nota + LR | 0.7725 | 0.8622 | não |
+| Embedding Qwen nota + LR | 0.7489 | 0.8488 | não |
+| Embedding Qwen nota+tweet + LR | 0.7193 | 0.8057 | sim (frozen) |
+| **D3 (este modelo)** | **0.7920** | **0.8932** | **sim (fine-tuned)** |
+| Stacking de todos os baselines + D3 | 0.8282 | 0.9081 | sim |
+Leitura central: **com cross-encoder fine-tunado, somar o tweet à nota traz ganho real** (D3 0.79 vs nota-só 0.77). Com embeddings frozen o tweet atrapalha — só com aprendizado conjunto da interação tweet↔nota o sinal aparece.
+### Soup de produção (única forward pass)
+Carregando `adapter_souped/` em vez de um fold único, o usuário obtém uma versão "fundida" dos 5 adapters (média aritmética dos pesos LoRA). A qualidade dele foi validada em **Soup-OOF (leave-one-fold-out)**:
+| Métrica | Ensemble de probas (5×forward) | Soup-OOF (1×forward) | Δ |
+|---|---|---|---|
+| macro-F1 | 0.7920 | 0.9097 | +11.78 pp |
+| ROC-AUC  | 0.8932  | 0.9714  | +7.82 pp |
+| MCC      | 0.5905      | 0.8209      | +23.04 pp |
+| PR-AUC (minoritária) | 0.8293 | 0.9505 | +12.12 pp |
+Esta é a versão recomendada para produção (custo de inferência = 1 fold único).
+## Dados de treino
+- Dataset base: [`histlearn/notas-comunidade-ptbr`](https://huggingface.co/datasets/histlearn/notas-comunidade-ptbr) (notas em PT-BR, CC0, sem texto de tweet por restrição da X).
+- Texto de tweet hidratado via *X syndication* (não redistribuído).
+- Universo estrito: notas com `consenso ∈ {"CRH", "CRNH"}`.
+## Detalhes de treinamento
+- Base: `Qwen/Qwen3-Reranker-0.6B`
+- Método: **LoRA** (`r=16`, `α=32`, `dropout=0.1`, alvos `q_proj, k_proj, v_proj, o_proj`)
+- Loss: `BCEWithLogitsLoss` sobre `logit(yes) − logit(no)`, com `pos_weight = n_neg / n_pos`
+- Otimizador: AdamW, `lr = 0.0001`, `batch = 8`, `epochs = 2` por fold
+- `max_length = 512`, mixed precision fp16
+- Protocolo: `StratifiedGroupKFold(5)` agrupado por `tweetId` (evita vazamento intra-tweet)
+- Seed: `42`
+Template do prompt (literal, do manifesto):
+```
+<|im_start|>system
+Judge whether the Document meets the requirements based on the Query and the Instruct provided. Note that the answer can only be "yes" or "no".<|im_end|>
+<|im_start|>user
+<Instruct>: A nota e uma Community Note util e bem avaliada para o tweet?
+<Query>: <texto do tweet>
+<Document>: <texto da nota><|im_end|>
+<|im_start|>assistant
+<think>
+</think>
+```
+O score é `sigmoid(logits["yes"] − logits["no"])` na última posição.
+## Limitações e vieses
+- **Viés de seleção:** o universo `CRH ∪ CRNH` exclui ~80% das notas (que ficaram em `NEEDS_MORE_RATINGS`). O modelo aprende a separar CRH×CRNH, não a fazer triagem de notas indecisas.
+- **Viés de sobrevivência da hidratação:** tweets de notas CRH hidrataram 61%; tweets de CRNH hidrataram 87%. Notas CRH "boas" tinham tweet enganoso mais frequentemente removido. As métricas refletem o universo hidratado, não o universo total.
+- **Domínio:** PT-BR, contexto político-social brasileiro 2024-2025. Generalização para outros idiomas, períodos ou plataformas exige re-treino.
+- **Não é classificador de veracidade:** decide se a *comunidade marcaria como útil*, não se o conteúdo é factualmente correto. Útil≠verdadeiro, não-útil≠falso.
+- **Sob deslocamento temporal:** o ensemble degrada ~1 pp macro-F1 ao treinar no passado e testar no futuro (testado no projeto); o D0e tabular sozinho degrada ~3 pp. Robusto, mas não imune.
+## Reprodutibilidade
+O experimento completo está em [github.com/.../community-notes-br](https://github.com/) (substituir pelo repo do projeto): NB1 hidrata tweets, NB2 treina os 5 folds e gera os adapters publicados aqui, NB3 reorganiza a comparação por paradigmas. Os `oof_fold_{1..5}.npz` correspondentes a estes adapters estão no zip de artefatos do projeto.
+## Citação
+```bibtex
+@misc{communitynotes_reranker_ptbr_2026,
+  author = {Rocha, Davi Machado},
+  title  = {Community Notes Reranker (PT-BR) — Qwen3-Reranker fine-tunado},
+  year   = {2026},
+  publisher = {Hugging Face},
+  url    = {https://huggingface.co/histlearn/community-notes-reranker-ptbr},
+  note   = {Disciplina Introdução à IA, prof. Ricardo M. Marcacini, ICMC/USP},
+}
+```
+## Licença
+Apache 2.0 (mesmo do base model). Use livre, atribuição apreciada.

adapter_fold_1/adapter_config.json ADDED Viewed

	@@ -0,0 +1,45 @@

+{
+  "alora_invocation_tokens": null,
+  "alpha_pattern": {},
+  "arrow_config": null,
+  "auto_mapping": null,
+  "base_model_name_or_path": "Qwen/Qwen3-Reranker-0.6B",
+  "bias": "none",
+  "corda_config": null,
+  "ensure_weight_tying": false,
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_bias": false,
+  "lora_dropout": 0.1,
+  "lora_ga_config": null,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "peft_version": "0.19.1",
+  "qalora_group_size": 16,
+  "r": 16,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "v_proj",
+    "o_proj",
+    "k_proj",
+    "q_proj"
+  ],
+  "target_parameters": null,
+  "task_type": "CAUSAL_LM",
+  "trainable_token_indices": null,
+  "use_bdlora": null,
+  "use_dora": false,
+  "use_qalora": false,
+  "use_rslora": false
+}

adapter_fold_1/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9497a46dcda94bf2c026b82d4ca3fc0fff205600bbc05c3580d25a51f9ac89a9
+size 18380008

adapter_fold_2/adapter_config.json ADDED Viewed

	@@ -0,0 +1,45 @@

+{
+  "alora_invocation_tokens": null,
+  "alpha_pattern": {},
+  "arrow_config": null,
+  "auto_mapping": null,
+  "base_model_name_or_path": "Qwen/Qwen3-Reranker-0.6B",
+  "bias": "none",
+  "corda_config": null,
+  "ensure_weight_tying": false,
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_bias": false,
+  "lora_dropout": 0.1,
+  "lora_ga_config": null,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "peft_version": "0.19.1",
+  "qalora_group_size": 16,
+  "r": 16,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "v_proj",
+    "o_proj",
+    "k_proj",
+    "q_proj"
+  ],
+  "target_parameters": null,
+  "task_type": "CAUSAL_LM",
+  "trainable_token_indices": null,
+  "use_bdlora": null,
+  "use_dora": false,
+  "use_qalora": false,
+  "use_rslora": false
+}

adapter_fold_2/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f8b8c55796f8c846678c5ac5d56b96c9faf9075ba83cf01b49aafa52ba15a358
+size 18380008

adapter_fold_3/adapter_config.json ADDED Viewed

	@@ -0,0 +1,45 @@

+{
+  "alora_invocation_tokens": null,
+  "alpha_pattern": {},
+  "arrow_config": null,
+  "auto_mapping": null,
+  "base_model_name_or_path": "Qwen/Qwen3-Reranker-0.6B",
+  "bias": "none",
+  "corda_config": null,
+  "ensure_weight_tying": false,
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_bias": false,
+  "lora_dropout": 0.1,
+  "lora_ga_config": null,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "peft_version": "0.19.1",
+  "qalora_group_size": 16,
+  "r": 16,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "v_proj",
+    "o_proj",
+    "k_proj",
+    "q_proj"
+  ],
+  "target_parameters": null,
+  "task_type": "CAUSAL_LM",
+  "trainable_token_indices": null,
+  "use_bdlora": null,
+  "use_dora": false,
+  "use_qalora": false,
+  "use_rslora": false
+}

adapter_fold_3/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:97313d9d2f7b3bf10c68567454bd6235cf9b2006253a29f4b1566bcb3600f16b
+size 18380008

adapter_fold_4/adapter_config.json ADDED Viewed

	@@ -0,0 +1,45 @@

+{
+  "alora_invocation_tokens": null,
+  "alpha_pattern": {},
+  "arrow_config": null,
+  "auto_mapping": null,
+  "base_model_name_or_path": "Qwen/Qwen3-Reranker-0.6B",
+  "bias": "none",
+  "corda_config": null,
+  "ensure_weight_tying": false,
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_bias": false,
+  "lora_dropout": 0.1,
+  "lora_ga_config": null,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "peft_version": "0.19.1",
+  "qalora_group_size": 16,
+  "r": 16,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "v_proj",
+    "o_proj",
+    "k_proj",
+    "q_proj"
+  ],
+  "target_parameters": null,
+  "task_type": "CAUSAL_LM",
+  "trainable_token_indices": null,
+  "use_bdlora": null,
+  "use_dora": false,
+  "use_qalora": false,
+  "use_rslora": false
+}

adapter_fold_4/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:809c53eb0613726acdb3dc7f567f9827d9d144993931b0ec25eb225a306b9ed9
+size 18380008

adapter_fold_5/adapter_config.json ADDED Viewed

	@@ -0,0 +1,45 @@

+{
+  "alora_invocation_tokens": null,
+  "alpha_pattern": {},
+  "arrow_config": null,
+  "auto_mapping": null,
+  "base_model_name_or_path": "Qwen/Qwen3-Reranker-0.6B",
+  "bias": "none",
+  "corda_config": null,
+  "ensure_weight_tying": false,
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_bias": false,
+  "lora_dropout": 0.1,
+  "lora_ga_config": null,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "peft_version": "0.19.1",
+  "qalora_group_size": 16,
+  "r": 16,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "v_proj",
+    "o_proj",
+    "k_proj",
+    "q_proj"
+  ],
+  "target_parameters": null,
+  "task_type": "CAUSAL_LM",
+  "trainable_token_indices": null,
+  "use_bdlora": null,
+  "use_dora": false,
+  "use_qalora": false,
+  "use_rslora": false
+}

adapter_fold_5/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:70e688fb737dabdfa971189d12fbd260f90aec91c5e2882975e544829d1271e5
+size 18380008

adapter_souped/adapter_config.json ADDED Viewed

	@@ -0,0 +1,45 @@

+{
+  "alora_invocation_tokens": null,
+  "alpha_pattern": {},
+  "arrow_config": null,
+  "auto_mapping": null,
+  "base_model_name_or_path": "Qwen/Qwen3-Reranker-0.6B",
+  "bias": "none",
+  "corda_config": null,
+  "ensure_weight_tying": false,
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_bias": false,
+  "lora_dropout": 0.1,
+  "lora_ga_config": null,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "peft_version": "0.19.1",
+  "qalora_group_size": 16,
+  "r": 16,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "v_proj",
+    "o_proj",
+    "k_proj",
+    "q_proj"
+  ],
+  "target_parameters": null,
+  "task_type": "CAUSAL_LM",
+  "trainable_token_indices": null,
+  "use_bdlora": null,
+  "use_dora": false,
+  "use_qalora": false,
+  "use_rslora": false
+}

adapter_souped/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5ee5218a520aabf72e89f4175df201aaae0bfd9ac050eafd4283bcc9b455dc0b
+size 18379976

examples/inference_ensemble.py ADDED Viewed

	@@ -0,0 +1,38 @@

+"""Ensemble de probas: usa os 5 folds e devolve a media.
+E a melhor pratica cientifica - replica o numero 0.7920 macro-F1 reportado.
+Em GPU T4: ~250ms por par. Em CPU: ~30s por par.
+"""
+import json, torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+from peft import PeftModel
+from huggingface_hub import snapshot_download
+REPO = "histlearn/community-notes-reranker-ptbr"
+path = snapshot_download(REPO, allow_patterns=["manifesto.json", "adapter_fold_*/*"])
+m = json.load(open(f"{path}/manifesto.json"))
+tok = AutoTokenizer.from_pretrained(m["base_model"], padding_side="left")
+base = AutoModelForCausalLM.from_pretrained(
+    m["base_model"], torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32)
+if torch.cuda.is_available(): base.cuda()
+def make_text(tw, nt):
+    return (m["prompt_prefixo"] + "<Instruct>: " + m["instrucao"] +
+            "\n<Query>: " + tw + "\n<Document>: " + nt + m["prompt_sufixo"])
+def score_ensemble(tweet, nota):
+    probs = []
+    for k in range(1, 6):
+        model = PeftModel.from_pretrained(base, f"{path}/adapter_fold_{k}")
+        model.eval()
+        enc = tok(make_text(tweet, nota), return_tensors="pt",
+                  truncation=True, max_length=m["max_length"]).to(model.device)
+        with torch.no_grad():
+            logits = model(**enc).logits[:, -1, :]
+        probs.append(float(torch.sigmoid(
+            logits[:, m["id_yes"]] - logits[:, m["id_no"]]).item()))
+        model.unload()  # libera memoria do adapter
+    return sum(probs) / 5
+print(score_ensemble("Bolsonaro disse que a Terra e plana",
+                     "Bolsonaro nunca afirmou isso; checagem em https://exemplo.org"))

examples/inference_single_fold.py ADDED Viewed

	@@ -0,0 +1,29 @@

+"""Inferencia rapida usando um unico fold (fold 1). Ideal para demos.
+Em CPU: ~5-10s por par. Em GPU T4: ~50ms.
+"""
+import json, torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+from peft import PeftModel
+from huggingface_hub import snapshot_download
+REPO = "histlearn/community-notes-reranker-ptbr"
+path = snapshot_download(REPO, allow_patterns=["manifesto.json", "adapter_fold_1/*"])
+m = json.load(open(f"{path}/manifesto.json"))
+tok = AutoTokenizer.from_pretrained(m["base_model"], padding_side="left")
+model = AutoModelForCausalLM.from_pretrained(
+    m["base_model"], torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32)
+model = PeftModel.from_pretrained(model, f"{path}/adapter_fold_1")
+if torch.cuda.is_available(): model.cuda()
+model.eval()
+def score(tweet, nota):
+    text = (m["prompt_prefixo"] + "<Instruct>: " + m["instrucao"] +
+            "\n<Query>: " + tweet + "\n<Document>: " + nota + m["prompt_sufixo"])
+    enc = tok(text, return_tensors="pt", truncation=True, max_length=m["max_length"]).to(model.device)
+    with torch.no_grad():
+        logits = model(**enc).logits[:, -1, :]
+    return float(torch.sigmoid(logits[:, m["id_yes"]] - logits[:, m["id_no"]]).item())
+print(score("Bolsonaro disse que a Terra e plana",
+            "Bolsonaro nunca afirmou isso; checagem em https://exemplo.org"))

examples/inference_souped.py ADDED Viewed

	@@ -0,0 +1,29 @@

+"""Inferencia usando o soup (media de pesos LoRA dos 5 folds).
+Velocidade de fold unico; qualidade validada em soup-OOF (ver model card).
+"""
+import json, torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+from peft import PeftModel
+from huggingface_hub import snapshot_download
+REPO = "histlearn/community-notes-reranker-ptbr"
+path = snapshot_download(REPO, allow_patterns=["manifesto.json", "adapter_souped/*"])
+m = json.load(open(f"{path}/manifesto.json"))
+tok = AutoTokenizer.from_pretrained(m["base_model"], padding_side="left")
+model = AutoModelForCausalLM.from_pretrained(
+    m["base_model"], torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32)
+model = PeftModel.from_pretrained(model, f"{path}/adapter_souped")
+if torch.cuda.is_available(): model.cuda()
+model.eval()
+def score(tweet, nota):
+    text = (m["prompt_prefixo"] + "<Instruct>: " + m["instrucao"] +
+            "\n<Query>: " + tweet + "\n<Document>: " + nota + m["prompt_sufixo"])
+    enc = tok(text, return_tensors="pt", truncation=True, max_length=m["max_length"]).to(model.device)
+    with torch.no_grad():
+        logits = model(**enc).logits[:, -1, :]
+    return float(torch.sigmoid(logits[:, m["id_yes"]] - logits[:, m["id_no"]]).item())
+print(score("Bolsonaro disse que a Terra e plana",
+            "Bolsonaro nunca afirmou isso; checagem em https://exemplo.org"))

manifesto.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "base_model": "Qwen/Qwen3-Reranker-0.6B",
+  "tarefa": "utilidade de Community Note (CRH=1, CRNH=0)",
+  "entrada": "par (tweet, nota) no template Qwen3-Reranker",
+  "prompt_prefixo": "<|im_start|>system\nJudge whether the Document meets the requirements based on the Query and the Instruct provided. Note that the answer can only be \"yes\" or \"no\".<|im_end|>\n<|im_start|>user\n",
+  "prompt_sufixo": "<|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\n",
+  "instrucao": "A nota e uma Community Note util e bem avaliada para o tweet?",
+  "max_length": 512,
+  "score": "sigmoid(logits['yes'] - logits['no']) na ultima posicao",
+  "id_yes": 9693,
+  "id_no": 2152,
+  "lora": {
+    "r": 16,
+    "alpha": 32,
+    "dropout": 0.1,
+    "targets": [
+      "q_proj",
+      "k_proj",
+      "v_proj",
+      "o_proj"
+    ]
+  },
+  "n_folds": 5,
+  "epocas": 2,
+  "lr": 0.0001,
+  "batch": 8,
+  "random_state": 42
+}