Adding ONNX file of this model

#2
by pszemraj - opened
Files changed (2):
  1. README.md (+20 −37)
  2. config.json (+20 −0)
README.md CHANGED

@@ -1,4 +1,6 @@
 ---
+languages:
+- en
 license:
 - cc-by-nc-sa-4.0
 - apache-2.0
@@ -55,54 +57,24 @@ widget:
     interested and anyone basical e may be applyind reaching the browing
     approach were
   - medical course audio transcription
-inference: False
-pipeline_tag: text2text-generation
+parameters:
+  max_new_tokens: 128
+  num_beams: 4
+  repetition_penalty: 1.21
+  length_penalty: 1
+  early_stopping: true
 language:
 - en
+pipeline_tag: text2text-generation
 ---


 # bart-base-grammar-synthesis

-<a href="https://colab.research.google.com/gist/pszemraj/3e44e384dcd4614e1350d457bf9be8ad/bart-batch-grammar-check-correct-demo.ipynb">
-<img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
-</a>
-
-
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on an expanded version of the JFLEG dataset.

 You can find other grammar-synthesis models by [searching for the grammar synthesis tag](https://huggingface.co/models?other=grammar%20synthesis)

-## Basic Usage Example
-
-### Installation
-
-First, make sure you have the `transformers` package installed. You can install it using pip:
-
-```
-pip install -U transformers
-```
-
-### Usage
-
-```python
-from transformers import pipeline
-
-# Initialize the text-generation pipeline for text correction
-corrector = pipeline("text2text-generation", "pszemraj/bart-base-grammar-synthesis")
-
-# Example text to correct
-raw_text = "The toweris 324 met (1,063 ft) tall, about height as .An 81-storey building, and biggest longest structure paris. Is square, measuring 125 metres (410 ft) on each side. During its constructiothe eiffel tower surpassed the washington monument to become the tallest man-made structure in the world, a title it held for 41 yearsuntilthe chryslerbuilding in new york city was finished in 1930. It was the first structure to goat a height of 300 metres. Due 2 the addition ofa brdcasting aerial at the t0pp of the twr in 1957, it now taller than chrysler building 5.2 metres (17 ft). Exxxcluding transmitters, eiffel tower is 2ndd tallest ree-standing structure in france after millau viaduct."
-
-# Correct the text using the text-generation pipeline
-corrected_text = corrector(raw_text)[0]["generated_text"]
-
-# Print the corrected text
-print(corrected_text)
-```
-
-This example demonstrates how to use the text-generation pipeline to correct the grammar in a given text. The `corrector` pipeline is initialized with the "pszemraj/bart-base-grammar-synthesis" model, which is designed for grammar correction. The `corrector` pipeline takes the raw text as input and returns the corrected text. Make sure to install the required dependencies and models before running the code.
-

 ## Intended uses & limitations

@@ -129,3 +101,14 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.02
 - num_epochs: 3.0
+
+### Training results
+
+
+
+### Framework versions
+
+- Transformers 4.28.1
+- Pytorch 2.0.1+cu117
+- Datasets 2.12.0
+- Tokenizers 0.13.3
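The `parameters:` block added to the README's YAML front matter sets the generation settings used by the Hub inference widget. For local use, the same values can be passed as generation kwargs to a `transformers` pipeline. A minimal sketch, with the kwargs copied from the diff above; the pipeline call itself is shown commented out since it downloads the model weights:

```python
# Generation settings copied from the `parameters:` block added in this PR.
# They map one-to-one onto generate()/pipeline call kwargs in transformers.
gen_kwargs = {
    "max_new_tokens": 128,
    "num_beams": 4,
    "repetition_penalty": 1.21,
    "length_penalty": 1.0,
    "early_stopping": True,
}

# Assuming the `transformers` package is installed, the widget's behavior can
# be reproduced locally like so (commented out to avoid a model download):
# from transformers import pipeline
# corrector = pipeline("text2text-generation", "pszemraj/bart-base-grammar-synthesis")
# print(corrector("he go to school yesterday", **gen_kwargs)[0]["generated_text"])

print(gen_kwargs["num_beams"])
```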
config.json CHANGED

@@ -48,6 +48,26 @@
   "num_hidden_layers": 6,
   "pad_token_id": 1,
   "scale_embedding": false,
+  "task_specific_params": {
+    "summarization": {
+      "length_penalty": 1.0,
+      "max_length": 128,
+      "min_length": 12,
+      "num_beams": 4
+    },
+    "summarization_cnn": {
+      "length_penalty": 2.0,
+      "max_length": 142,
+      "min_length": 56,
+      "num_beams": 4
+    },
+    "summarization_xsum": {
+      "length_penalty": 1.0,
+      "max_length": 62,
+      "min_length": 11,
+      "num_beams": 6
+    }
+  },
   "torch_dtype": "float32",
   "transformers_version": "4.28.1",
   "use_cache": true,