syndi-models
/

titlewave-t5-base

text2text-generation

text-generation-inference

Model card Files Files and versions

titlewave-t5-base / README.md

tennessejoyce

Initial commit

f5dd303 almost 5 years ago

|

history blame contribute delete

2.04 kB

	---
	language: en
	license: cc-by-4.0
	pipeline_tag: summarization
	widget:
	- text: "Example question body."
	---

	# Titlewave: t5-base

	## Model description

	Titlewave is a Chrome extension that helps you choose better titles for your Stack Overflow questions. See https://github.com/tennessejoyce/TitleWave for more information.
	This is one of two NLP models used in the Titlewave project, and its purpose is to suggests a new title based on on the body of the question. The companion model (https://huggingface.co/tennessejoyce/titlewave-bert-base-uncased) classifies whether question will be answered or not just based on the title

	## Intended use

	Try out different titles for your Stack Overflow post, and see which one gives you the best chance of recieving an answer.
	This model can be used in your browser as a Chrome extension by following the installation instructions at https://github.com/tennessejoyce/TitleWave.
	Or load it in Python like this (which will automatically download the model to your machine):

	```python
	>>> from transformers import pipeline
	>>> classifier = pipeline('summarization', model='tennessejoyce/titlewave-t5-base')
	>>> body = """"Example question body."""
	>>> classifier(body)

	[{'summary_text': 'Example title suggestion?'}]
	```


	## Training data

	The weights were initialized from the BERT base model (https://huggingface.co/bert-base-uncased), which was trained on BookCorpus and English Wikipedia.
	Then the model was fine-tuned on the dataset of previous Stack Overflow post titles (https://archive.org/details/stackexchange).
	Specifically I used three years of posts from 2017-2019, filtered out posts which were closed, and selected 25% of the remaining posts at random to use in
	the training set. In order to improve the quality of the titles generated, the model was trained only on questions with an accepted answer.

	## Evaluation

	See https://github.com/tennessejoyce/TitleWave/blob/master/model_training/test_summarizer.ipynb for the performance of the title generation model on the test set.