IntelLabs
/

eftnas-s1-bert-base

Model card Files Files and versions

eftnas-s1-bert-base / README.md

jinjieyuan's picture

Update README.md

e9998a9 about 2 years ago

|

2.04 kB

	---
	language: en
	license: apache-2.0
	datasets:
	- nyu-mll/glue
	---

	# EFTNAS Model Card: eftnas-s1-bert-base

	The super-networks fine-tuned on BERT-base with [GLUE benchmark](https://gluebenchmark.com/) using EFTNAS.

	## Model Details

	### Information

	- Model name: eftnas-s1-bert-base-[TASK]
	- Base model: [bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased)
	- Subnetwork version: Super-network
	- NNCF Configurations: [eftnas_configs](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS/eftnas_configs)

	### Training and Evaluation

	[GLUE benchmark](https://gluebenchmark.com/)

	## Results

	Results of the optimal sub-network discoverd from the super-network:

	\| Model \| GFLOPs \| GLUE Avg. \| MNLI-m \| QNLI \| QQP \| SST-2 \| CoLA \| MRPC \| RTE \|
	\|-------------------------------\|-----------\|---------------\|----------\|------\|----------\|----------\|----------\|----------\|------\|
	\| Development Set: \|
	\| EFTNAS-S1 \| 5.7 \| 82.9 \| 84.6 \| 90.8 \| 91.2 \| 93.5 \| 60.6 \| 90.8 \| 69.0 \|
	\| Test Set: \|
	\| EFTNAS-S1 \| 5.7 \| 77.7 \| 83.7 \| 89.9 \| 71.8 \| 93.4 \| 52.6 \| 87.6 \| 65.0 \|


	## Model Sources

	- Repository: [https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS)
	- Paper: [Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks]()

	## Citation

	```bibtex
	@inproceedings{
	eftnas2024,
	title={Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks},
	author={J. Pablo Munoz and Yi Zheng and Nilesh Jain},
	booktitle={The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation},
	year={2024},
	url={}
	}
	```

	## License

	Apache-2.0