master_cate_bc9 / README.md

mini1013

Push model using huggingface_hub.

c61145e verified about 1 year ago

preview code

raw

history blame contribute delete

18.9 kB

metadata

tags:
  - setfit
  - sentence-transformers
  - text-classification
  - generated_from_setfit_trainer
widget:
  - text: 중탕기 젖병 국통 뷔페 휴대용 멜팅기 데우기 2칸 출산/육아 > 수유용품 > 보틀워머
  - text: 티지엠 실리콘 하트 쪽쪽이 일체형 공갈 노리개 젖꼭지 하트쪽쪽이_스노우 출산/육아 > 수유용품 > 노리개젖꼭지
  - text: >-
      제이앤제나 27쿠션 키즈 430백수 신생아부터 허리에 무리없는 분리형 백수_제나양_뒷면메쉬(커버+솜K27)세트_일반스트랩 출산/육아
      > 수유용품 > 수유쿠션/시트
  - text: '[모윰] 올실리콘 마카롱 쪽쪽이(전용케이스 포함) 2개세트 2단계(네추럴)_1단계(네추럴) 출산/육아 > 수유용품 > 기타수유용품'
  - text: >-
      앙뽀 실리콘 젖병 150ml 260ml 신생아 배앓이 젖병 출산 준비물 선물 실리콘 젖병 260ml_맘꼭지1단계(0~1개월)_화이트
      출산/육아 > 수유용품 > 젖병
metrics:
  - accuracy
pipeline_tag: text-classification
library_name: setfit
inference: true
base_model: mini1013/master_domain
model-index:
  - name: SetFit with mini1013/master_domain
    results:
      - task:
          type: text-classification
          name: Text Classification
        dataset:
          name: Unknown
          type: unknown
          split: test
        metrics:
          - type: accuracy
            value: 1
            name: Accuracy

SetFit with mini1013/master_domain

This is a SetFit model that can be used for Text Classification. This SetFit model uses mini1013/master_domain as the Sentence Transformer embedding model. A LogisticRegression instance is used for classification.

The model has been trained using an efficient few-shot learning technique that involves:

Fine-tuning a Sentence Transformer with contrastive learning.
Training a classification head with features from the fine-tuned Sentence Transformer.

Model Details

Model Description

Model Type: SetFit
Sentence Transformer body: mini1013/master_domain
Classification head: a LogisticRegression instance
Maximum Sequence Length: 512 tokens
Number of Classes: 12 classes

Model Sources

Repository: SetFit on GitHub
Paper: Efficient Few-Shot Learning Without Prompts
Blogpost: SetFit: Efficient Few-Shot Learning Without Prompts

Model Labels

Label	Examples
1.0	'빕스 쪽쪽이 대니쉬 보헴 천연고무 신제품 꽃모양 노리개 공갈젖꼭지 1개+1개+보관 케이스 2단계(6~18개월)_세이지_블러쉬 출산/육아 > 수유용품 > 노리개젖꼭지' '프리그 데이지 노리개 젖꼭지
0.0	'그로미미 컬러 핸들 밀크 출산/육아 > 수유용품 > 기타수유용품' '분유 제조기 자동 우유 기계 이유식 간편 수유 B 출산/육아 > 수유용품 > 기타수유용품' '릿첼 TLI 소프트 이유스푼세트2P (케이스포함) / 이유식스푼 릿첼 TLI 이유식볼(소) 99194 / 이유 출산/육아 > 수유용품 > 기타수유용품'
2.0	'엠피엘 모유저장팩 리필 출산/육아 > 수유용품 > 모유보관비닐팩' '메델라 모유저장팩 50매 모유저장팩 100매 출산/육아 > 수유용품 > 모유보관비닐팩' '모유저장팩 심플세트 바로모 mpl 30매 출산/육아 > 수유용품 > 모유보관비닐팩'
9.0	'스펙트라 유축기 깔때기 (와이드/일반형 흡입기) 소모품 일반형 흡입기세트_깔대기 M사이즈 (내경26mm 외경 83mm) 출산/육아 > 수유용품 > 유축기' '[대여] 스펙트라 유축기 (깔대기+젖병 미사용제품) 충전용 휴대용 임대 특A급 휴대용 S9+ [본체+어뎁터+유축세트]_7-3. PA 젖병 세트 - L 사이즈_1개월 대여 출산/육아 > 수유용품 > 유축기' '웨어러블 프리티 Free-T2 eco 유축기_핸즈프리, 휴대용 출산/육아 > 수유용품 > 유축기'
10.0	'헤겐 모유느낌 비중심 젖꼭지 단계선택 젖꼭지0단계 2P 출산/육아 > 수유용품 > 젖꼭지' '헤겐 모유느낌 비중심 젖꼭지 1단계 2P 출산/육아 > 수유용품 > 젖꼭지' '닥터브라운 내로우넥 옵션스플러스 젖꼭지 2P (P 1 2 3 4 Y-Cut 6종 중 선택) Y-CUT 2P (9개월~ 이유식/과즙용) 출산/육아 > 수유용품 > 젖꼭지'
11.0	'그로미미 PPSU 200ml 젖병 트윈팩 (크림베이지) 출산/육아 > 수유용품 > 젖병' '하이비 PPSU 에잇벤트 배앓이방지 빨대 젖병 170ml 270ml 1. 170 트윈팩_수박볼빵 샌드_L(6개월 이상) 출산/육아 > 수유용품 > 젖병' '하이비 PPSU 에잇벤트 배앓이방지 빨대 젖병 170ml 270ml 3. 170 트윈팩+핸들2P+추스트로우2P_다람쥐 화이트(11/13일부터 출고가능)_S(0개월 이상) 출산/육아 > 수유용품 > 젖병'
8.0	'유두 보호기 2개 실리콘 보호기 수유 커버 모유 06 Y 출산/육아 > 수유용품 > 유두케어/젖몸살용품' '유두 보호기 모유 수유 커버 재사용 가능한 수유 패드 01 1Pcs 출산/육아 > 수유용품 > 유두케어/젖몸살용품' '산모용(1개입)+교정용h(1개입)+교정용s(1개입) 호유방 니플업 함몰 유두 교정기 교정 산모 모유 수유 젖꼭지 꼭지 L(대): 15mm 출산/육아 > 수유용품 > 유두케어/젖몸살용품'
7.0	'더블하트 수유패드 60 135매 퍼펙션 모유저장팩 컴포트필 블루코어 출산준비물 06 퍼펙션 모유저장팩(200ml) 30x4팩 출산/육아 > 수유용품 > 수유패드' '폴레드 프랭클린 안심+ 수유패드 3개 세트_3mm허니콤 출산/육아 > 수유용품 > 수유패드' '아이앤비 수유패드 산후조리 모유수유 전후 대용량 100매 모유넘침방지 순간흡수 방수 출산/육아 > 수유용품 > 수유패드'
4.0	'모아케어 심플 BPA 프리 PP 분유케이스 크림_5단 출산/육아 > 수유용품 > 분유케이스' '제이미로프트 밀키보틀 휴대용 여행용 분유통 힙 분유 소분통 간식 보관 케이스 밀키보틀 600ml_민트 출산/육아 > 수유용품 > 분유케이스' '맘핸드슨 3단 분유케이스 B001 3단분유케이스(민트브라운) 출산/육아 > 수유용품 > 분유케이스'
6.0	'모유 수유쿠션 아기 신생아 역류방지쿠션 d자형 짱구베개 D자형_말리 출산/육아 > 수유용품 > 수유쿠션/시트' '임산부바디필로우 임신축하선물 초기임산부 대형 자는 U타입 편안한 옆으로 겨자 출산/육아 > 수유용품 > 수유쿠션/시트' '[제이앤제나] 아기 허리에 무리없는 신생아부터 커버분리 오리지널 역류방지쿠션 래빗 래빗_핑크(커버+솜)세트 출산/육아 > 수유용품 > 수유쿠션/시트'
5.0	'세맘스 수유가리개+파우치 01_스트라이프블랙 출산/육아 > 수유용품 > 수유가리개' '수유가리개 모유 아기띠 가림막 여름 수유 가리개 덮개 트림천 9종 도트그레이 출산/육아 > 수유용품 > 수유가리개' '수유가리개 수유커버 유모차햇빛가리개 우주선 출산/육아 > 수유용품 > 수유가리개'
3.0	'휴대용젖병워머 야외 다기능 물티슈 워머 온도조절 차량충전 캠핑워머 07 라운드 블랙 출산/육아 > 수유용품 > 보틀워머' '줄무늬 겨울골프모자 겨울등산모자 낚시 군밤 레드 출산/육아 > 수유용품 > 보틀워머' '캐릭터 입체 목도리-블랙 출산/육아 > 수유용품 > 보틀워머'

Evaluation

Metrics

Label	Accuracy
all	1.0

Uses

Direct Use for Inference

First install the SetFit library:

pip install setfit

Then you can load this model and run inference.

from setfit import SetFitModel

# Download from the 🤗 Hub
model = SetFitModel.from_pretrained("mini1013/master_cate_bc9")
# Run inference
preds = model("중탕기 젖병 국통 뷔페 휴대용 멜팅기 데우기 2칸 출산/육아 > 수유용품 > 보틀워머")

Training Details

Training Set Metrics

Training set	Min	Median	Max
Word count	7	14.4119	29

Label	Training Sample Count
0.0	70
1.0	70
2.0	70
3.0	70
4.0	70
5.0	70
6.0	70
7.0	70
8.0	70
9.0	70
10.0	70
11.0	70

Training Hyperparameters

batch_size: (256, 256)
num_epochs: (30, 30)
max_steps: -1
sampling_strategy: oversampling
num_iterations: 50
body_learning_rate: (2e-05, 1e-05)
head_learning_rate: 0.01
loss: CosineSimilarityLoss
distance_metric: cosine_distance
margin: 0.25
end_to_end: False
use_amp: False
warmup_proportion: 0.1
l2_weight: 0.01
seed: 42
eval_max_steps: -1
load_best_model_at_end: False

Training Results

Epoch	Step	Training Loss	Validation Loss
0.0061	1	0.4988	-
0.3030	50	0.4987	-
0.6061	100	0.4398	-
0.9091	150	0.1411	-
1.2121	200	0.0221	-
1.5152	250	0.0149	-
1.8182	300	0.0147	-
2.1212	350	0.008	-
2.4242	400	0.0071	-
2.7273	450	0.0069	-
3.0303	500	0.0003	-
3.3333	550	0.0	-
3.6364	600	0.0	-
3.9394	650	0.0	-
4.2424	700	0.0	-
4.5455	750	0.0	-
4.8485	800	0.0	-
5.1515	850	0.0	-
5.4545	900	0.0	-
5.7576	950	0.0	-
6.0606	1000	0.0	-
6.3636	1050	0.0	-
6.6667	1100	0.0	-
6.9697	1150	0.0	-
7.2727	1200	0.0	-
7.5758	1250	0.0	-
7.8788	1300	0.0	-
8.1818	1350	0.0	-
8.4848	1400	0.0	-
8.7879	1450	0.0	-
9.0909	1500	0.0	-
9.3939	1550	0.0	-
9.6970	1600	0.0	-
10.0	1650	0.0	-
10.3030	1700	0.0	-
10.6061	1750	0.0	-
10.9091	1800	0.0	-
11.2121	1850	0.0	-
11.5152	1900	0.0	-
11.8182	1950	0.0001	-
12.1212	2000	0.0	-
12.4242	2050	0.0	-
12.7273	2100	0.0	-
13.0303	2150	0.0	-
13.3333	2200	0.0	-
13.6364	2250	0.0	-
13.9394	2300	0.0	-
14.2424	2350	0.0	-
14.5455	2400	0.0	-
14.8485	2450	0.0	-
15.1515	2500	0.0	-
15.4545	2550	0.0	-
15.7576	2600	0.0	-
16.0606	2650	0.0	-
16.3636	2700	0.0	-
16.6667	2750	0.0001	-
16.9697	2800	0.0	-
17.2727	2850	0.0	-
17.5758	2900	0.0	-
17.8788	2950	0.0	-
18.1818	3000	0.0	-
18.4848	3050	0.0	-
18.7879	3100	0.0	-
19.0909	3150	0.0	-
19.3939	3200	0.0	-
19.6970	3250	0.0	-
20.0	3300	0.0	-
20.3030	3350	0.0	-
20.6061	3400	0.0	-
20.9091	3450	0.0	-
21.2121	3500	0.0	-
21.5152	3550	0.0	-
21.8182	3600	0.0	-
22.1212	3650	0.0	-
22.4242	3700	0.0	-
22.7273	3750	0.0	-
23.0303	3800	0.0	-
23.3333	3850	0.0	-
23.6364	3900	0.0	-
23.9394	3950	0.0	-
24.2424	4000	0.0	-
24.5455	4050	0.0	-
24.8485	4100	0.0	-
25.1515	4150	0.0	-
25.4545	4200	0.0	-
25.7576	4250	0.0	-
26.0606	4300	0.0	-
26.3636	4350	0.0	-
26.6667	4400	0.0	-
26.9697	4450	0.0	-
27.2727	4500	0.0	-
27.5758	4550	0.0	-
27.8788	4600	0.0	-
28.1818	4650	0.0	-
28.4848	4700	0.0	-
28.7879	4750	0.0	-
29.0909	4800	0.0	-
29.3939	4850	0.0	-
29.6970	4900	0.0	-
30.0	4950	0.0	-

Framework Versions

Python: 3.10.12
SetFit: 1.1.0
Sentence Transformers: 3.3.1
Transformers: 4.44.2
PyTorch: 2.2.0a0+81ea7a4
Datasets: 3.2.0
Tokenizers: 0.19.1

Citation

BibTeX

@article{https://doi.org/10.48550/arxiv.2209.11055,
    doi = {10.48550/ARXIV.2209.11055},
    url = {https://arxiv.org/abs/2209.11055},
    author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
    keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
    title = {Efficient Few-Shot Learning Without Prompts},
    publisher = {arXiv},
    year = {2022},
    copyright = {Creative Commons Attribution 4.0 International}
}