kantundpeterpan
/

skopush-test

Text Classification

Scikit-learn

skops

Model card Files Files and versions

xet

Community

kantundpeterpan commited on Feb 2, 2025

Commit

bf34d20

verified ·

1 Parent(s): 05a7f14

push push push

Browse files

Files changed (2) hide show

README.md +6 -6
skops.yaml +52 -0

README.md CHANGED Viewed

@@ -32,16 +32,16 @@ Trained with a lot of care
 | Hyperparameter                | Value                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
 |-------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
 | memory                        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
-| steps                         | [('lemmatizer', FunctionTransformer(func=<function lemmatize_X at 0x7f5c1a052ca0>)), ('tfidf', TfidfVectorizer(max_df=0.95, min_df=2,<br />                stop_words=['if', 'when', 'most', 'ourselves', 'your', 'having',<br />                            "didn't", '@', "you've", 'hasn', 'at', "mightn't",<br />                            "mustn't", 'these', "it's", 'our', 'had', 'll',<br />                            'too', 'this', 'by', 'it', 'further', 'wasn',<br />                            'before', 'all', '{', 'herself', 'other', 'above', ...],<br />                tokenizer=<function tokenize_quote at 0x7f5c1a09da60>)), ('rf', RandomForestClassifier())]                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | transform_input               |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | verbose                       | False                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
-| lemmatizer                    | FunctionTransformer(func=<function lemmatize_X at 0x7f5c1a052ca0>)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
-| tfidf                         | TfidfVectorizer(max_df=0.95, min_df=2,<br />                stop_words=['if', 'when', 'most', 'ourselves', 'your', 'having',<br />                            "didn't", '@', "you've", 'hasn', 'at', "mightn't",<br />                            "mustn't", 'these', "it's", 'our', 'had', 'll',<br />                            'too', 'this', 'by', 'it', 'further', 'wasn',<br />                            'before', 'all', '{', 'herself', 'other', 'above', ...],<br />                tokenizer=<function tokenize_quote at 0x7f5c1a09da60>)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | rf                            | RandomForestClassifier()                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
 | lemmatizer__accept_sparse     | False                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
 | lemmatizer__check_inverse     | True                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
 | lemmatizer__feature_names_out |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
-| lemmatizer__func              | <function lemmatize_X at 0x7f5c1a052ca0>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
 | lemmatizer__inv_kw_args       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | lemmatizer__inverse_func      |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | lemmatizer__kw_args           |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
@@ -64,7 +64,7 @@ Trained with a lot of care
 | tfidf__strip_accents          |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | tfidf__sublinear_tf           | False                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
 | tfidf__token_pattern          | (?u)\b\w\w+\b                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
-| tfidf__tokenizer              | <function tokenize_quote at 0x7f5c1a09da60>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
 | tfidf__use_idf                | True                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
 | tfidf__vocabulary             |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | rf__bootstrap                 | True                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
@@ -168,7 +168,7 @@ div.sk-label-container:hover .sk-estimator-doc-link.fitted:hover,
 #sk-container-id-1 a.estimator_doc_link:hover {/* unfitted */background-color: var(--sklearn-color-unfitted-level-3);color: var(--sklearn-color-background);text-decoration: none;
 }#sk-container-id-1 a.estimator_doc_link.fitted:hover {/* fitted */background-color: var(--sklearn-color-fitted-level-3);
 }
-</style><div id="sk-container-id-1" class="sk-top-container" style="overflow: auto;"><div class="sk-text-repr-fallback"><pre>Pipeline(steps=[(&#x27;lemmatizer&#x27;,FunctionTransformer(func=&lt;function lemmatize_X at 0x7f5c1a052ca0&gt;)),(&#x27;tfidf&#x27;,TfidfVectorizer(max_df=0.95, min_df=2,stop_words=[&#x27;if&#x27;, &#x27;when&#x27;, &#x27;most&#x27;, &#x27;ourselves&#x27;,&#x27;your&#x27;, &#x27;having&#x27;, &quot;didn&#x27;t&quot;, &#x27;@&#x27;,&quot;you&#x27;ve&quot;, &#x27;hasn&#x27;, &#x27;at&#x27;, &quot;mightn&#x27;t&quot;,&quot;mustn&#x27;t&quot;, &#x27;these&#x27;, &quot;it&#x27;s&quot;, &#x27;our&#x27;,&#x27;had&#x27;, &#x27;ll&#x27;, &#x27;too&#x27;, &#x27;this&#x27;, &#x27;by&#x27;,&#x27;it&#x27;, &#x27;further&#x27;, &#x27;wasn&#x27;, &#x27;before&#x27;,&#x27;all&#x27;, &#x27;{&#x27;, &#x27;herself&#x27;, &#x27;other&#x27;,&#x27;above&#x27;, ...],tokenizer=&lt;function tokenize_quote at 0x7f5c1a09da60&gt;)),(&#x27;rf&#x27;, RandomForestClassifier())])</pre><b>In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook. <br />On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.</b></div><div class="sk-container" hidden><div class="sk-item sk-dashed-wrapped"><div class="sk-label-container"><div class="sk-label fitted sk-toggleable"><input class="sk-toggleable__control sk-hidden--visually" id="sk-estimator-id-1" type="checkbox" ><label for="sk-estimator-id-1" class="sk-toggleable__label fitted sk-toggleable__label-arrow"><div><div>Pipeline</div></div><div><span class="sk-estimator-doc-link fitted">i<span>Fitted</span></span></div></label><div class="sk-toggleable__content fitted"><pre>Pipeline(steps=[(&#x27;lemmatizer&#x27;,FunctionTransformer(func=&lt;function lemmatize_X at 0x7f5c1a052ca0&gt;)),(&#x27;tfidf&#x27;,TfidfVectorizer(max_df=0.95, min_df=2,stop_words=[&#x27;if&#x27;, &#x27;when&#x27;, &#x27;most&#x27;, &#x27;ourselves&#x27;,&#x27;your&#x27;, &#x27;having&#x27;, &quot;didn&#x27;t&quot;, &#x27;@&#x27;,&quot;you&#x27;ve&quot;, &#x27;hasn&#x27;, &#x27;at&#x27;, &quot;mightn&#x27;t&quot;,&quot;mustn&#x27;t&quot;, &#x27;these&#x27;, &quot;it&#x27;s&quot;, &#x27;our&#x27;,&#x27;had&#x27;, &#x27;ll&#x27;, &#x27;too&#x27;, &#x27;this&#x27;, &#x27;by&#x27;,&#x27;it&#x27;, &#x27;further&#x27;, &#x27;wasn&#x27;, &#x27;before&#x27;,&#x27;all&#x27;, &#x27;{&#x27;, &#x27;herself&#x27;, &#x27;other&#x27;,&#x27;above&#x27;, ...],tokenizer=&lt;function tokenize_quote at 0x7f5c1a09da60&gt;)),(&#x27;rf&#x27;, RandomForestClassifier())])</pre></div> </div></div><div class="sk-serial"><div class="sk-item"><div class="sk-estimator fitted sk-toggleable"><input class="sk-toggleable__control sk-hidden--visually" id="sk-estimator-id-2" type="checkbox" ><label for="sk-estimator-id-2" class="sk-toggleable__label fitted sk-toggleable__label-arrow"><div><div>lemmatize_X</div><div class="caption">FunctionTransformer</div></div><div><a class="sk-estimator-doc-link fitted" rel="noreferrer" target="_blank" href="https://scikit-learn.org/1.6/modules/generated/sklearn.preprocessing.FunctionTransformer.html">?<span>Documentation for FunctionTransformer</span></a></div></label><div class="sk-toggleable__content fitted"><pre>FunctionTransformer(func=&lt;function lemmatize_X at 0x7f5c1a052ca0&gt;)</pre></div> </div></div><div class="sk-item"><div class="sk-estimator fitted sk-toggleable"><input class="sk-toggleable__control sk-hidden--visually" id="sk-estimator-id-3" type="checkbox" ><label for="sk-estimator-id-3" class="sk-toggleable__label fitted sk-toggleable__label-arrow"><div><div>TfidfVectorizer</div></div><div><a class="sk-estimator-doc-link fitted" rel="noreferrer" target="_blank" href="https://scikit-learn.org/1.6/modules/generated/sklearn.feature_extraction.text.TfidfVectorizer.html">?<span>Documentation for TfidfVectorizer</span></a></div></label><div class="sk-toggleable__content fitted"><pre>TfidfVectorizer(max_df=0.95, min_df=2,stop_words=[&#x27;if&#x27;, &#x27;when&#x27;, &#x27;most&#x27;, &#x27;ourselves&#x27;, &#x27;your&#x27;, &#x27;having&#x27;,&quot;didn&#x27;t&quot;, &#x27;@&#x27;, &quot;you&#x27;ve&quot;, &#x27;hasn&#x27;, &#x27;at&#x27;, &quot;mightn&#x27;t&quot;,&quot;mustn&#x27;t&quot;, &#x27;these&#x27;, &quot;it&#x27;s&quot;, &#x27;our&#x27;, &#x27;had&#x27;, &#x27;ll&#x27;,&#x27;too&#x27;, &#x27;this&#x27;, &#x27;by&#x27;, &#x27;it&#x27;, &#x27;further&#x27;, &#x27;wasn&#x27;,&#x27;before&#x27;, &#x27;all&#x27;, &#x27;{&#x27;, &#x27;herself&#x27;, &#x27;other&#x27;, &#x27;above&#x27;, ...],tokenizer=&lt;function tokenize_quote at 0x7f5c1a09da60&gt;)</pre></div> </div></div><div class="sk-item"><div class="sk-estimator fitted sk-toggleable"><input class="sk-toggleable__control sk-hidden--visually" id="sk-estimator-id-4" type="checkbox" ><label for="sk-estimator-id-4" class="sk-toggleable__label fitted sk-toggleable__label-arrow"><div><div>RandomForestClassifier</div></div><div><a class="sk-estimator-doc-link fitted" rel="noreferrer" target="_blank" href="https://scikit-learn.org/1.6/modules/generated/sklearn.ensemble.RandomForestClassifier.html">?<span>Documentation for RandomForestClassifier</span></a></div></label><div class="sk-toggleable__content fitted"><pre>RandomForestClassifier()</pre></div> </div></div></div></div></div></div>
 ## Evaluation Results

 | Hyperparameter                | Value                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
 |-------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
 | memory                        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
+| steps                         | [('lemmatizer', FunctionTransformer(func=<function lemmatize_X at 0x7f79b376cca0>)), ('tfidf', TfidfVectorizer(max_df=0.95, min_df=2,<br />                stop_words=['if', 'when', 'most', 'ourselves', 'your', 'having',<br />                            "didn't", '@', "you've", 'hasn', 'at', "mightn't",<br />                            "mustn't", 'these', "it's", 'our', 'had', 'll',<br />                            'too', 'this', 'by', 'it', 'further', 'wasn',<br />                            'before', 'all', '{', 'herself', 'other', 'above', ...],<br />                tokenizer=<function tokenize_quote at 0x7f79b37b1a60>)), ('rf', RandomForestClassifier())]                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | transform_input               |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | verbose                       | False                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
+| lemmatizer                    | FunctionTransformer(func=<function lemmatize_X at 0x7f79b376cca0>)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
+| tfidf                         | TfidfVectorizer(max_df=0.95, min_df=2,<br />                stop_words=['if', 'when', 'most', 'ourselves', 'your', 'having',<br />                            "didn't", '@', "you've", 'hasn', 'at', "mightn't",<br />                            "mustn't", 'these', "it's", 'our', 'had', 'll',<br />                            'too', 'this', 'by', 'it', 'further', 'wasn',<br />                            'before', 'all', '{', 'herself', 'other', 'above', ...],<br />                tokenizer=<function tokenize_quote at 0x7f79b37b1a60>)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | rf                            | RandomForestClassifier()                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
 | lemmatizer__accept_sparse     | False                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
 | lemmatizer__check_inverse     | True                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
 | lemmatizer__feature_names_out |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
+| lemmatizer__func              | <function lemmatize_X at 0x7f79b376cca0>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
 | lemmatizer__inv_kw_args       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | lemmatizer__inverse_func      |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | lemmatizer__kw_args           |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | tfidf__strip_accents          |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | tfidf__sublinear_tf           | False                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
 | tfidf__token_pattern          | (?u)\b\w\w+\b                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
+| tfidf__tokenizer              | <function tokenize_quote at 0x7f79b37b1a60>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
 | tfidf__use_idf                | True                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
 | tfidf__vocabulary             |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | rf__bootstrap                 | True                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
 #sk-container-id-1 a.estimator_doc_link:hover {/* unfitted */background-color: var(--sklearn-color-unfitted-level-3);color: var(--sklearn-color-background);text-decoration: none;
 }#sk-container-id-1 a.estimator_doc_link.fitted:hover {/* fitted */background-color: var(--sklearn-color-fitted-level-3);
 }
+</style><div id="sk-container-id-1" class="sk-top-container" style="overflow: auto;"><div class="sk-text-repr-fallback"><pre>Pipeline(steps=[(&#x27;lemmatizer&#x27;,FunctionTransformer(func=&lt;function lemmatize_X at 0x7f79b376cca0&gt;)),(&#x27;tfidf&#x27;,TfidfVectorizer(max_df=0.95, min_df=2,stop_words=[&#x27;if&#x27;, &#x27;when&#x27;, &#x27;most&#x27;, &#x27;ourselves&#x27;,&#x27;your&#x27;, &#x27;having&#x27;, &quot;didn&#x27;t&quot;, &#x27;@&#x27;,&quot;you&#x27;ve&quot;, &#x27;hasn&#x27;, &#x27;at&#x27;, &quot;mightn&#x27;t&quot;,&quot;mustn&#x27;t&quot;, &#x27;these&#x27;, &quot;it&#x27;s&quot;, &#x27;our&#x27;,&#x27;had&#x27;, &#x27;ll&#x27;, &#x27;too&#x27;, &#x27;this&#x27;, &#x27;by&#x27;,&#x27;it&#x27;, &#x27;further&#x27;, &#x27;wasn&#x27;, &#x27;before&#x27;,&#x27;all&#x27;, &#x27;{&#x27;, &#x27;herself&#x27;, &#x27;other&#x27;,&#x27;above&#x27;, ...],tokenizer=&lt;function tokenize_quote at 0x7f79b37b1a60&gt;)),(&#x27;rf&#x27;, RandomForestClassifier())])</pre><b>In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook. <br />On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.</b></div><div class="sk-container" hidden><div class="sk-item sk-dashed-wrapped"><div class="sk-label-container"><div class="sk-label fitted sk-toggleable"><input class="sk-toggleable__control sk-hidden--visually" id="sk-estimator-id-1" type="checkbox" ><label for="sk-estimator-id-1" class="sk-toggleable__label fitted sk-toggleable__label-arrow"><div><div>Pipeline</div></div><div><span class="sk-estimator-doc-link fitted">i<span>Fitted</span></span></div></label><div class="sk-toggleable__content fitted"><pre>Pipeline(steps=[(&#x27;lemmatizer&#x27;,FunctionTransformer(func=&lt;function lemmatize_X at 0x7f79b376cca0&gt;)),(&#x27;tfidf&#x27;,TfidfVectorizer(max_df=0.95, min_df=2,stop_words=[&#x27;if&#x27;, &#x27;when&#x27;, &#x27;most&#x27;, &#x27;ourselves&#x27;,&#x27;your&#x27;, &#x27;having&#x27;, &quot;didn&#x27;t&quot;, &#x27;@&#x27;,&quot;you&#x27;ve&quot;, &#x27;hasn&#x27;, &#x27;at&#x27;, &quot;mightn&#x27;t&quot;,&quot;mustn&#x27;t&quot;, &#x27;these&#x27;, &quot;it&#x27;s&quot;, &#x27;our&#x27;,&#x27;had&#x27;, &#x27;ll&#x27;, &#x27;too&#x27;, &#x27;this&#x27;, &#x27;by&#x27;,&#x27;it&#x27;, &#x27;further&#x27;, &#x27;wasn&#x27;, &#x27;before&#x27;,&#x27;all&#x27;, &#x27;{&#x27;, &#x27;herself&#x27;, &#x27;other&#x27;,&#x27;above&#x27;, ...],tokenizer=&lt;function tokenize_quote at 0x7f79b37b1a60&gt;)),(&#x27;rf&#x27;, RandomForestClassifier())])</pre></div> </div></div><div class="sk-serial"><div class="sk-item"><div class="sk-estimator fitted sk-toggleable"><input class="sk-toggleable__control sk-hidden--visually" id="sk-estimator-id-2" type="checkbox" ><label for="sk-estimator-id-2" class="sk-toggleable__label fitted sk-toggleable__label-arrow"><div><div>lemmatize_X</div><div class="caption">FunctionTransformer</div></div><div><a class="sk-estimator-doc-link fitted" rel="noreferrer" target="_blank" href="https://scikit-learn.org/1.6/modules/generated/sklearn.preprocessing.FunctionTransformer.html">?<span>Documentation for FunctionTransformer</span></a></div></label><div class="sk-toggleable__content fitted"><pre>FunctionTransformer(func=&lt;function lemmatize_X at 0x7f79b376cca0&gt;)</pre></div> </div></div><div class="sk-item"><div class="sk-estimator fitted sk-toggleable"><input class="sk-toggleable__control sk-hidden--visually" id="sk-estimator-id-3" type="checkbox" ><label for="sk-estimator-id-3" class="sk-toggleable__label fitted sk-toggleable__label-arrow"><div><div>TfidfVectorizer</div></div><div><a class="sk-estimator-doc-link fitted" rel="noreferrer" target="_blank" href="https://scikit-learn.org/1.6/modules/generated/sklearn.feature_extraction.text.TfidfVectorizer.html">?<span>Documentation for TfidfVectorizer</span></a></div></label><div class="sk-toggleable__content fitted"><pre>TfidfVectorizer(max_df=0.95, min_df=2,stop_words=[&#x27;if&#x27;, &#x27;when&#x27;, &#x27;most&#x27;, &#x27;ourselves&#x27;, &#x27;your&#x27;, &#x27;having&#x27;,&quot;didn&#x27;t&quot;, &#x27;@&#x27;, &quot;you&#x27;ve&quot;, &#x27;hasn&#x27;, &#x27;at&#x27;, &quot;mightn&#x27;t&quot;,&quot;mustn&#x27;t&quot;, &#x27;these&#x27;, &quot;it&#x27;s&quot;, &#x27;our&#x27;, &#x27;had&#x27;, &#x27;ll&#x27;,&#x27;too&#x27;, &#x27;this&#x27;, &#x27;by&#x27;, &#x27;it&#x27;, &#x27;further&#x27;, &#x27;wasn&#x27;,&#x27;before&#x27;, &#x27;all&#x27;, &#x27;{&#x27;, &#x27;herself&#x27;, &#x27;other&#x27;, &#x27;above&#x27;, ...],tokenizer=&lt;function tokenize_quote at 0x7f79b37b1a60&gt;)</pre></div> </div></div><div class="sk-item"><div class="sk-estimator fitted sk-toggleable"><input class="sk-toggleable__control sk-hidden--visually" id="sk-estimator-id-4" type="checkbox" ><label for="sk-estimator-id-4" class="sk-toggleable__label fitted sk-toggleable__label-arrow"><div><div>RandomForestClassifier</div></div><div><a class="sk-estimator-doc-link fitted" rel="noreferrer" target="_blank" href="https://scikit-learn.org/1.6/modules/generated/sklearn.ensemble.RandomForestClassifier.html">?<span>Documentation for RandomForestClassifier</span></a></div></label><div class="sk-toggleable__content fitted"><pre>RandomForestClassifier()</pre></div> </div></div></div></div></div></div>
 ## Evaluation Results

skops.yaml ADDED Viewed

	@@ -0,0 +1,52 @@

+hf_repo: kantundpeterpan/skopush-test
+local_repo:
+  name: tmp
+  init: True
+model_path: tfidf_rf.skops
+dataset:
+  name: "QuotaClimat/frugalaichallenge-text-train"
+  source: datasets
+  target_col: label
+  evaluate_on: test
+model_deps: # import dynamically before loading model, add to repo
+  - tools.py
+deps: # import dynamically and write versions to repo init method
+  - scikit-learn:sklearn
+  - nltk:nltk
+model_card:
+  filename: README.md
+  task: text-classification
+  description:
+    main: |
+      This model is an attempt to solve the 2025 FrugalAI challenge.
+      *Nice*.
+    Intended uses & limitations: |
+      Better than random label assignment, still room for improvement.
+    Training Procedure: |
+      Trained with a lot of care
+  sections:
+    A lot of info: |
+      Does this work?
+  metrics:
+    sklearn: # module name?
+      - accuracy:accuracy_score(normalize=True)
+      - f1_score:f1_score(average="macro")
+    tools:
+      - super_config:test_scorer(blubb=2)
+  confusion_matrix:
+    title: "Confusion Matrix"
+    filename: "confusion_matrix.png"
+    # labels: model_classes
+    plt:
+      xticks:
+        - rotation=90
+push:
+  commit_message: "push push push"
+  create_remote: True