malteklaes
/

based-CodeBERTa-language-id-llm-module_uniVienna

Text Classification

Generated from Trainer

text-embeddings-inference

Model card Files Files and versions

Metrics Training metrics Community

malteklaes commited on Apr 17, 2024

Commit

8c531af

·

verified ·

1 Parent(s): 8d0cfc1

Update README.md

Files changed (1) hide show

README.md +37 -0

README.md CHANGED Viewed

@@ -91,6 +91,43 @@ For a given code, the following programming language can be determined:
 - Ruby
 - C++
 ## Training and evaluation data
 - training arguments:

 - Ruby
 - C++
+## Usage
+```python
+checkpoint = "malteklaes/based-CodeBERTa-language-id-llm-module_uniVienna"
+tokenizer = AutoTokenizer.from_pretrained(checkpoint)
+modelPOST = AutoTokenizer.from_pretrained(checkpoint)
+myPipeline = TextClassificationPipeline(
+    model=AutoModelForSequenceClassification.from_pretrained(checkpoint, ignore_mismatched_sizes=True),
+    tokenizer=AutoTokenizer.from_pretrained(checkpoint)
+)
+CODE_TO_IDENTIFY_py = """
+def is_prime(n):
+    if n <= 1:
+        return False
+    if n == 2 or n == 3:
+        return True
+    if n % 2 == 0:
+        return False
+    max_divisor = int(n ** 0.5)
+    for i in range(3, max_divisor + 1, 2):
+        if n % i == 0:
+            return False
+    return True
+number = 17
+if is_prime(number):
+    print(f"{number} is a prime number.")
+else:
+    print(f"{number} is not a prime number.")
+"""
+myPipeline(CODE_TO_IDENTIFY_py)
+```
 ## Training and evaluation data
 - training arguments: