File size: 1,425 Bytes

de86483
 
 
 
 
 
5aad6cd

---
license: mit
language:
- en
library_name: pytorch
---
# Athena- Intent

Classifies intent of the query for Athena

## Architecture
distilbert-base-uncased backbone, finetuned over a multiclass classification problem

### Description

Classifies user intent of queries into the following classes:
0: Keyword Search
1: Semantic Search
2: Direct Question Answering


## Uses

This model is intended to be used in Athena for performing QA on enterprise document stores.


## Bias, Risks, and Limitations

Dataset was generated using ChatGPT (gpt-3.5-turbo). It consists of 5000 English sentences and the nature of their intent, annotated manually.

## Usage

```
from transformers import AutoTokenizer
from transformers import TFDistilBertForSequenceClassification
import tensorflow as tf

model = TFDistilBertForSequenceClassification.from_pretrained("sourcerersupreme/athena-intent")
tokenizer = AutoTokenizer.from_pretrained("sourcerersupreme/athena-intent")

class_semantic_mapping = {
        0: "Keyword",
        1: "Semantic",
        2: "QA"
    }

# Get user input
user_query = "What is a CDP?"

# Encode the user input
inputs = tokenizer(user_query, return_tensors="tf", truncation=True, padding=True)

# Get model predictions
predictions = model(inputs)[0]

# Get predicted class
predicted_class = tf.math.argmax(predictions, axis=-1)

print(f"Predicted class: {class_semantic_mapping[int(predicted_class)]}")
```