--- license: apache-2.0 datasets: - harixn/indian_news_sentiment language: - en base_model: - google-bert/bert-base-uncased pipeline_tag: text-classification tags: - finance --- # FinBERT Model Card ## Model Details * **Model Name:** FinBERT * **Model Type:** BERT (bert-base-uncased) * **Task:** Sentiment Analysis (Stock Market) * **Number of Labels:** 3 (positive, negative, neutral) * **Intended Use:** Predict sentiment of financial news and social media posts related to the Indian stock market. ## How to Use ```python import torch from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("harixn/IN-finbert") model = AutoModelForSequenceClassification.from_pretrained("harixn/IN-finbert") text = "The stock price of XYZ surged today." inputs = tokenizer(text, return_tensors="pt") outputs = model(**inputs) # Get probabilities probs = torch.softmax(outputs.logits, dim=1) print("Probabilities:", probs) # Get predicted class pred_class = torch.argmax(probs, dim=1).item() classes = ["negative", "neutral", "positive"] print("Predicted class:", classes[pred_class]) ``` ## Training Data * Fine-tuned on labeled Indian stock market news and social media datasets. * Labels: positive, negative, neutral. ## Limitations and Risks * Trained specifically for Indian stock market context. * May not generalize well to other financial markets. * Predictions should not be used as financial advice. ## Evaluation * Evaluated on held-out validation set of Indian stock market texts. * Metrics: Accuracy, F1-score per class. ## Model Files * `pytorch_model.bin`: Trained model weights * `config.json`: Model configuration * `vocab.txt`, `tokenizer_config.json`, `special_tokens_map.json`, `tokenizer.json`: Tokenizer files ## Citation If you use this model, please cite it as: ``` FinBERT: Sentiment Analysis Model for Indian Stock Market, harixn, 2025 ```