File size: 1,517 Bytes
30bc39c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
<<<<<<< HEAD
# Risk-clause-classifiers
=======
---
license: mit
---
>>>>>>> 17efa33e6a905f74adced154117a7489f1672bab


# Risk Clause Classifier

A lightweight AI system for classifying legal contract clauses into:

- Clause Category (10 classes)
- Risk Level (Low / Medium / High)

This project is designed for low-resource environments and CPU-based execution.

## πŸ” Project Overview

The system uses a distilled domain-specific language model to generate clause embeddings and applies lightweight classifiers for:

- Category Prediction
- Risk Assessment

The goal is to build an efficient, scalable, and hackathon-ready legal clause analysis tool.

## πŸ— Architecture

Clause Text  
β†’ Transformer Embeddings  
β†’ Category Classifier  
β†’ Risk Level Classifier  

## βš™ Tech Stack

- Python
- Transformers
- Hugging Face Hub
- Scikit-learn
- FastAPI
- Git & GitHub

## πŸ“‚ Repository Structure

- `/src` – Training & inference scripts  
- `/models` – Saved classifier models  
- `/app.py` – API server  
- `requirements.txt` – Dependencies  

## πŸš€ Deployment

The model files are hosted on Hugging Face.  
Codebase is maintained on GitHub for team collaboration.

## πŸ‘₯ Team Collaboration

- GitHub β†’ Code management  
- Hugging Face β†’ Model storage  
- Multi-device sync enabled via Git remotes  

## πŸ“Œ Hackathon Focus

- CPU-friendly design  
- Lightweight inference  
- Structured JSON output  
- Legal domain specialization  

---

Built for efficient legal risk clause analysis.