Placement Season Prediction Model
Model Details
- Model Name: Placement Season Prediction
- Model Type: Random Forest Classifier
- Framework: scikit-learn
- Language: Python
- Task: Binary Classification
- License: Apache 2.0
- Author: SkyLeopard
This model predicts whether a college student will get placed based on academic and experiential features.
Model Description
This is a supervised machine learning model trained to predict student placement outcome (Yes/No) using structured tabular data. The model uses a Random Forest classifier trained on preprocessed student attributes such as:
- IQ
- CGPA
- Academic performance
- Internship experience
- Projects
- Certifications
- Technical and soft skills
The final output is a binary prediction:
1โ Placed0โ Not Placed
Example Applications
- Colleges analyzing placement trends
- Students estimating placement probability
- Career guidance systems
Limitations
- Trained on a single dataset, may not generalize to all institutions.
- Performance depends heavily on feature quality.
- Should not be used for high-stakes real-world decisions (e.g., hiring, admissions).
Training Data
The model was trained on a structured CSV dataset containing student records with features such as:
| Column Name | Type | Description |
|---|---|---|
College_ID |
Categorical (string) | Unique college/student identifier (dropped during training) |
IQ |
Integer | IQ score of the student |
Prev_Sem_Result |
Float | Previous semester result (GPA/percentage) |
CGPA |
Float | Cumulative Grade Point Average |
Academic_Performance |
Integer | Academic performance rating |
Internship_Experience |
Categorical | Internship experience (Yes / No) |
Extra_Curricular_Score |
Integer | Score for extracurricular activities |
Communication_Skills |
Integer | Communication skills rating |
Projects_Completed |
Integer | Number of projects completed |
Target column:
Placement(mapped to 0/1)
Preprocessing
The following preprocessing steps were applied:
Dropped non-informative ID column.
Categorical mapping:
Yes โ 1No โ 0
Feature scaling using StandardScaler.
Train-test split.
Evaluation Results
Evaluation performed on a held-out test set.
Metrics used:
Accuracy: ~0.99 (approx)
How to Use
Get model from the hub
from huggingface_hub import hf_hub_url, cached_download
import joblib
import pandas as pd
REPO_ID = "SkyLeopard/placement-season"
FILENAME = "rf-placement.pkl"
model = joblib.load(
cached_download(
hf_hub_url(REPO_ID, FILENAME)
)
)
# model is a sklearn RandomForestClassifier
Get sample data from this repo
data_file = cached_download(
hf_hub_url(REPO_ID, "college_student_placement_dataset.csv")
)
df = pd.read_csv(data_file)
X = df.drop(["College_ID", "Placement"], axis=1)
y = df["Placement"]
print(X.head(3))
Predict
# Encode Internship_Experience manually
X["Internship_Experience"] = X["Internship_Experience"].map({"Yes": 1, "No": 0})
labels = model.predict(X[:3])
print(labels)
Eval
model.score(X, y)
Ethical Considerations
- Predictions may reflect biases in the training dataset.
- Should not be used to deny opportunities.
- Use as a decision-support tool only.
Model Card Authors
- SkyLeopard
Citation
If you use this model:
@misc{placement_season_model,
author = {SkyLeopard},
title = {Placement Season Prediction Model},
year = {2026},
publisher = {Hugging Face}
}
- Downloads last month
- -