Model Card: Resume Information Extractor (LLM-based)

Overview

This model is an instruction-tuned version of DeepSeek-R1-Distill-Llama-8B (itself a distilled language model), optimized for extracting structured information from English-language resumes. It was fine-tuned with the Unsloth library, which is used for efficient fine-tuning and inference.

Given a raw resume text, the model outputs structured JSON containing:

  • skills: list of skills mentioned
  • education: list of entries in a simplified school-degree-major format
  • experience: list of job roles

Intended Uses

This model is designed for:

  • HR software to parse applicant resumes automatically
  • Applicant tracking systems (ATS)
  • AI assistants helping with recruiting and screening
  • EdTech or job board platforms classifying user profiles

Example Input Prompt:

You are an experienced HR and now you will review a resume then extract key information from it.

# Input
Here is the resume text:
[PASTE RESUME TEXT HERE]

### Response
<think>

Expected Output:

{
  "skills": [...],
  "education": [...],
  "experience": [...]
}
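
A minimal sketch of how the prompt above could be assembled and how the JSON could be pulled out of the generated text. It assumes that the model closes its reasoning with a `</think>` tag before emitting the JSON, which is typical for DeepSeek-R1 distills but is not stated in this card. The helper names `build_prompt` and `extract_json` are hypothetical.

```python
import json

# Prompt layout copied from the example above; {resume} is the only placeholder.
PROMPT_TEMPLATE = (
    "You are an experienced HR and now you will review a resume "
    "then extract key information from it.\n\n"
    "# Input\n"
    "Here is the resume text:\n"
    "{resume}\n\n"
    "### Response\n"
    "<think>\n"
)


def build_prompt(resume_text: str) -> str:
    """Insert the raw resume text into the prompt template."""
    return PROMPT_TEMPLATE.format(resume=resume_text.strip())


def extract_json(generated: str) -> dict:
    """Return the first JSON object found after the reasoning section.

    Assumption: the reasoning ends with "</think>". If the tag is absent,
    the whole generated string is searched instead.
    """
    _, _, tail = generated.rpartition("</think>")
    decoder = json.JSONDecoder()
    start = tail.find("{")
    while start != -1:
        try:
            # raw_decode parses one JSON value and ignores trailing text.
            obj, _ = decoder.raw_decode(tail, start)
            return obj
        except json.JSONDecodeError:
            # Not valid JSON at this brace; try the next one.
            start = tail.find("{", start + 1)
    raise ValueError("no JSON object found in model output")
```

The string returned by `build_prompt` would be passed to the model, and only the text generated after the prompt would be given to `extract_json`.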

Training & Technical Details

  • Base model: unsloth/DeepSeek-R1-Distill-Llama-8B
  • Library: Unsloth with support for 4-bit quantization (bitsandbytes)
  • Fine-tuning style: Instruction-tuning using formatted HR task prompts
  • Max sequence length: 8096 tokens
  • Hardware requirements: ~16 GB of GPU memory (with 4-bit loading)
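
The settings listed above could be applied when loading with Unsloth roughly as follows. This is a sketch, not the author's training or inference script: it loads the base model named in this card, because the repository id of the fine-tuned weights is not given here, and the function name `load_model` is hypothetical. Running it requires the `unsloth` package and a CUDA GPU.

```python
BASE_MODEL = "unsloth/DeepSeek-R1-Distill-Llama-8B"  # base model named in this card
MAX_SEQ_LENGTH = 8096  # max sequence length as stated in this card


def load_model(model_name: str = BASE_MODEL):
    """Load a model in 4-bit with Unsloth and switch it to inference mode.

    Pass the fine-tuned repository id as model_name to load this model
    instead of the base model (the id is not stated in the card).
    """
    # Imported lazily so the module can be imported without a GPU.
    from unsloth import FastLanguageModel

    model, tokenizer = FastLanguageModel.from_pretrained(
        model_name=model_name,
        max_seq_length=MAX_SEQ_LENGTH,
        load_in_4bit=True,  # 4-bit quantization via bitsandbytes
    )
    FastLanguageModel.for_inference(model)
    return model, tokenizer
```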

Limitations

  • Performance may degrade on non-English or poorly formatted resumes
  • Experience extraction returns job roles only; company names and dates are not extracted
  • Multilingual documents (resumes that mix several languages) are not supported
  • The model does not validate its output against a schema; use an external validator if needed
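
Since the model does not check its own output, a small external validator along these lines may help. It is a sketch that assumes each of the three fields is a list of strings; the card only says they are lists, so adjust the item check if your outputs contain nested objects. The function name `validate_resume_output` is hypothetical.

```python
from typing import List

# The three top-level keys described in this card.
REQUIRED_KEYS = ("skills", "education", "experience")


def validate_resume_output(data: object) -> List[str]:
    """Return a list of problems found; an empty list means the output passed."""
    if not isinstance(data, dict):
        return ["top-level value must be a JSON object"]

    errors: List[str] = []
    for key in REQUIRED_KEYS:
        if key not in data:
            errors.append(f"missing key: {key}")
            continue
        value = data[key]
        if not isinstance(value, list):
            errors.append(f"{key} must be a list")
        # Assumption: list items are plain strings.
        elif not all(isinstance(item, str) for item in value):
            errors.append(f"{key} must contain only strings")
    return errors
```

A caller could retry generation or flag the resume for manual review whenever the returned list is not empty.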

Citation

If you use this model, please cite the following components:

License

Apache 2.0
