💼 DeepMatch: Transformer-Based Resume–JD Matching

DeepMatch is an advanced pipeline that revolutionizes resume-to-job-description matching by leveraging Named Entity Recognition (NER) and transformer-based embeddings. Extract structured data, compute semantic similarities, and automate candidate screening with precision! 🚀

📜 Table of Contents

🌟 Overview

DeepMatch uses state-of-the-art NLP to extract structured entities (e.g., skills, experience, degrees) from resumes and job descriptions, then compares them using dense vector embeddings. It supports semantic matching, skill relevance scoring, and automated candidate ranking — making it ideal for modern HR automation.

🔍 Features

🧠 Named Entity Recognition (NER)

Extract structured information from resumes and job descriptions with high accuracy.

Models Used:

spaCy
- en_core_web_sm (pretrained, lightweight)
- en_core_web_trf (transformer-based, high accuracy)
- Custom-trained spaCy model on resume NER data
Hugging Face Transformers
- bert-base-cased
- distilbert-base-uncased

Entities Extracted:

Name
Email
Phone
Location
Degree
Designation
Company
Years of Experience
Skills

📊 Embedding Models

Converts entity-level text into dense vectors for semantic comparison.

Models Supported:

all-MiniLM-L6-v2
paraphrase-MiniLM-L12-v2
sentence-t5-base
sentence-t5-large

Embedding Modes:

Per-Entity: Individual embeddings for each entity
Combined: Joint embeddings for concatenated entities

📏 Similarity Scoring

Measures alignment between resumes and job descriptions.

Metrics:

Cosine Similarity (default)
Dot Product (alternative)
Euclidean Distance (optional)

Scoring Options:

Per-entity similarity for granular insights
Joint profile-level comparison for overall match

🧪 Example Outputs

Sample NER Output
Similarity Score Heatmap

(Files available in the output/ folder.)

⚙️ Setup Instructions

Get DeepMatch up and running with these simple steps:

🔁 1. Clone the Repository

git clone https://github.com/prakadeesh01/deepmatch.git
cd deepmatch

📦 2. Install Dependencies

pip install -r requirements.txt

▶️ 3. Run the Notebooks

jupyter notebook

📁 Data Notes

Input:
Place resumes and job descriptions in the data/ folder.
Supported Formats: .pdf, .docx

Output:
NER results, embeddings, and similarity scores are saved in output/.

Privacy:
No actual resume data is included in the majority of the repository to protect personal information.

💼 Use Cases

DeepMatch powers a range of HR and recruitment solutions:

✅ Resume Screening Systems: Automate candidate evaluation with precision.
✅ Job Recommendation Engines: Match candidates to ideal roles.
✅ Candidate–Job Fit Matching: Rank candidates by semantic alignment.
✅ Automated Skill Gap Analysis: Identify areas for upskilling.

🛠️ Tech Stack

Languages: Python 3.9+
NER: spaCy, Hugging Face Transformers
Embeddings: SentenceTransformers, T5
Similarity Metrics: scikit-learn, SciPy
Environment: Jupyter Notebooks, VS Code

📜 License

This project is licensed under the MIT License.

👨‍💻 Author

Prakadeesh K S
GitHub: @prakadeesh01

🙏 Acknowledgements

spaCy for robust NER capabilities
Hugging Face Transformers for pretrained language models
SentenceTransformers for efficient semantic embeddings

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
img		img
output		output
.gitignore		.gitignore
EMBED_MODELS.ipynb		EMBED_MODELS.ipynb
LICENSE		LICENSE
NER_MODELS.ipynb		NER_MODELS.ipynb
README.md		README.md
deepmatch_workflow.ipynb		deepmatch_workflow.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

💼 DeepMatch: Transformer-Based Resume–JD Matching

📜 Table of Contents

🌟 Overview

🔍 Features

🧠 Named Entity Recognition (NER)

📊 Embedding Models

📏 Similarity Scoring

🧪 Example Outputs

⚙️ Setup Instructions

🔁 1. Clone the Repository

📦 2. Install Dependencies

▶️ 3. Run the Notebooks

📁 Data Notes

💼 Use Cases

🛠️ Tech Stack

📜 License

👨‍💻 Author

🙏 Acknowledgements

About

Uh oh!

Releases

Packages

Languages

License

prakadeesh01/deepmatch-x

Folders and files

Latest commit

History

Repository files navigation

💼 DeepMatch: Transformer-Based Resume–JD Matching

📜 Table of Contents

🌟 Overview

🔍 Features

🧠 Named Entity Recognition (NER)

📊 Embedding Models

📏 Similarity Scoring

🧪 Example Outputs

⚙️ Setup Instructions

🔁 1. Clone the Repository

📦 2. Install Dependencies

▶️ 3. Run the Notebooks

📁 Data Notes

💼 Use Cases

🛠️ Tech Stack

📜 License

👨‍💻 Author

🙏 Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages