A scalable document similarity system built using MinHash + LSH, with support for: Local Python processing Apache Spark batch processing Optional Kafka streaming Flask API for ingestion Streamlit ...
Automated-Resume-Screening-Tool/ │ ├── resumes/ # Sample resume files │ ├── candidate_1.txt # Arjun Mehta (Python/NLP expert) │ ├── candidate_2.txt # Sara Lindqvist (ML/scikit-learn) │ ├── candidate_3 ...