ML Internship

Future Interns Projects

3 production-grade ML projects built during January 2026 internship

View on GitHub

3 Projects

Jan 2026 Duration

NLP + ML Domains

LoR Letter of Rec.

FIT/JAN26 CIN Prefix

Internship Overview

Completed an intensive Machine Learning internship at Future Interns in January 2026, building three production-grade ML systems across different domains: NLP-based recruitment screening, text classification for customer support, and time-series demand forecasting.

The internship culminated in a Letter of Recommendation and Certificate of Completion (CIN: FIT/JAN26/ML4913), recognizing work that demonstrated end-to-end ML pipeline design, real-world data handling, and deployment-ready code architecture.

Python Scikit-learn TF-IDF NLP Pandas NumPy Matplotlib Random Forest

Project 1: AI Resume Ranking System

Resume Parsing

Text Extraction, Cleaning, Normalization

→

TF-IDF Vectorization

Feature Extraction, Term Weighting

→

Cosine Ranking

Similarity Scoring, Skill Gap Analysis

Built an AI-powered resume screening system that processes 1000+ resumes using TF-IDF vectorization for candidate-job matching and skill gap analysis. The system transforms both resumes and job descriptions into high-dimensional vector representations, then ranks candidates by cosine similarity.

Processes 1000+ resumes with vectorized NLP operations
TF-IDF vectorization for meaningful skill representation
Cosine similarity scoring for objective candidate ranking
Automated skill gap identification for recruiter feedback
Batch processing with structured CSV output reports

View on GitHub →

Project 2: Support Ticket Classifier

Text Preprocessing

Ticket Parsing, Cleaning, Tokenization

→

Feature Engineering

TF-IDF, N-grams, Category Labels

→

Classification

Ensemble Models, Priority Assignment

Developed an NLP-based support ticket classification and prioritization system. The system automatically categorizes incoming support tickets into predefined categories and assigns priority levels using ensemble machine learning models trained on TF-IDF features.

Multi-class text classification for support ticket categorization
TF-IDF with n-gram features for capturing phrase-level patterns
Ensemble methods (Random Forest + Gradient Boosting) for robust predictions
Automatic priority assignment based on ticket content and category
Evaluation using precision, recall, F1-score, and confusion matrix analysis

View on GitHub →

Project 3: Sales Demand Forecasting

Data Processing

Time Features, Rolling Stats, Lag Variables

→

Model Training

Random Forest Regressor, Cross-Validation

→

Forecasting

Demand Predictions, Trend Visualization

Created a sales demand forecasting model using Random Forest Regressor trained on real-world sales data. The system performs time-series feature engineering — extracting temporal patterns, seasonal components, and lag variables — to predict future sales volumes.

Time-series feature engineering with rolling statistics and lag variables
Random Forest Regressor for non-linear demand pattern capture
Temporal feature extraction (day of week, month, quarter, year)
Rolling mean and rolling standard deviation features for trend capture
Visualizations using Matplotlib for trend analysis and forecast comparison
R-squared and MAE evaluation metrics for regression performance

View on GitHub →

Achievements

Letter of Recommendation — Received from Future Interns recognizing exceptional work across all three projects.
Certificate of Completion — Official ML internship certification (CIN: FIT/JAN26/ML4913).
Production-Grade Code — All three projects follow clean code practices with proper documentation, modular design, and reproducible pipelines.

View All on GitHub ← Back to Portfolio