I build production LLMs & RAG systems for real products

ML Engineer specializing in fine-tuning transformers, edge AI deployment, and RAG pipelines. Currently exploring agentic AI and vector databases.

Experience

Python Developer Intern
Helo.ai by Vivaconnect — Mumbai, Maharashtra
Jan 2026 – Present
  • Developed PySpark behavioral profiling pipeline (“WHEN” model) with Docker deployment.
  • Built Customer Data Platform using RFM features and K-Means clustering on 500K+ records.
  • Engineered Python/PostgreSQL pipelines for churn risk and customer analytics.
  • Trained ML models (Random Forest etc.) for churn and value scoring.
PYSPARK MACHINE LEARNING CUSTOMER ANALYTICS DOCKER POSTGRESQL DATA PIPELINES

About Me

Hey, I'm Zayed :), an AI & Data Science student from Mumbai who's genuinely obsessed with building things that work, not just things that demo well.

I got into AI because I wanted to understand how machines actually learn, not just call an API and call it a day. That curiosity pushed me toward fine-tuning transformers, building RAG systems from scratch, and deploying models on edge hardware like Raspberry Pi real constraints, real tradeoffs.

4+
Projects Shipped
500K+
Records Processed
3+
Models Deployed
$35
Cheapest Deploy

Agentic AI Systems

Building multi-step AI agents with LangGraph that can reason, plan, and act autonomously across tasks.

Advanced Vector Databases

Going deeper into Pinecone and Weaviate for production-grade semantic search at scale.

MLOps & Experiment Tracking

Learning MLflow and Weights & Biases to bring proper experiment tracking into my ML workflow.

Technical Writing

Documenting what I build — because the best engineers can explain their systems as well as they code them.

Experience

Python Developer Intern
Helo.ai by Vivaconnect — Mumbai, Maharashtra
Jan 2026 – Present
  • Developed PySpark behavioral profiling pipeline ("WHEN" model) with Docker deployment.
  • Built Customer Data Platform using RFM features and K-Means clustering on 500K+ records.
  • Engineered Python/PostgreSQL pipelines for churn risk and customer analytics.
  • Trained ML models (Random Forest etc.) for churn and value scoring.
PYSPARK MACHINE LEARNING CUSTOMER ANALYTICS DOCKER POSTGRESQL DATA PIPELINES

Projects

Multi-Document RAG System

Production-ready Retrieval-Augmented Generation system enabling multi-document querying across PDFs with automatic source citations. Processes 27-page documents with sub-second response times and 90%+ answer relevance.

LangChain ChromaDB Gemini API Streamlit RAG
90%+
Answer Relevance (via RAGAS eval)
40%
Coherence Improvement (Ollama → Gemini)

Sonnet Generator

Fine-tuned GPT-2 on Shakespeare's sonnets to generate real-time poetry with preserved meter and rhyme schemes. Engineered custom loss functions optimizing for poetic structure.

GPT-2 Fine-tuning PyTorch Flask Docker
90%
User Satisfaction (beta tester survey)
15%
BLEU Improvement (vs baseline GPT-2)

Real-Time Pothole Detection

Contributed to an ensemble CNN (Xception + InceptionV3) for road condition monitoring. Deployed on Raspberry Pi for edge inference, demonstrating low-cost AI solution for infrastructure monitoring. My contributions focused on model optimization and edge deployment.

Computer Vision Transfer Learning Edge AI Raspberry Pi OpenCV
93%
Accuracy
$35
Hardware Cost

Classify-153

Wildlife species classifier processing 10K+ images across 153 classes. Optimized with ONNX runtime for real-time inference. Includes Grad-CAM visualizations for model interpretability.

InceptionV3 Xception ONNX Grad-CAM Hugging Face
96%
Accuracy
25%
Latency Reduction

Skills & Technologies

Machine Learning

  • PyTorch & TensorFlow
  • Hugging Face Transformers
  • Fine-tuning & Transfer Learning
  • Computer Vision (CNNs, Object Detection)
  • NLP & Text Generation

LLM & AI

  • LangChain & LangGraph
  • RAG Systems
  • Vector Databases (ChromaDB, Pinecone)
  • Prompt Engineering
  • Model Optimization (LoRA, QLoRA)

MLOps & Deployment

  • Docker & Containerization
  • Hugging Face Spaces
  • Git & CI/CD
  • Flask & FastAPI
  • AWS Cloud Services

Programming

  • Python (Advanced)
  • C++ (Intermediate)
  • JavaScript (Basics)
  • SQL & Data Processing
  • NumPy, Pandas, scikit-learn

Let's Connect