I build fast ML systems & low-level AI pipelines

ML Engineer who goes deep from training pipelines and model deployment to systems-level work in Rust and CUDA. I care about what happens after the notebook closes.

About Me

Hey, I'm Zayed :), an AI & Data Science student from Mumbai who's genuinely obsessed with building things that work, not just things that demo well.

I got into AI because I wanted to understand how machines actually learn not just call an API and call it a day. That curiosity pushed me toward ML pipelines, edge deployment, and more recently systems-level engineering in Rust and CUDA. Real constraints, real tradeoffs.

Right now I'm exploring the intersection of ML and low-level systems: writing fast data pipelines, understanding what happens under the hood, and building things that are measurably better not just "it works on my machine."

4+
Projects Shipped
500K+
Records Processed
3+
Models Deployed
$35
Cheapest Deploy

Rust for ML Systems

Building ML pipelines and tokenizers in Rust where Python can't go. Learning the hard way: no GIL, no runtime overhead, no hiding from the compiler.

CUDA & GPU Programming

Writing custom CUDA kernels for ML workloads. Understanding what actually happens on the GPU when a model trains not just calling .cuda().

ML Infrastructure

MLflow experiment tracking, Docker-based deployments, and building systems that make models reproducible and production-ready not just accurate.

Open Source Contribution

Actively looking to contribute to Rust ML crates like linfa and candle. Real codebases, real PRs not README fixes.

Experience

Python Developer Intern
Helo.ai by Vivaconnect Mumbai, Maharashtra
Jan 2026 – Present
  • Built a structured Python logger (no classes, JSON output, file/line metadata) with email alerting; shipped it ahead of schedule and integrated it into a production pipeline.
  • Developed PySpark behavioral profiling pipeline ("WHEN" model) with Docker deployment.
  • Built Customer Data Platform using RFM features and K-Means clustering on 500K+ records.
  • Engineered Python/PostgreSQL pipelines for churn risk and customer analytics.
  • Trained ML models (Random Forest etc.) for churn and value scoring.
PYTHON PYSPARK MACHINE LEARNING DOCKER POSTGRESQL DATA PIPELINES

Projects

ML in Rust From Scratch

Implementing ML algorithms in Rust using the linfa ecosystem. Built a BPE tokenizer from scratch, then trained Decision Tree and Random Forest classifiers on Iris. Manual train/test splits, fixed random seeds, all without Python's safety net.

Rust linfa rustlearn BPE Tokenizer Random Forest
95.6%
RF Accuracy (Iris)
Day 1
First model trained in Rust

Multi-Document RAG System

Production-ready Retrieval-Augmented Generation system enabling multi-document querying across PDFs with automatic source citations. Processes 27-page documents with sub-second response times and 90%+ answer relevance.

LangChain ChromaDB Gemini API Streamlit RAG
90%+
Answer Relevance (via RAGAS eval)
40%
Coherence Improvement (Ollama → Gemini)

Sonnet Generator

Fine-tuned GPT-2 on Shakespeare's sonnets to generate real-time poetry with preserved meter and rhyme schemes. Engineered custom loss functions optimizing for poetic structure.

GPT-2 Fine-tuning PyTorch Flask Docker
90%
User Satisfaction (beta tester survey)
15%
BLEU Improvement (vs baseline GPT-2)

Real-Time Pothole Detection

Contributed to an ensemble CNN (Xception + InceptionV3) for road condition monitoring. Deployed on Raspberry Pi for edge inference, demonstrating low-cost AI solution for infrastructure monitoring. My contributions focused on model optimization and edge deployment.

Computer Vision Transfer Learning Edge AI Raspberry Pi OpenCV
93%
Accuracy
$35
Hardware Cost

Classify-153

Wildlife species classifier processing 10K+ images across 153 classes. Optimized with ONNX runtime for real-time inference. Includes Grad-CAM visualizations for model interpretability.

InceptionV3 Xception ONNX Grad-CAM Hugging Face
96%
Accuracy
25%
Latency Reduction

Skills & Technologies

Machine Learning

  • PyTorch & TensorFlow
  • Hugging Face Transformers
  • Fine-tuning & Transfer Learning
  • Computer Vision (CNNs, Object Detection)
  • NLP & Text Generation

LLM & AI

  • LangChain & LangGraph
  • RAG Systems
  • Vector Databases (ChromaDB, Pinecone)
  • Prompt Engineering
  • Model Optimization (LoRA, QLoRA)

MLOps & Deployment

  • Docker & Containerization
  • MLflow Experiment Tracking
  • Git & CI/CD
  • Flask & FastAPI
  • Hugging Face Spaces

Programming

  • Python (Advanced)
  • Rust LEARNING
  • C++ / CUDA (Intermediate)
  • SQL & Data Processing
  • NumPy, Pandas, scikit-learn

Blog

Posts Coming Soon

Writing about what I actually build and break Rust for ML, CUDA internals, production data pipelines, and the gap between tutorials and real systems.

Rust × ML CUDA Internals ML Pipelines Systems Engineering Learning in Public

Follow along on @zayedansari2004 in the meantime.

Let's Connect