Zayed Ansari

Who I Am

About Me

Hey, I'm Zayed :), an AI & Data Science student from Mumbai who's genuinely obsessed with building things that work, not just things that demo well.

I got into AI because I wanted to understand how machines actually learn not just call an API and call it a day. That curiosity pushed me toward ML pipelines, edge deployment, and more recently systems-level engineering in Rust and CUDA. Real constraints, real tradeoffs.

Right now I'm exploring the intersection of ML and low-level systems: writing fast data pipelines, understanding what happens under the hood, and building things that are measurably better not just "it works on my machine."

Projects Shipped

500K+

Records Processed

Models Deployed

$35

Cheapest Deploy

Currently Exploring

Rust for ML Systems

Building ML pipelines and tokenizers in Rust where Python can't go. Learning the hard way: no GIL, no runtime overhead, no hiding from the compiler.

CUDA & GPU Programming

Writing custom CUDA kernels for ML workloads. Understanding what actually happens on the GPU when a model trains not just calling .cuda().

ML Infrastructure

MLflow experiment tracking, Docker-based deployments, and building systems that make models reproducible and production-ready not just accurate.

Open Source Contribution

Actively looking to contribute to Rust ML crates like linfa and candle. Real codebases, real PRs not README fixes.

Career

Experience

Python Developer Intern

Helo.ai by Vivaconnect Mumbai, Maharashtra

Jan 2026 – Present

Built a structured Python logger (no classes, JSON output, file/line metadata) with email alerting; shipped it ahead of schedule and integrated it into a production pipeline.
Developed PySpark behavioral profiling pipeline ("WHEN" model) with Docker deployment.
Built Customer Data Platform using RFM features and K-Means clustering on 500K+ records.
Engineered Python/PostgreSQL pipelines for churn risk and customer analytics.
Trained ML models (Random Forest etc.) for churn and value scoring.

PYTHON PYSPARK MACHINE LEARNING DOCKER POSTGRESQL DATA PIPELINES

Featured Work

Projects

ML in Rust From Scratch

Implementing ML algorithms in Rust using the linfa ecosystem. Built a BPE tokenizer from scratch, then trained Decision Tree and Random Forest classifiers on Iris. Manual train/test splits, fixed random seeds, all without Python's safety net.

Rust linfa rustlearn BPE Tokenizer Random Forest

95.6%

RF Accuracy (Iris)

Day 1

First model trained in Rust

Multi-Document RAG System

Production-ready Retrieval-Augmented Generation system enabling multi-document querying across PDFs with automatic source citations. Processes 27-page documents with sub-second response times and 90%+ answer relevance.

LangChain ChromaDB Gemini API Streamlit RAG

90%+

Answer Relevance (via RAGAS eval)

40%

Coherence Improvement (Ollama → Gemini)

Sonnet Generator

Fine-tuned GPT-2 on Shakespeare's sonnets to generate real-time poetry with preserved meter and rhyme schemes. Engineered custom loss functions optimizing for poetic structure.

GPT-2 Fine-tuning PyTorch Flask Docker

90%

User Satisfaction (beta tester survey)

15%

BLEU Improvement (vs baseline GPT-2)

Collaborative · Code Private

Real-Time Pothole Detection

Contributed to an ensemble CNN (Xception + InceptionV3) for road condition monitoring. Deployed on Raspberry Pi for edge inference, demonstrating low-cost AI solution for infrastructure monitoring. My contributions focused on model optimization and edge deployment.

Computer Vision Transfer Learning Edge AI Raspberry Pi OpenCV

93%

Accuracy

$35

Hardware Cost

Classify-153

Wildlife species classifier processing 10K+ images across 153 classes. Optimized with ONNX runtime for real-time inference. Includes Grad-CAM visualizations for model interpretability.

InceptionV3 Xception ONNX Grad-CAM Hugging Face

96%

Accuracy

25%

Latency Reduction

Tech Stack

Skills & Technologies

Machine Learning

PyTorch & TensorFlow
Hugging Face Transformers
Fine-tuning & Transfer Learning
Computer Vision (CNNs, Object Detection)
NLP & Text Generation

LLM & AI

LangChain & LangGraph
RAG Systems
Vector Databases (ChromaDB, Pinecone)
Prompt Engineering
Model Optimization (LoRA, QLoRA)

MLOps & Deployment

Docker & Containerization
MLflow Experiment Tracking
Git & CI/CD
Flask & FastAPI
Hugging Face Spaces

Programming

Python (Advanced)
Rust LEARNING
C++ / CUDA (Intermediate)
SQL & Data Processing
NumPy, Pandas, scikit-learn

I build fast ML systems & low-level AI pipelines