Skills
Python, C++, CUDA, JavaScript, SQL, PyTorch, Triton, TensorFlow, LangChain,
Scikit-learn, OpenCV, FastAPI, Streamlit, MLflow, PySpark, Git, Azure Databricks,
Docker, Hugging Face, AWS, PostgreSQL.
Featured Projects
A standalone tensor computation library built from scratch with custom C++/CUDA
kernels and Python bindings — no NumPy, PyTorch, or cuBLAS. Implements full
automatic differentiation with a PyTorch-like API, neural network layers,
optimizers, and loss functions. Custom CUDA matmul kernels (tiling, shared memory,
memory coalescing) achieve up to a 2.3× speedup over NumPy. Published on PyPI.
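As a rough illustration of the reverse-mode automatic differentiation described above, here is a minimal scalar sketch in pure Python. The `Value` class and its methods are hypothetical names chosen for this example, not the library's actual API; the library itself works on tensors backed by C++/CUDA kernels.

```python
class Value:
    """Scalar node in a computation graph (hypothetical, for illustration only)."""

    def __init__(self, data, parents=()):
        self.data = float(data)
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None  # pushes self.grad to the parents

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))

        def _backward():
            # d(a + b)/da = 1 and d(a + b)/db = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # d(a * b)/da = b and d(a * b)/db = a
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph, then apply the chain rule in reverse.
        order, seen = [], set()

        def visit(node):
            if id(node) not in seen:
                seen.add(id(node))
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            node._backward()


x = Value(3.0)
y = Value(4.0)
z = x * y + x  # dz/dx = y + 1, dz/dy = x
z.backward()
```

Gradients are accumulated with `+=` so that a value used more than once (like `x` here) receives contributions from every path through the graph.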
A 95M-parameter LLM pretrained from scratch in PyTorch on a single RTX 3070 Ti
(8 GB VRAM), using mixed-precision training (AMP) and gradient accumulation to
avoid out-of-memory (OOM) errors.
Supervised fine-tuned on the Alpaca dataset for instruction-following. Deployed
via Streamlit with dynamic model switching; weights on Hugging Face and live at
tinygpt.streamlit.app.
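The idea behind gradient accumulation can be sketched without PyTorch: gradients from several small micro-batches are averaged before a single weight update, which reproduces the full-batch gradient while only one micro-batch needs to be in memory at a time. The toy one-parameter model and the function names below are assumptions made for this sketch, not code from the project, and mixed precision is not shown.

```python
def mse_grad(w, batch):
    """Gradient of mean((w*x - y)^2) with respect to w over (x, y) pairs."""
    return sum(2.0 * (w * x - y) * x for x, y in batch) / len(batch)


def accumulated_grad(w, data, micro_batch_size):
    """Average gradients over equal-sized micro-batches before one update."""
    micro_batches = [
        data[i:i + micro_batch_size]
        for i in range(0, len(data), micro_batch_size)
    ]
    grad = 0.0
    for micro_batch in micro_batches:
        # Scale each micro-batch gradient so the total equals the full-batch mean.
        grad += mse_grad(w, micro_batch) / len(micro_batches)
    return grad


data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.2)]
w = 0.5
full = mse_grad(w, data)
accum = accumulated_grad(w, data, micro_batch_size=2)
```

The equivalence holds exactly only when all micro-batches have the same size; with a ragged final micro-batch, the scaling would need to weight each gradient by its sample count.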
A multi-agent orchestration platform using LangGraph for intent classification,
planning, and automated tool execution. Scalable workflow engine with custom
scheduling (cron / interval), JWT authentication, and Docker-based deployment.
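One way interval-based scheduling can work is to compute the next fire time from a fixed start time rather than from the last run, so that delays do not accumulate. The sketch below is an assumption about how such a scheduler might be written; `next_interval_run` is a hypothetical helper, not a function from the platform, and cron expressions are not covered.

```python
from datetime import datetime, timedelta


def next_interval_run(start, interval, now):
    """Return the first scheduled time strictly after `now` for a job that
    fires at start, start + interval, start + 2 * interval, and so on."""
    if interval <= timedelta(0):
        raise ValueError("interval must be positive")
    if now < start:
        return start
    elapsed_intervals = (now - start) // interval  # whole intervals elapsed
    return start + (elapsed_intervals + 1) * interval


start = datetime(2024, 1, 1, 0, 0)
now = datetime(2024, 1, 1, 0, 47)
next_run = next_interval_run(start, timedelta(minutes=15), now)
```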
A fault-tolerant receipt parsing engine with a multi-LLM fallback strategy
(Gemini → Llama 3), achieving 99% extraction reliability. Uses a graph-based
debt simplification algorithm and Celery/RabbitMQ for asynchronous OCR processing.
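A common way to simplify debts on a graph is to collapse every edge into a net balance per person and then greedily match debtors with creditors. The sketch below shows that heuristic; it is not confirmed to be the algorithm this project uses, it does not always yield the minimum number of transactions, and the names (`simplify_debts`, the sample people) are hypothetical.

```python
from collections import defaultdict


def simplify_debts(debts):
    """debts: list of (debtor, creditor, amount) with integer amounts (e.g. cents).
    Returns a list of (debtor, creditor, amount) with the same net balances."""
    balance = defaultdict(int)
    for debtor, creditor, amount in debts:
        balance[debtor] -= amount
        balance[creditor] += amount

    # Process debtors and creditors in descending order of amount.
    debtors = sorted(((-b, p) for p, b in balance.items() if b < 0), reverse=True)
    creditors = sorted(((b, p) for p, b in balance.items() if b > 0), reverse=True)

    settlements = []
    i = j = 0
    while i < len(debtors) and j < len(creditors):
        owed, debtor = debtors[i]
        due, creditor = creditors[j]
        paid = min(owed, due)
        settlements.append((debtor, creditor, paid))
        debtors[i] = (owed - paid, debtor)
        creditors[j] = (due - paid, creditor)
        if debtors[i][0] == 0:
            i += 1
        if creditors[j][0] == 0:
            j += 1
    return settlements


debts = [("alice", "bob", 1000), ("bob", "carol", 1000), ("carol", "alice", 400)]
result = simplify_debts(debts)
```

Here three original debts reduce to a single payment from `alice` to `carol`, because `bob` nets out to zero. Integer amounts are used to avoid floating-point drift when comparing balances to zero.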
A GPT decoder block with self-attention and multi-head attention implemented in
PyTorch. Includes two pre-trained models (Shakespearean GPT and Harpoon GPT), a
Streamlit playground, a FastAPI microservice, Docker containerization, and a
custom tokenizer with portable serialization.
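To illustrate what portable tokenizer serialization can look like, here is a minimal character-level sketch that round-trips its vocabulary through JSON. The `CharTokenizer` class is hypothetical and the character-level scheme is an assumption; the project's tokenizer and file format may differ.

```python
import json


class CharTokenizer:
    """Hypothetical character-level tokenizer with JSON serialization."""

    def __init__(self, vocab):
        self.vocab = list(vocab)
        self.stoi = {ch: i for i, ch in enumerate(self.vocab)}

    @classmethod
    def from_text(cls, text):
        # Sort the unique characters so the ids are deterministic.
        return cls(sorted(set(text)))

    def encode(self, text):
        # Raises KeyError for characters outside the vocabulary.
        return [self.stoi[ch] for ch in text]

    def decode(self, ids):
        return "".join(self.vocab[i] for i in ids)

    def to_json(self):
        return json.dumps({"vocab": self.vocab})

    @classmethod
    def from_json(cls, payload):
        return cls(json.loads(payload)["vocab"])


tok = CharTokenizer.from_text("to be or not to be")
ids = tok.encode("not to be")
restored = CharTokenizer.from_json(tok.to_json())
```

Storing only the ordered vocabulary keeps the file independent of any Python-specific format such as pickle, so another service or language can rebuild the same id mapping.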
A multimodal retrieval-augmented generation system integrating OpenAI CLIP and
Whisper for cross-modal retrieval across text, images, and audio, responding in
2–3 seconds. Streamlit interface supports annotation, custom datasets, CLIP
fine-tuning, URL-based image indexing, and FAISS similarity search.
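The retrieval step amounts to a nearest-neighbour search over embeddings that share one vector space. The sketch below does a brute-force cosine-similarity search in pure Python as a stand-in for FAISS; the three-dimensional vectors and file names are made up for illustration, and real CLIP or Whisper-derived embeddings would be much larger.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query, index, k=2):
    """index: dict mapping item id -> embedding. Returns the k most similar ids."""
    scored = sorted(index.items(), key=lambda kv: cosine(query, kv[1]), reverse=True)
    return [item_id for item_id, _ in scored[:k]]


# Hypothetical embeddings for items of different modalities.
index = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "dog.wav": [0.8, 0.3, 0.1],
    "tax.txt": [0.0, 0.2, 0.9],
}
hits = top_k([1.0, 0.0, 0.0], index, k=2)
```

Brute force is linear in the number of indexed items; a library such as FAISS becomes worthwhile once the index grows beyond what a full scan can serve within the response-time budget.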
ParkVision
AI-driven parking analytics and customer segmentation dashboard. Fine-tuned YOLOv8
(0.968 mAP) and EasyOCR (85% accuracy) for vehicle plate recognition with IDFY API
integration. Django REST Framework + FastAPI backend, React PWA frontend,
PostgreSQL, Celery, Redis, Docker.
DermaCare.ai
AI-driven portal for early detection of skin diseases in rural areas, with
dermatologist consultation. Built a TensorFlow CV model deployed via FastAPI with
S3 image uploads, plus offline TFLite inference in a Flutter mobile app. Web
frontend in React, backend in Django REST Framework. Built end-to-end in a 24-hour
hackathon.
Alumni networking platform for MMCOE. Django REST Framework backend, React +
Tailwind frontend, Docker-containerized. Real-time NSFW filtering via image
classification and profanity detection (95% accuracy, 6–7 s response time). Custom
Scikit-learn recommendation system for connections, posts, and jobs.
A chatbot that suggests songs from user input using LangChain and a FAISS
vector store for lyrics similarity search, with 2–3 s response times.
DevFinder
A social platform for college students and developers with AI-powered
recommendations and real-time messaging. Flutter app, Django REST Framework
backend, React frontend, TensorFlow recommendations, AWS deployment.
A from-scratch Python neural network training library similar to TensorFlow 2,
implementing layers, activations, losses, and optimizers. Install via
pip install open-nn-python.