AI/ML Engineer | Edge & IoT AI | Generative & Quantized LLMs | Vision‑Language & Healthcare AI | Quantum‑Inspired ML
Master’s in Computer Science – Lawrence Technological University (GPA 3.67) | F1 OPT (STEM)
Let’s ConnectI am an AI/ML Engineer specializing in bridging cutting‑edge AI research with production‑grade enterprise systems. My expertise lies in architecting Generative AI, Retrieval‑Augmented Generation (RAG) pipelines and Vision‑Language models that are scalable, cost‑efficient and deterministically safe.
I focus on optimizing complex AI architectures using techniques like 4‑bit quantization, LoRA/QLoRA fine‑tuning and hybrid edge‑to‑cloud deployments, ensuring models run efficiently on constrained hardware. My work ranges from healthcare‑focused generative AI and quantum‑inspired vision‑language adapters to TinyLlama‑based multimodal systems.
Whether publishing healthcare NLP research at IEEE or deploying quantized LLMs on edge devices, I thrive at the intersection of research and implementation. I design rules‑driven RAG pipelines and compliant AI systems that turn theoretical advances into reliable, cost‑effective software, and I have delivered ML‑based anomaly detection frameworks in industry and built clinical NLP pipelines in academia.
Dec 2025 – Present | Detroit, MI
Feb 2025 - Mar 2025 (Virtual Experience)
May 2024 – Dec 2024 | Southfield, MI
Nov 2021 – Dec 2022 | India
Quantum-inspired adapter layer for compressing and accelerating vision-language models without sacrificing multimodal reasoning quality.
Tech: TinyLlama-VLM, LoRA, PyTorch, Hugging Face Transformers
View on GitHubResearch codebase backing my IEEE paper on generative AI for healthcare data systems.
Tech: BioBERT, CRF, PyTorch, OCR (Tesseract), Flask, Transformers
View on GitHubBuilt a deep learning pipeline for classifying lumbar spine degeneration from MRI scans in the RSNA 2024 challenge.
Tech: PyTorch, OpenCV, NumPy, Docker, AWS S3
View on GitHubMultimodal TinyLlama-based VLM that injects CLIP vision tokens into the language model context via LoRA adapters.
Tech: TinyLlama, CLIP, LoRA, PyTorch, Transformers
View on GitHubReal-time AI call center prototype combining streaming ASR with a lightweight LLM to handle customer interactions.
Tech: Faster-Whisper, TinyLLaMA, Flask, WebSockets, PyTorch
View on GitHubSmart waste classifier that distinguishes recyclable vs non-recyclable items using transformer-based image models.
Tech: PyTorch, Transformers, FastAPI, Docker
View on GitHubA multimodal analytics assistant that ingests CSVs, Excel, PDFs, DOCX, JSON, images, and DICOM files to generate insights and visualizations.
Tech: Python, Flask, GPT-2, pandas, matplotlib, DICOM processing
View on GitHubLawrence Technological University – Southfield, MI
Graduated: Dec 2024 | GPA: 3.67 / 4.0
Relevant Coursework: Deep Learning, Natural Language Processing, Computer Vision, Advanced Algorithms, Data Mining.
Open to AI/ML engineering roles, VLM/LLM research collaborations, and quantum-inspired ML projects.