Core Expertise
Agentic Orchestration
Autonomous loops (ReAct, Plan-and-Execute), multi-agent systems, MCP server development, LLM-to-tool communication.
Advanced RAG
High-precision retrieval with semantic routing, GraphRAG, LlamaIndex, LlamaParse, agentic chunking, and reranking pipelines.
AI Infrastructure
Productionizing LLMs with guardrails, observability, and evals (DeepEval, RAGAS). High-throughput runtimes in Rust, Go, and Bun.
Edge AI
On-device inference with Candle (Rust), ONNX, Llama.cpp. Low-latency edge orchestration and quantization.
Systems & Architecture
Low-latency concurrent backends, resilient microservices, distributed data processing, high-throughput design.
Full-Stack & Runtimes
Scalable web ecosystems on Node.js, Deno, Bun. Browser extensions, enterprise apps, build-less PWAs.
Stack
Experience
Full Stack Software Engineer — AI/ML
2023 – Present- Designed and deployed scalable GenAI applications integrating RAG pipelines and Model Context Protocol for LLM-driven user experiences
- Implemented agentic workflows and LLM-based assistants to automate data analysis, increasing user engagement and operational efficiency
- Architected ETL pipelines and real-time data systems feeding retrieval-augmented generation models
- Modernized backend infrastructure with microservices, Docker, and Kubernetes for distributed LLM deployment at scale
Machine Learning Software Engineer
2021 – 2023- Developed AI-driven solutions for digital marketing, enhancing campaign performance and targeting accuracy
- Refactored legacy code in Python and Go, improving server performance by 50%
- Deployed predictive models using PyTorch and Scikit-learn for targeted marketing strategies
Full Stack Software Engineer — AI/ML
2020 – 2022- Designed and deployed a scalable coaching platform used by millions globally
- Integrated AI features leveraging NLP and TensorFlow for personalized user experiences
- Architected REST and GraphQL APIs using Node.js and Express
- Developed CI/CD pipelines using Docker and Kubernetes, reducing deployment times
Senior Full Stack Software Engineer — AI/ML
2018 – 2020- Designed scalable, high-performance applications enhancing healthcare operations
- Built end-to-end features across JavaScript/TypeScript, Ruby on Rails, and Go
- Developed ML models for NLP tasks using TensorFlow and PyTorch
- Improved CI/CD pipelines, achieving 30% reduction in deployment time