Data Scientist & AI Researcher with experience building production ML systems, LLM evaluation frameworks, and agentic AI pipelines. I turn complex data problems into deployed, measurable solutions.
MS Computer Science (Data Science) from UNC Charlotte · Previously Data Scientist at CVS Health and TCS · AWS Certified Data Engineer · 2 IEEE publications
- LLM Evaluation & Robustness — Built RobustnessPilot, an automated framework that executed 38 evaluation runs across 3 LLMs (14B–70B params), generating 663 tests across 23 failure modes. First-authored paper submitted to IEEE SRDS 2026.
- Agentic AI Systems — Architected a 5-agent API test suite generator using CrewAI, MCP, and A2A Protocol with GPT-4o/Claude Sonnet routing via LiteLLM. Exposed as an MCP server for Claude Desktop tool invocation.
- Production RAG & NLP — Built a legal research assistant over 22,809 court opinion vectors using LangChain, LangGraph, and Qdrant — 277% retrieval improvement, ~30% reduction in hallucinated citations.
| Paper | Venue | Year |
|---|---|---|
| LLM-Based Robustness Testing of Microservice Applications (First Author) | IEEE SRDS 2026 | 2026 |
| Crop Yield Prediction using ML & Deep Learning | [IEEE] (https://ieeexplore.ieee.org/document/10689872) | 2024 |
| Legal Precedent Retrieval using Sentence Embeddings and Clustering | arXiv:2406.01609 | 2024 |
ML & AI: Python · PyTorch · TensorFlow · Scikit-Learn · XGBoost · PySpark MLlib · Hugging Face Transformers
LLMs & Agents: LangChain · LangGraph · CrewAI · MCP · A2A Protocol · RAG · Prompt Engineering · LLM Evaluation · RAGAS
Data: SQL · Pandas · NumPy · Snowflake · Spark SQL · Tableau · Streamlit · Matplotlib
Infrastructure: FastAPI · Docker · AWS (SageMaker, Lambda, S3) · MLflow · Qdrant · Git · CI/CD · Airflow
