9 production-grade AI systems spanning the full applied AI stack.
I built a pipeline that generates synthetic training data, validates it with an LLM judge, and self-corrects until every record passes. Started with a 20% failure rate, ended at zero.
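The core of that pipeline is a generate-judge-retry loop. A minimal sketch of the pattern, with the LLM generator and judge replaced by deterministic stubs (all names and the failure simulation are illustrative, not the production code):

```python
def generate_record(seed):
    """Hypothetical stand-in for the LLM generator call.
    Seeds divisible by 5 simulate the ~20% of records that fail validation."""
    return {"id": seed, "well_formed": seed % 5 != 0}

def judge(record):
    """Stand-in for the LLM judge: returns (passed, feedback-for-retry)."""
    if record["well_formed"]:
        return True, None
    return False, "record failed schema/quality check"

def generate_until_valid(seed, max_retries=5):
    """Self-correction loop: regenerate until the judge passes the record."""
    feedback = None
    for attempt in range(max_retries):
        # The real generator call would include the judge's feedback in the prompt.
        record = generate_record(seed + attempt)
        passed, feedback = judge(record)
        if passed:
            return record
    raise RuntimeError(f"gave up after {max_retries} attempts: {feedback}")
```

Run over a batch, every returned record has passed the judge by construction, which is how a 20% raw failure rate becomes zero at the output.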
I tested 16 RAG configurations and found that semantic chunking + OpenAI embeddings + Cohere reranking reaches 0.747 Recall@5 on structured Markdown docs. This is how I got there.
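Recall@5 here means: of the chunks labeled relevant for a query, what fraction show up in the top 5 retrieved results. A minimal implementation of the metric (function name mine, not from the project):

```python
def recall_at_k(retrieved_ids, relevant_ids, k=5):
    """Fraction of relevant ids that appear in the top-k retrieved list."""
    if not relevant_ids:
        return 0.0
    top_k = set(retrieved_ids[:k])
    return len(top_k & set(relevant_ids)) / len(relevant_ids)
```

Averaged over an evaluation query set, this is the 0.747 number above.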
I fine-tuned all-MiniLM-L6-v2 on 1,475 dating-profile pairs and flipped Spearman correlation from -0.22 to +0.85. LoRA recovered 96.9% of that gain while training just 0.32% of the parameters.
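Spearman here is rank correlation between the model's similarity scores and the human labels; negative means the base model ranked pairs backwards. A tie-free sketch of the metric itself (not the fine-tuning code):

```python
def spearman(x, y):
    """Spearman rank correlation for tie-free data:
    Pearson correlation computed on the ranks of x and y."""
    def ranks(values):
        order = sorted(range(len(values)), key=lambda i: values[i])
        r = [0.0] * len(values)
        for rank, i in enumerate(order):
            r[i] = float(rank)
        return r

    rx, ry = ranks(x), ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)
```

Perfectly reversed rankings give -1.0, perfectly aligned give +1.0, which is why going from -0.22 to +0.85 is a sign flip, not just an improvement.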
I generated 250 synthetic resumes across 5 fit levels, labeled them for 5 failure modes with zero LLM calls, and found that writing template choice accounts for a 66-percentage-point difference in failure rates (chi-squared=32.74, p<0.001).
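The chi-squared figure comes from a contingency table of template choice against pass/fail counts. A minimal implementation of the Pearson chi-squared statistic for any r x c table (the example numbers below are illustrative, not the resume data):

```python
def chi_squared(table):
    """Pearson chi-squared statistic for an r x c contingency table
    given as a list of rows of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat
```

The statistic is then compared against the chi-squared distribution with (r-1)(c-1) degrees of freedom to get the p-value; at 32.74 the association between template and failure rate is far past the p<0.001 threshold.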
I built a RAG system from scratch with no LangChain, tested 46 configurations across 5 chunking strategies, 4 embedding models, and 3 retrieval methods, and found that heading-aware chunking + OpenAI embeddings hits NDCG@5 = 0.896 and Recall@5 = 1.0.
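Heading-aware chunking splits a document at heading boundaries so every chunk carries its section heading as context, instead of cutting mid-section at a fixed token count. A minimal sketch for ATX-style Markdown headings (a simplification, not the project's chunker):

```python
import re

def heading_chunks(markdown_text):
    """Split a Markdown document into chunks at heading lines,
    keeping each heading attached to the body below it."""
    chunks = []
    current = []
    for line in markdown_text.splitlines():
        # Start a new chunk at each ATX heading (# through ######).
        if re.match(r"^#{1,6} ", line) and current:
            chunks.append("\n".join(current).strip())
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current).strip())
    return chunks
```

Keeping the heading inside the chunk is what lifts retrieval on well-structured docs: the embedding sees "what section is this" alongside the body text.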
Multi-agent system that learns your writing style from samples and generates content that sounds like you. CrewAI agents handle style analysis, content planning, and draft generation.
Multi-agent system that helps Product Managers go from raw customer feedback to prioritized roadmap gaps. CrewAI agents handle sentiment scoring, theme extraction, roadmap alignment, and gap detection.
Multi-agent system that analyzes a Jira backlog, estimates effort, identifies dependencies, and proposes sprint composition. Built with CrewAI.
Multi-agent DevOps assistant that monitors CI/CD pipelines, correlates logs and metrics, diagnoses root causes, and suggests remediations with risk-based approval workflows. Built with CrewAI.
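The four CrewAI systems above share one shape: role-specialized agents run in sequence over a shared context, each reading what earlier agents produced and adding its own output. A framework-free sketch of that pattern, using the feedback-to-roadmap flow as the example (agent roles and logic are illustrative stand-ins, not the CrewAI API):

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Agent:
    """Minimal stand-in for a CrewAI-style agent: a role plus a step function
    that reads the shared context dict and returns an updated copy."""
    role: str
    run: Callable[[dict], dict]

def run_crew(agents, context):
    """Sequential pipeline: each agent enriches the shared context in turn."""
    for agent in agents:
        context = agent.run(context)
    return context

# Hypothetical three-stage crew mirroring the feedback-to-roadmap flow.
crew = [
    Agent("sentiment", lambda c: {
        **c, "sentiment": "negative" if "slow" in c["feedback"] else "positive"}),
    Agent("themes", lambda c: {
        **c, "themes": ["performance"] if "slow" in c["feedback"] else []}),
    Agent("gaps", lambda c: {
        **c, "gaps": [t for t in c["themes"] if t not in c["roadmap"]]}),
]
```

In the real systems each `run` is an LLM-backed agent with a role prompt and tools; the sequential hand-off of a shared context is the part this sketch preserves.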