my projects

→ multimodal voice agent

• deployed a multimodal voice AI agent that combines voice and screen context, using Whisper (STT), a Gemini 2.0 multimodal model, and gpt-4o-mini-tts (TTS), hosted on AWS EC2 with WebSocket streaming.

• developed a Perplexity sub-agent that searches the web via Perplexity MCP and answers questions using the agent's context; orchestrated with LangGraph workflows, with DSPy for prompt optimization.

LangGraph, AWS EC2, OpenAI SDK, FastAPI, DSPy, WebSockets, MCP

→ fine-tuned qwen2.5-1.5b with grpo rl

view model

• fine-tuned the Qwen 2.5 (1.5B) model using Hugging Face TRL, with a LoRA-based SFT warmup followed by reinforcement learning via GRPO, aligning model outputs to first-principles reasoning across ~750 annotated interactions using RLAIF with curriculum scheduling.

TRL, LoRA, GRPO, SFT, RLAIF, Unsloth
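for readers unfamiliar with GRPO, the core idea is that several completions are sampled for the same prompt and each one is scored relative to its own group, so no separate value model is needed. the sketch below shows only that normalization step in plain Python. it is an illustration, not the project's training code: the function name, the `eps` value, and the use of the population standard deviation are assumptions, and library implementations may differ in these details.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of completions for a single prompt.

    Hypothetical helper for illustration: each advantage is the reward's
    distance from the group mean, scaled by the group's standard deviation.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population std; implementations vary
    # eps keeps the division defined when every reward in the group is equal
    return [(r - mu) / (sigma + eps) for r in rewards]


# four sampled completions for one prompt, scored by a reward function
advantages = group_relative_advantages([1.0, 0.0, 0.5, 0.5])
```

completions that score above the group mean get a positive advantage and are reinforced; those below the mean get a negative one.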

→ the munger talks (agentic rag)

view repo

• built an agentic RAG system that emulates Charlie Munger's thinking style (including his mental models) by combining LangGraph workflows with multi-agent orchestration (planner, retriever, mental model analyzer, synthesizer, verifier).

• applied DSPy prompt optimization to the planner, synthesizer, and verifier modules; the retrieval pipeline uses FAISS with cross-encoder reranking and sustains ~2.5s median end-to-end latency for multi-turn queries.

LangGraph, DSPy, Multi-Agent Orchestration, RAG, FAISS + Cross-Encoder

→ podcast summarization rag (podnotes)

view repo

• designed a RAG-based podcast summarization pipeline using OpenAI Whisper for transcription, Gemma3 for summarization, and hybrid retrieval (semantic + BM25) through LangChain, with speaker diarization and DynamoDB storage.

Whisper, Gemma3, LangChain, Chroma, DynamoDB, BM25, RAG
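hybrid retrieval needs some way to merge the BM25 ranking with the semantic ranking. the description above does not say how podnotes fuses them, so the sketch below uses reciprocal rank fusion (RRF) as one common option, purely as an assumption. the function name, the document ids, and the constant `k=60` are illustrative and not taken from the repo.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into a single ranking.

    Hypothetical helper for illustration: each document earns 1 / (k + rank)
    from every list it appears in, and documents are sorted by total score.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # sort by descending score; break ties by id so the output is deterministic
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# made-up document ids standing in for transcript chunks
bm25_ranking = ["a", "b", "c"]
semantic_ranking = ["b", "d", "a"]
fused = reciprocal_rank_fusion([bm25_ranking, semantic_ranking])
```

RRF works on ranks rather than raw scores, so BM25 scores and embedding similarities do not need to be put on a common scale before merging.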

other repos

🤖

multiagents

AI agents built with LangChain, LangGraph, and DSPy, exploring orchestration patterns.

view repo

🚇

mta routing

NYC subway routing with the RAPTOR algorithm, exposed through a custom MCP server and paired with RAG search via an MCP client.

view repo

🧠

reinforcement learning

Reinforcement learning experiments and algorithm implementations.

view repo

tech stack across projects

🤖

ML/DL & LLM Frameworks

building and fine-tuning large language models with cutting-edge frameworks

PyTorch, Transformers, LangChain, LangGraph, DSPy, TRL, Unsloth, Ollama, LlamaIndex

⚡

LLM Techniques & Training

advanced techniques for model alignment, fine-tuning, and optimization

LoRA, SFT, RLHF, RLAIF, PPO, GRPO, Quantization, RAG, Flash Attention, MCP

🔧

MLOps & Infrastructure

deploying and scaling ML systems in production environments

Docker, Kubernetes, GitHub Actions, Redis, AWS EC2, SageMaker, S3, DynamoDB

🌐

Web & APIs

building robust web applications and real-time communication systems

React, FastAPI, REST APIs, WebSockets, WebRTC, JavaScript, TypeScript, Angular