Projects & Experiments
My personal lab — side projects, proofs of concept, and experiments at the intersection of AI Agents and Data Governance.
Featured Builds

Sidekick — AI Co-Worker
A proof-of-concept AI co-worker for exploring LLM agents, RAG, memory, and project-scoped document workflows using LangChain and LangGraph.
Whether a local agent loop (worker + evaluator) with ChromaDB-backed RAG and per-project memory could serve as a genuine productivity tool, not just a demo.
Built a working multi-mode agent with full-file RAG retrieval, SQLite checkpoints, and LangSmith tracing. Validated the worker+evaluator pattern at scale.
Personal experiment / POC

SQL Agent Loop
Demonstrates that with a data dictionary and an agentic AI, you can answer any business question in natural language — even when column names are cryptic.
That metadata and governance are the true foundation for effective AI. Without good data definitions, the agent guesses wrong. With them, it reliably translates natural language to accurate SQL.
Validated the pattern end-to-end. The agent uses tools to inspect schema, read the dictionary, and run SQL in a loop — no hardcoded queries.
Personal experiment / POC

Agentic AI POC
Experimental proof of concept for agentic AI patterns — tool-augmented reasoning, agent orchestration loops, memory integration, and plan-and-execute workflows.
How an LLM chooses and invokes tools, receives results, and continues reasoning — without relying on framework abstractions.
Built a custom agent loop with tool registry, long-term memory, planner, and worker from first principles. Useful reference for understanding what frameworks abstract away.
Personal experiment / POC

Career Chatbot
AI agent that acts as a personal career assistant. Uses function calling, a retry loop, and an LLM-as-judge evaluator to stay grounded and in character.
The LLM-as-judge pattern for output quality control — a second model evaluates every reply and triggers a retry with feedback if it fails.
Deployed to Hugging Face Spaces. The evaluator pattern significantly improves response consistency and prevents hallucinations.
Personal experiment / POC

Data CoPilot
Local-first, chat-centric data copilot that routes messages to internal agents for profiling, harmonization, and governance document generation.
A scalable full-stack foundation: React SPA + FastAPI + Postgres (pgvector) + Redis + MinIO — learning each layer step by step.
Core infrastructure operational via Docker Compose. API, workers, and web UI scaffolded. Validates the architecture and data model before full implementation.
Personal experiment / POC

Asparagus Operations POC
Full data lake architecture built on free Google services: raw storage (Drive), AI-powered ETL (GPT-4o-mini), structured serving (Sheets), and browser-side analytics via DuckDB WASM — plus a full operations workflow for invoice entry and master data management.
A complete data lake — raw zone, AI ETL, structured serving, data catalog, and interactive analytics — can be built entirely on free Google services with no dedicated database, no data warehouse, and no cost per query.
Proved the concept: DuckDB WASM runs SQL in the browser querying data from Google Sheets, GPT-4o-mini extracts structured JSON from PDF/XML invoices, and the entire stack costs under $5/month vs $50–$1,000+ for AWS or Snowflake equivalents.
Personal experiment / POC

Agricultural Settlement Platform
Fully operational agricultural intake, quality, contract, and settlement platform built on Google AppSheet + Apps Script. Replaced manual spreadsheet-driven settlement with a structured, rule-based, auditable engine.
Enterprise architecture principles — governed data, controlled calculation engines, workflow automation — can be applied in low-code environments.
Production-deployed system handling multiple contract types, quality-based payout thresholds, advance payments, PDF generation, and automated email distribution.
Personal project

Local Image Generation POC
Ran Stable Diffusion XL locally to explore text-to-image, image-to-image, and inpainting pipelines — including device support (CUDA, MPS, CPU).
How local diffusion models work in practice: download, memory requirements, precision issues on Apple Silicon, and pipeline differences.
Working t2i, i2i, and inpainting pipelines. Key lesson: MPS requires float32; attention_slicing helps on limited VRAM.
Personal experiment / POC
Experiments & OSS
Interested in collaborating on a technical project?
I'm always open to discussing research prototypes, data infrastructure, or open-source tooling.
Get in Touchmail