Hi, my name is
Tanmay Sule.
I build production AI systems at scale.
I'm a Lead Engineer at Eightfold AI, where I lead the company's agentic AI initiative. I build LLM-powered systems that automate real engineering workflows — from production issue triage to multi-tenant RAG platforms.
View my work ↓01. About Me
I'm a platform engineer who builds production AI systems. At Eightfold, I've spent 3+ years owning core infrastructure — horizontally-sharded databases across 40+ clusters, multi-cloud architecture (AWS & Azure), and real-time analytics platforms — and now lead the company's agentic AI initiative on top of that foundation.
On the side, I'm building a multi-tenant agentic RAG platform from scratch — a full-stack system with Go ETL pipelines, hybrid retrieval with automatic quality grading, two-level agent orchestration, and AWS Bedrock deployment.
I hold a Master's from the Indian Institute of Science (IISc Bangalore) and ranked 2nd out of 100,000+ in India's GATE CS exam.
02. Featured Work
Agentic AI Developer Productivity Platform
Technical & Project Lead · Eightfold AI
Engineers were spending hours manually triaging production alerts, debugging build failures, and searching internal docs. I built the agent framework from scratch and 3 production LLM agents that automate these workflows, reducing time-to-resolution across the engineering organization. Mentored a team of new engineers into AI engineering while establishing org-wide standards.
- Agent Framework — Custom 4-layer architecture on LangGraph/LangChain. Reusable abstractions with built-in observability middleware (per-invocation token tracking), multi-model LLM factory (GPT-5/4o/4o-mini), and centralized tool registry (9 tools).
- Sentry Production Triager — 3-stage pipeline: root cause analysis (GPT-5, 7 tools) → code fix classification → GitHub issue creation with automatic Copilot assignment for code fix. End-to-end AI-to-AI handoff: agent triages the issue, then assigns it to Copilot to generate the fix. Redis distributed locking for concurrent issue prevention.
- Build Failure Triager — Per-failure root cause analysis with cross-failure deduplication. Same root cause across multiple test failures → single GitHub issue, eliminating duplicate engineering effort. Multi-channel Slack reporting.
- Developer Chatbot — Slack-native conversational assistant that answers engineering questions by orchestrating specialized sub-agents. 8-step pipeline with intent classification, dynamic routing, concurrent tool execution, and structured Block Kit responses. Built-in multi-level error recovery.
- Python
- LangGraph
- LangChain
- GPT-5 / 4o / 4o-mini
- Redis
- S3
- Slack API
- GitHub API
- Sentry API
- Pydantic
Multi-Tenant Agentic RAG Platform
Sole Architect & Engineer · Independent Project
Full-stack conversational AI platform I designed and built from scratch. The core challenge: answering cross-domain business queries that require joining structured database records with unstructured communications (Slack, Email, Notion) — resolving entity references across data sources and producing evidence-based, source-attributed responses.
- Agent Orchestration — Two-level architecture: L0 orchestrator (9-node LangGraph, 29-field state) handles classification, planning, and evaluation with persistent working memory (entity tracking, pronoun resolution, topic continuity across turns). L1 domain agents (5-node, plan-then-execute) decompose queries into DAGs of typed/SQL/search tasks with concurrent execution. Two-tier human-in-the-loop with crash-safe DynamoDB persistence.
- Data Pipeline — Go ETL with medallion architecture (Bronze → Silver → Gold). 5 event-driven Lambdas, 6 source adapters, self-registering mapper registry normalizing 40+ entity types. Content enrichment prepends source context (channel, sender, date) before embedding — so vectors capture provenance, improving filtered retrieval. Hybrid chunking, dual embedding providers (OpenAI + Bedrock Titan). Multi-layer error tracking with partial-success handling.
- Retrieval Engine — Corrective RAG with hybrid search: parallel pgvector cosine similarity and PostgreSQL full-text search (tsvector, ts_rank_cd weighting) over 50 candidates each, fused via single-pass Reciprocal Rank Fusion SQL (k=60, FULL OUTER JOIN). Cross-source fan-out across all data sources with global re-ranking. LLM-based relevance grading with automatic query reformulation when confidence is low. Separate JSONB metadata filter path for structured queries (channel, sender, date range).
- MCP Tool Layer — 30 tools across 3 Lambda backends: 24 typed ORM handlers (Go, ~20 entity types), 2 SQL tools with read-only guardrails (write blocklist, identifier validation, 500-row cap), 4 search tools (Python). JWT-based tenant authentication (Cognito). Bedrock AgentCore runtime with 13 SSE event types for real-time streaming.
- Infrastructure — 8 ARM64 Graviton Lambdas, Pulumi IaC (Go), per-tenant isolation (PostgreSQL schema-per-tenant, DynamoDB partitioning, dedicated MCP gateway instances). Provider-agnostic LLM factory supporting Anthropic and OpenAI, configurable per tenant.
- Python
- Go
- LangGraph
- Claude Opus
- OpenAI
- Aurora PostgreSQL
- pgvector
- DynamoDB
- Bedrock AgentCore
- MCP
- Lambda
- Cognito
- Pulumi
03. Experience
Lead Engineer — Core Infrastructure / Platform
Eightfold AI — AI-powered talent intelligence platform serving Fortune 500 enterprises
Aug 2022 — Present
Agentic AI Initiative
Launched and led the company's first agentic AI effort, growing a team from 0 to 4 engineers. Piloted Cursor Agent and Copilot Agent adoption across engineering. Established org-wide standards for LLM integration, prompt management, and evaluation.
Cloud-Native Analytics Platform
Consolidated Redshift + Databricks into a unified StarRocks-based analytics layer, cutting query latency and simplifying BI reporting for product and data teams. Designed the end-to-end event-driven ETL pipeline (Firehose → S3 → SQS → Airflow).
Multi-Cloud Architecture
Core member of year-long AWS → Azure initiative, unblocking enterprise customers with Azure-only mandates. Designed cloud-agnostic service abstractions (Redis, OLTP, Blob Storage) and extended the distributed async platform — 100+ containers, 700K+ operations daily.
OLTP Database Platform
Owned Aurora MySQL at scale — 40+ clusters across 4 AWS regions. Drove zero-downtime major version upgrades via blue-green deployments. Delivered BYoK encryption saving $5,000/customer/year. Built internal CLI tooling for cluster management and migration automation.
04. Skills
AI & LLM Engineering
- LangGraph
- LangChain
- Agent Orchestration
- MCP Servers
- RAG Pipelines
- Prompt Engineering
- Pydantic
- Embeddings
- Vector Search
- pgvector
Infrastructure & Cloud
- AWS
- Azure
- Bedrock
- Lambda
- ECS
- Aurora
- DynamoDB
- S3
- SQS
- Docker
- Kubernetes
- CloudFormation
- Terraform
- Pulumi
- Grafana
- Prometheus
Languages & Data
- Python
- Go
- Java
- Rust
- C++
- Bash
- SQL
- MySQL
- PostgreSQL
- Redis
- StarRocks
- Airflow
Architecture & Patterns
- Distributed Systems
- Event-driven Architecture
- ETL Pipelines
- Medallion Architecture
- Multi-tenant Isolation
- Blue-Green Deployments
- Concurrency
- Message Brokers
05. Education & Achievements
Master of Technology
Computer Science & Automation — Indian Institute of Science, Bangalore
Bachelor of Engineering
Computer Engineering — Mumbai University
- All India Rank 2 out of 100,000+ in Graduate Aptitude Test in Engineering (GATE CS, 2020)
- Selected as one of the Top 25 individuals in company-wide annual performance recognition (Eightfold AI, 2024)
- JLPT N4 & N5 certification — Japanese-Language Proficiency Test
06. What's Next?
Get In Touch
I'm always interested in hearing about new opportunities in AI engineering, infrastructure, and platform engineering. Whether you have a question or just want to connect, feel free to reach out.
Say Hello