Hi, my name is

Tanmay Sule.

I build production AI systems at scale.

I'm a Lead Engineer at Eightfold AI, where I lead the company's agentic AI initiative. I build LLM-powered systems that automate real engineering workflows — from production issue triage to multi-tenant RAG platforms.

View my work ↓

01. About Me

I'm a platform engineer who builds production AI systems. At Eightfold, I've spent 3+ years owning core infrastructure — horizontally-sharded databases across 40+ clusters, multi-cloud architecture (AWS & Azure), and real-time analytics platforms — and now lead the company's agentic AI initiative on top of that foundation.

On the side, I'm building a multi-tenant agentic RAG platform from scratch — a full-stack system with Go ETL pipelines, hybrid retrieval with automatic quality grading, two-level agent orchestration, and AWS Bedrock deployment.

I hold a Master's from the Indian Institute of Science (IISc Bangalore) and ranked 2nd out of 100,000+ in India's GATE CS exam.

3+ Years at Eightfold AI
Rank 2 GATE CS / 100K+
Top 25 of 1,000+ · 2024
M.Tech IISc Bangalore

02. Featured Work

Featured Project

Agentic AI Developer Productivity Platform

Technical & Project Lead · Eightfold AI

Engineers were spending hours manually triaging production alerts, debugging build failures, and searching internal docs. I built the agent framework from scratch and 3 production LLM agents that automate these workflows, reducing time-to-resolution across the engineering organization. Mentored a team of new engineers into AI engineering while establishing org-wide standards.

  • Agent Framework — Custom 4-layer architecture on LangGraph/LangChain. Reusable abstractions with built-in observability middleware (per-invocation token tracking), multi-model LLM factory (GPT-5/4o/4o-mini), and centralized tool registry (9 tools).
  • Sentry Production Triager — 3-stage pipeline: root cause analysis (GPT-5, 7 tools) → code fix classification → GitHub issue creation with automatic Copilot assignment for code fix. End-to-end AI-to-AI handoff: agent triages the issue, then assigns it to Copilot to generate the fix. Redis distributed locking for concurrent issue prevention.
  • Build Failure Triager — Per-failure root cause analysis with cross-failure deduplication. Same root cause across multiple test failures → single GitHub issue, eliminating duplicate engineering effort. Multi-channel Slack reporting.
  • Developer Chatbot — Slack-native conversational assistant that answers engineering questions by orchestrating specialized sub-agents. 8-step pipeline with intent classification, dynamic routing, concurrent tool execution, and structured Block Kit responses. Built-in multi-level error recovery.
  • Python
  • LangGraph
  • LangChain
  • GPT-5 / 4o / 4o-mini
  • Redis
  • S3
  • Slack API
  • GitHub API
  • Sentry API
  • Pydantic
Featured Project

Multi-Tenant Agentic RAG Platform

Sole Architect & Engineer · Independent Project

Full-stack conversational AI platform I designed and built from scratch. The core challenge: answering cross-domain business queries that require joining structured database records with unstructured communications (Slack, Email, Notion) — resolving entity references across data sources and producing evidence-based, source-attributed responses.

  • Agent Orchestration — Two-level architecture: L0 orchestrator (9-node LangGraph, 29-field state) handles classification, planning, and evaluation with persistent working memory (entity tracking, pronoun resolution, topic continuity across turns). L1 domain agents (5-node, plan-then-execute) decompose queries into DAGs of typed/SQL/search tasks with concurrent execution. Two-tier human-in-the-loop with crash-safe DynamoDB persistence.
  • Data Pipeline — Go ETL with medallion architecture (Bronze → Silver → Gold). 5 event-driven Lambdas, 6 source adapters, self-registering mapper registry normalizing 40+ entity types. Content enrichment prepends source context (channel, sender, date) before embedding — so vectors capture provenance, improving filtered retrieval. Hybrid chunking, dual embedding providers (OpenAI + Bedrock Titan). Multi-layer error tracking with partial-success handling.
  • Retrieval Engine — Corrective RAG with hybrid search: parallel pgvector cosine similarity and PostgreSQL full-text search (tsvector, ts_rank_cd weighting) over 50 candidates each, fused via single-pass Reciprocal Rank Fusion SQL (k=60, FULL OUTER JOIN). Cross-source fan-out across all data sources with global re-ranking. LLM-based relevance grading with automatic query reformulation when confidence is low. Separate JSONB metadata filter path for structured queries (channel, sender, date range).
  • MCP Tool Layer — 30 tools across 3 Lambda backends: 24 typed ORM handlers (Go, ~20 entity types), 2 SQL tools with read-only guardrails (write blocklist, identifier validation, 500-row cap), 4 search tools (Python). JWT-based tenant authentication (Cognito). Bedrock AgentCore runtime with 13 SSE event types for real-time streaming.
  • Infrastructure — 8 ARM64 Graviton Lambdas, Pulumi IaC (Go), per-tenant isolation (PostgreSQL schema-per-tenant, DynamoDB partitioning, dedicated MCP gateway instances). Provider-agnostic LLM factory supporting Anthropic and OpenAI, configurable per tenant.
  • Python
  • Go
  • LangGraph
  • Claude Opus
  • OpenAI
  • Aurora PostgreSQL
  • pgvector
  • DynamoDB
  • Bedrock AgentCore
  • MCP
  • Lambda
  • Cognito
  • Pulumi

03. Experience

Lead Engineer — Core Infrastructure / Platform

Eightfold AI — AI-powered talent intelligence platform serving Fortune 500 enterprises

Aug 2022 — Present

Agentic AI Initiative

Launched and led the company's first agentic AI effort, growing a team from 0 to 4 engineers. Piloted Cursor Agent and Copilot Agent adoption across engineering. Established org-wide standards for LLM integration, prompt management, and evaluation.

Cloud-Native Analytics Platform

Consolidated Redshift + Databricks into a unified StarRocks-based analytics layer, cutting query latency and simplifying BI reporting for product and data teams. Designed the end-to-end event-driven ETL pipeline (Firehose → S3 → SQS → Airflow).

Multi-Cloud Architecture

Core member of year-long AWS → Azure initiative, unblocking enterprise customers with Azure-only mandates. Designed cloud-agnostic service abstractions (Redis, OLTP, Blob Storage) and extended the distributed async platform — 100+ containers, 700K+ operations daily.

OLTP Database Platform

Owned Aurora MySQL at scale — 40+ clusters across 4 AWS regions. Drove zero-downtime major version upgrades via blue-green deployments. Delivered BYoK encryption saving $5,000/customer/year. Built internal CLI tooling for cluster management and migration automation.

04. Skills

AI & LLM Engineering

  • LangGraph
  • LangChain
  • Agent Orchestration
  • MCP Servers
  • RAG Pipelines
  • Prompt Engineering
  • Pydantic
  • Embeddings
  • Vector Search
  • pgvector

Infrastructure & Cloud

  • AWS
  • Azure
  • Bedrock
  • Lambda
  • ECS
  • Aurora
  • DynamoDB
  • S3
  • SQS
  • Docker
  • Kubernetes
  • CloudFormation
  • Terraform
  • Pulumi
  • Grafana
  • Prometheus

Languages & Data

  • Python
  • Go
  • Java
  • Rust
  • C++
  • Bash
  • SQL
  • MySQL
  • PostgreSQL
  • Redis
  • StarRocks
  • Airflow

Architecture & Patterns

  • Distributed Systems
  • Event-driven Architecture
  • ETL Pipelines
  • Medallion Architecture
  • Multi-tenant Isolation
  • Blue-Green Deployments
  • Concurrency
  • Message Brokers

05. Education & Achievements

Master of Technology

Computer Science & Automation — Indian Institute of Science, Bangalore

Bachelor of Engineering

Computer Engineering — Mumbai University

  • All India Rank 2 out of 100,000+ in Graduate Aptitude Test in Engineering (GATE CS, 2020)
  • Selected as one of the Top 25 individuals in company-wide annual performance recognition (Eightfold AI, 2024)
  • JLPT N4 & N5 certification — Japanese-Language Proficiency Test

06. What's Next?

Get In Touch

I'm always interested in hearing about new opportunities in AI engineering, infrastructure, and platform engineering. Whether you have a question or just want to connect, feel free to reach out.

Say Hello