Anmol Jaiswal anmolg1997

About

I build production AI systems that reason, plan, and execute autonomously — from multi-agent orchestration to enterprise RAG pipelines, LoRA fine-tuning at scale, and multi-adapter inference serving.

🔭 Currently Building:

Focus Area	Technologies
🤖 Multi-Agent AI	Google ADK, A2A Protocol, MCP Tools, Agent Orchestration
🔗 Knowledge Graphs & RAG	Neo4j GraphRAG, Hybrid Search, Reranking, Guardrails
⚡ LLM Fine-Tuning & Serving	LoRA/QLoRA, Unsloth, vLLM, Multi-Adapter Inference
👁️ LLM Observability	Langfuse, MLflow, OpenSearch, Domain Evaluation
🧠 Model Building	Transformers from Scratch, DeepSpeed, Alignment (SFT/DPO/RLHF)

Featured Work

🤖 Multi-Agent Framework

Production multi-agent orchestration with Google ADK — coordinator, planner, coder & reviewer agents

🔗 KG_RAG

Knowledge Graph + RAG with Neo4j for intelligent document QA

🏢 Enterprise RAG

Hybrid search RAG with guardrails, reranking & Langfuse observability

🔍 NL2SQL Engine

Natural language → SQL with self-correction & multi-dialect support

🧠 SLM From Scratch

Build language models from zero — tokenizer, transformer, training, alignment

⚡ LLM Finetuning

Production LoRA/QLoRA fine-tuning — YAML recipes, MLflow, vLLM serving

🚀 Multi-LoRA Serve

One base model, many adapters per request — OpenAI-compatible inference gateway

🏭 LoRA Factory

Adapter lifecycle — train, evaluate, merge (TIES/DARE), version & publish
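
As a rough illustration of the DARE step named above, the sketch below drops each delta weight at random with probability `drop_rate` and rescales the survivors by `1 / (1 - drop_rate)`. It is a minimal pure-Python sketch, not the LoRA Factory code: `dare_sparsify` and `merge_deltas` are hypothetical names, weights are flat lists of floats, and summing the sparsified deltas onto the base is an assumed merge rule (DARE is normally paired with a separate merge method such as TIES).

```python
import random


def dare_sparsify(delta, drop_rate, seed=0):
    """Zero each delta entry with probability drop_rate, rescale the rest."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    rng = random.Random(seed)  # seeded so the drop mask is reproducible
    scale = 1.0 / (1.0 - drop_rate)
    return [0.0 if rng.random() < drop_rate else d * scale for d in delta]


def merge_deltas(base, deltas, drop_rate, seed=0):
    """Add DARE-sparsified deltas to the base weights (assumed merge rule)."""
    merged = list(base)
    for i, delta in enumerate(deltas):
        sparse = dare_sparsify(delta, drop_rate, seed + i)
        merged = [m + s for m, s in zip(merged, sparse)]
    return merged
```

With `drop_rate=0.0` nothing is dropped and the merge reduces to plain addition of the deltas; higher rates keep fewer entries but preserve each delta's expected value through the rescaling.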

🏥 Domain-Adaptive LLM

Specialize LLMs for medical, legal, finance & code — domain benchmarks, curriculum training, safety guardrails

🛡️ Insurance Fraud Detection

ML-powered insurance fraud detection — 10 expert rules, PyCaret AutoML, explainable decisions

Open Source Contributions

Google ADK Python

Designed the first Firestore session service: transactional state, subcollection events, batch deletes. Its design patterns were adopted in the official implementation by a Google engineer, who credited the work.

ADK Community

Contributed FirestoreSessionService with 19 unit tests, in-memory mocks, and production-grade design — race-safe transactions, N+1 query elimination, async batch deletes.

ag-ui Protocol

Bug fixes to the LangGraph adapter — trailing slash route fix, fork config passthrough, and message ID validation for regenerate streams.

Pydantic AI

Contributed a built-in history processor that repairs orphaned tool calls and results, preventing provider 400 errors, and fixed pollution of the LLM-as-judge reason field by output from reasoning models.
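
To make the orphan-repair idea concrete, here is a minimal sketch of one possible strategy: drop any tool call that never received a result and any tool result whose call is missing. It does not use Pydantic AI's message types. The dict shape (`role`, `tool_calls`, `tool_call_id`) and the `repair_history` name are assumptions for illustration, and the actual processor may repair histories differently.

```python
def repair_history(messages):
    """Return a copy of messages with unpaired tool calls/results removed."""
    call_ids = {
        call["id"]
        for m in messages
        if m["role"] == "assistant"
        for call in m.get("tool_calls", [])
    }
    result_ids = {m["tool_call_id"] for m in messages if m["role"] == "tool"}
    paired = call_ids & result_ids  # ids that have both a call and a result

    repaired = []
    for m in messages:
        if m["role"] == "tool":
            # Keep a tool result only if its originating call still exists.
            if m["tool_call_id"] in paired:
                repaired.append(m)
        elif m["role"] == "assistant" and m.get("tool_calls"):
            kept = [c for c in m["tool_calls"] if c["id"] in paired]
            if kept:
                repaired.append({**m, "tool_calls": kept})
            elif m.get("content"):
                # No paired calls left: keep the text, drop the tool_calls key.
                repaired.append({k: v for k, v in m.items() if k != "tool_calls"})
        else:
            repaired.append(m)
    return repaired
```

This version ignores message ordering and simply matches ids, which is enough to show why an unpaired entry would otherwise reach the provider and trigger a 400 response.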

llama.cpp

Added type and integer range validation to GGUFWriter.add_key_value — catches type mismatches and integer overflow/underflow before they silently corrupt model metadata.
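
The kind of check described above can be sketched as follows. This is an illustration only, not the gguf-py implementation: `INT_RANGES` and `validate_int` are hypothetical names, and the type labels are plain strings rather than the library's value-type enum.

```python
# Hypothetical sketch: reject values that do not fit the declared integer type
# before they are written as metadata. Names here are not the gguf-py API.
INT_RANGES = {
    "uint8": (0, 2**8 - 1),
    "int8": (-(2**7), 2**7 - 1),
    "uint16": (0, 2**16 - 1),
    "int16": (-(2**15), 2**15 - 1),
    "uint32": (0, 2**32 - 1),
    "int32": (-(2**31), 2**31 - 1),
    "uint64": (0, 2**64 - 1),
    "int64": (-(2**63), 2**63 - 1),
}


def validate_int(value, type_name):
    """Return value unchanged if it is an int that fits type_name, else raise."""
    if type_name not in INT_RANGES:
        raise ValueError(f"unknown integer type: {type_name}")
    # bool is a subclass of int in Python, so it is rejected explicitly.
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError(f"expected int for {type_name}, got {type(value).__name__}")
    lo, hi = INT_RANGES[type_name]
    if not lo <= value <= hi:
        raise ValueError(f"{value} is out of range for {type_name} [{lo}, {hi}]")
    return value
```

Raising before the write, rather than letting the value wrap when it is packed into a fixed-width field, is what turns silent corruption into an immediate error.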

Tech Stack

📋 Full Tech Breakdown

LLM Providers      OpenAI • Anthropic • Google Gemini • Llama • Mistral
Agent Frameworks   Google ADK • A2A Protocol • MCP Tools • LangGraph • CrewAI
RAG Stack          LlamaIndex • LangChain • Neo4j • OpenSearch • Pinecone • Weaviate
Observability      Langfuse • MLflow • Weights & Biases • OpenTelemetry
Inference          vLLM • Multi-LoRA Serving • TensorRT-LLM • ONNX Runtime
Fine-tuning        LoRA • QLoRA • DoRA • Unsloth • Axolotl • DeepSpeed • RLHF/DPO
Model Building     PyTorch Transformers • BPE Tokenizers • GGUF/ONNX Export
Frontend           React • Vite • Next.js • TypeScript • TailwindCSS
Backend            FastAPI • Python • Node.js • GraphQL
Cloud              AWS (Bedrock, SageMaker) • GCP (Vertex AI) • Azure
Infrastructure     Docker • Kubernetes • Terraform • GitHub Actions
