Skip to content
Sudesh P

Sudesh P (Sudhii)

AI Systems Engineer & Creator of OmniSLM

I build production-ready AI applications that prioritize privacy, security, and unit economics. My current focus is on operationalizing Small Language Models (SLMs) and designing scalable RAG infrastructure.

AI Architecture

Designing decoupled, resilient systems for LLM inference, continuous batching, and agent orchestration.

Vector Search (RAG)

Implementing highly isolated, multi-tenant vector databases using FAISS, Qdrant, and Pinecone.

Local Inference

Running optimized SLMs on edge devices and VPCs using Ollama, Llama.cpp, and vLLM.

Published Notes