Sovereign AI · Production Infrastructure

Enterprise AI, engineered to run in production.

LMXAI is an AI engineering & consultancy studio building LLM, agentic and RAG systems on sovereign, EU AI Act-compliant infrastructure — independent of hyperscalers, from prototype to production.

Fractional AI CTO
Full product ownership
On-prem & sovereign
vLLM on Kubernetes
Leiden, NL
Serving the EU
deploy/sovereign_stack.py
# EU-sovereign inference — no data leaves the cluster
from lmxai import ServingStack, Agent

stack = ServingStack(
    model   = "mdAgent-Hermes-32B",  # Qwen3-VL LoRA
    runtime = "vllm",
    quant   = "awq-int4",
    gpus    = 8,                    # A100 cluster
    region  = "eu-sovereign",
)

agent = Agent(stack, tools=["rag", "mcp"])
agent.serve(compliance="eu_ai_act")

# illustrative API — not a published package
EU AI Act compliant Hyperscaler-independent LoRA / QLoRA LangGraph · MCP
Positioning

One partner across the full AI stack.

Most AI projects stall between a promising demo and a system that survives production. LMXAI closes that gap with end-to-end ownership.

Software engineering, AI architecture, data science, product strategy and commercialization — delivered by a single accountable partner acting as your fractional AI CTO.

Sovereign by design. On-prem or private cloud — your data and weights stay under your control, EU AI Act-aligned.
Production-first. Inference optimization, observability and evaluation baked in — not bolted on after the demo.
Full ownership. From data pipelines and fine-tuning to deployment, product and go-to-market.
Capabilities

What I build & ship.

Deep, hands-on engineering across the modern LLM lifecycle — model, system and strategy.

LLM Fine-tuning & Optimization

Custom models tuned for your domain and tool-use, then compressed to fit your hardware budget.

LoRAQLoRAMoEAWQINT4/INT2expert profiling

Agentic & RAG Systems

Reliable retrieval and multi-step agents with tool integration and rigorous evaluation.

LangGraphLangChainMCPhybrid retrievalElasticsearchBFCL-v3

Sovereign Inference & Deployment

High-throughput, cost-efficient serving on your own infrastructure — fully on-premise capable.

vLLMKubernetesA100KTransformersspeculative decodingexpert offloading

AI Strategy & Compliance

From POC to a commercialized product — architected to stay independent of hyperscalers and EU-compliant.

EU AI Actdata sovereigntycommercializationFastAPIOpenTelemetryPhoenix
Read the AI Act guide
Selected work

Systems in production.

A selection of shipped platforms across enterprise, allied health and education.

Enterprise AI Platform

Sovereign AI Platform

TSG Group · 500+ employees

An enterprise AI workspace positioned against Copilot & ChatGPT Enterprise on EU data sovereignty — keeping company data inside the org's own boundary.

vLLMKubernetesFastAPIon-prem
View case study
EUdata sovereignty
Model · Fine-tune

mdAgent-Hermes-32B

Qwen3-VL-32B · LoRA

A multimodal tool-use fine-tune of Qwen3-VL-32B, optimized for reliable single-turn function calling and document understanding.

LoRAmultimodaltool-useBFCL-v3
View case study
82.3%single-turn success
Infrastructure · Gateway

mdGPT Gateway

LLM gateway & observability

A FastAPI-based LLM gateway with token streaming, full observability and per-user namespace isolation for multi-tenant deployments.

FastAPIasyncpgRedisSSEOpenTelemetry
View case study
SSEstreaming · multi-tenant
Engineering · Agentic Systems

Agentic Capabilities

LMXAI · design patterns & tool ecosystem

How LMXAI builds reliable production agents — LangGraph orchestration, MCP tool integration, fine-tuned tool reliability (BFCL-v3), observability, and the real workspace tool ecosystem they operate.

LangGraphMCPBFCL-v3OpenTelemetry
Explore capabilities
6production patterns
Clinical AI

Savion

Allied health · dietitians

A clinical AI platform for dietitians built on LangGraph, automating documentation and clinical workflows for regulated allied-health practice.

LangGraphagenticclinical
View case study
clinician productivity
Education · Product

Learning Matrix

LMXAI · AI assistant for students

A personalized AI study companion for students — a reliable, fast-responding mentor that connects new topics to existing knowledge and supports teachers with content and planning.

RAGexam servicelesson planning
View case study
EdTechstudent mentor
Research · NLP

In-Depth NLP Analysis

LMXAI · corpus & topic modeling

Corpus statistics, LDA topic modeling and interactive topic-term visualization — the research layer underpinning Learning Matrix's semantic retrieval.

LDApyLDAvisgensimtopic modeling
View research
20+topics discovered
Industries

Built for regulated, high-stakes domains.

Where data sovereignty, compliance and reliability are non-negotiable.

Enterprise AI

SMB to large enterprise — internal copilots and AI workspaces.

Allied Health

Clinical AI for regulated healthcare and practitioner workflows.

Regulated Industries

Fintech, insurance, agriculture, customs & logistics.

EU Sovereignty

EU AI Act-aligned, hyperscaler-independent deployments.

Stack

The toolchain.

PythonPyTorchvLLMKubernetesLangGraphLangChainMCPFastAPIasyncpgRedisElasticsearchOpenTelemetryPhoenixDockerPyMuPDFRapidOCRevalscopeA100 / GPU
The person behind LMXAI

Vahit — Data Scientist & AI Engineer.

I'm an AI engineer and enterprise AI consultant based in Leiden, the Netherlands, working at the intersection of LLM research and production engineering. My approach is evidence-driven and skeptical but constructive — I care about systems that hold up under real load, not benchmarks that look good in a slide.

RoleAI Engineer · Fractional CTO
Based inLeiden, Netherlands
LanguagesTurkish · English · Dutch
FocusSovereign & production AI
Contact

Let's build something that ships.

Tell me about your project — fine-tuning, agentic systems, sovereign deployment or AI strategy.

Location

Leiden, Netherlands

Email

info@lmxai.com

Profiles