System Architecture
Quint is a multi-service platform that intercepts AI agent actions, scores them through a graph-native pipeline, and optionally blocks dangerous actions in real time.High-Level Architecture
Component Map
| Component | Language | Purpose | Deployment |
|---|---|---|---|
| quint-proxy | Go | MCP gateway proxy, stdio interception, signed audit logs | Customer environment |
| quint-infra | Python (FastAPI) | REST API, event scoring, policy management | Railway |
| quint-graph | Python | GraphReasoner, forward-chaining, GNN, RAG, Memgraph | Library (pip) |
| quint-cli | TypeScript | Developer CLI, policy management, audit verification | npm package |
| risk-engine | Python | GPU model training (Qwen3-8B-AWQ), LoRA fine-tuning | Modal |
| quint-proto | Protobuf | Shared schema contract for all services | buf.build |
Request Flow
Data Flow
Event Lifecycle
Interception
The proxy captures an outbound agent action (MCP tool call, API request, database query). It normalizes the action to canonical
domain:scope:verb format and extracts metadata (agent, session, target, data fields).Ingestion
The event is sent to the REST API via
POST /events. The API validates the API key, checks rate limits, and persists the event to PostgreSQL.Scoring
The GraphReasoner evaluates the event through 4 layers: intrinsic risk, GNN structural analysis, policy violations, and temporal anomaly detection. If confidence is below 0.8, compliance context is retrieved from Memgraph and injected into a Gemini LLM call.
Response
The score (1-100), risk level, violations, compliance references, and mitigations are returned to the proxy. The proxy enforces the verdict (allow, flag, or block).
Deployment Topology
Railway (Production API)
| Service | Resources | Purpose |
|---|---|---|
| API Server | 2 vCPU, 1GB RAM | FastAPI + Uvicorn (4 workers) |
| PostgreSQL | Managed | Event storage, scores, customers |
| Redis | Managed | L1 cache, rate limiting |
| Memgraph | 256MB RAM (optional) | Graph reasoning co-processor |
https://api-production-56df.up.railway.app
Modal (GPU Training)
| Service | Resources | Purpose |
|---|---|---|
| Risk Engine | A10G GPU, 40GB VRAM | Qwen3-8B-AWQ inference |
| Training Jobs | A10G GPU | LoRA fine-tuning per tenant |
| Weights Volume | Persistent | Customer-specific model weights |
Customer Environment
| Component | Resources | Purpose |
|---|---|---|
| Proxy (Go binary) | Minimal (< 50MB RAM) | MCP interception, audit logging |
| CLI | Node.js | Policy management, audit verification |