System Architecture

Quint is a multi-service platform that intercepts AI agent actions, scores them through a graph-native pipeline, and optionally blocks dangerous actions in real time.

High-Level Architecture

┌─────────────────────────────────────────────────────────────────┐
│                    QUINT PLATFORM                                │
│                                                                  │
│  ┌──────────┐    ┌──────────────┐    ┌───────────────────┐     │
│  │  Proxy   │───▶│  REST API    │───▶│  GraphReasoner    │     │
│  │  (Go)    │    │  (FastAPI)   │    │  (quint-graph)    │     │
│  └──────────┘    └──────┬───────┘    └────────┬──────────┘     │
│                         │                      │                 │
│                    ┌────┴─────┐          ┌─────┴──────┐         │
│                    │          │          │            │          │
│               ┌────▼──┐  ┌───▼───┐  ┌───▼────┐  ┌───▼────┐   │
│               │Postgre│  │ Redis │  │Memgraph│  │ Gemini │   │
│               │SQL    │  │       │  │        │  │ (LLM)  │   │
│               └───────┘  └───────┘  └────────┘  └────────┘   │
│                                                                  │
│  ┌──────────────┐    ┌───────────────┐                         │
│  │ Risk Engine  │    │  CLI          │                          │
│  │ (Modal GPU)  │    │  (TypeScript) │                          │
│  └──────────────┘    └───────────────┘                         │
└─────────────────────────────────────────────────────────────────┘

Component Map

Component	Language	Purpose	Deployment
quint-proxy	Go	MCP gateway proxy, stdio interception, signed audit logs	Customer environment
quint-infra	Python (FastAPI)	REST API, event scoring, policy management	Railway
quint-graph	Python	GraphReasoner, forward-chaining, GNN, RAG, Memgraph	Library (pip)
quint-cli	TypeScript	Developer CLI, policy management, audit verification	npm package
risk-engine	Python	GPU model training (Qwen3-8B-AWQ), LoRA fine-tuning	Modal
quint-proto	Protobuf	Shared schema contract for all services	buf.build

Request Flow

Data Flow

Event Lifecycle

Interception

The proxy captures an outbound agent action (MCP tool call, API request, database query). It normalizes the action to canonical domain:scope:verb format and extracts metadata (agent, session, target, data fields).

Ingestion

The event is sent to the REST API via POST /events. The API validates the API key, checks rate limits, and persists the event to PostgreSQL.

Scoring

The GraphReasoner evaluates the event through 4 layers: intrinsic risk, GNN structural analysis, policy violations, and temporal anomaly detection. If confidence is below 0.8, compliance context is retrieved from Memgraph and injected into a Gemini LLM call.

Response

The score (1-100), risk level, violations, compliance references, and mitigations are returned to the proxy. The proxy enforces the verdict (allow, flag, or block).

Audit

The proxy creates a signed audit entry (Ed25519) chain-linked to the previous entry. The full audit trail is tamper-evident and exportable.

Deployment Topology

Railway (Production API)

Service	Resources	Purpose
API Server	2 vCPU, 1GB RAM	FastAPI + Uvicorn (4 workers)
PostgreSQL	Managed	Event storage, scores, customers
Redis	Managed	L1 cache, rate limiting
Memgraph	256MB RAM (optional)	Graph reasoning co-processor

Production URL: https://api-production-56df.up.railway.app

Service	Resources	Purpose
Risk Engine	A10G GPU, 40GB VRAM	Qwen3-8B-AWQ inference
Training Jobs	A10G GPU	LoRA fine-tuning per tenant
Weights Volume	Persistent	Customer-specific model weights

Customer Environment

Component	Resources	Purpose
Proxy (Go binary)	Minimal (< 50MB RAM)	MCP interception, audit logging
CLI	Node.js	Policy management, audit verification

Dependencies

quint-infra
  ├── quint-graph[memgraph]  (pip, from GitHub)
  ├── FastAPI + Uvicorn
  ├── SQLAlchemy (async) + asyncpg
  ├── Redis (aioredis)
  ├── google-generativeai (Gemini)
  ├── grpcio (risk-engine client)
  ├── modal (GPU deployment)
  └── pydantic + pydantic-settings

quint-graph
  ├── networkx (in-memory graph)
  ├── neo4j (Memgraph Bolt driver)
  ├── torch + torch-geometric (optional, GNN)
  └── compliance_ontology.json (1,948 nodes)

quint-proxy
  ├── gen/go/quint/v1 (from quint-proto)
  └── Go stdlib (crypto, net)

quint-cli
  ├── gen/ts (from quint-proto)
  └── Node.js

Architecture & Workflows

​System Architecture

​High-Level Architecture

​Component Map

​Request Flow

​Data Flow

​Event Lifecycle

​Deployment Topology

​Railway (Production API)

​Modal (GPU Training)

​Customer Environment

​Dependencies