QANATIX Documentation
Data middleware for AI — connect your enterprise data, query it from any AI model with zero hallucination.
What is QANATIX?
QANATIX is a data middleware that sits between your enterprise data sources and AI models. You push in messy data — CSVs, JSON, PDFs, database dumps, SAP exports — and QANATIX normalizes it, stores it in Postgres, and makes it instantly queryable via MCP or REST API.
Your AI agent calls qanatix_search() and gets back ranked, verified results from your actual data — not scraped web content, not hallucinated guesses.
Why QANATIX?
LLMs hallucinate on 61% of business data queries. Pricing, stock levels, compliance data, supplier specs — all locked in ERPs and databases that AI can't reach. QANATIX fixes this.
| Problem | QANATIX solution |
|---|---|
| AI can't see your ERP/database data | 10+ connectors: PDF, CSV, JSON, XML, Postgres, MySQL, MongoDB, Neo4j, SAP IDoc, streaming |
| Hallucinated answers | Every result comes from your verified data with source attribution |
| 800+ tokens per result from web scraping | ~120 tokens per result, structured and compressed |
| No tenant isolation | Every tenant is fully isolated — database and cache |
| Vendor lock-in | Works with any LLM: Claude, GPT, Gemini, open-source |
Architecture
Your Data Sources QANATIX AI Consumers
───────────────── ───────────────────────── ─────────────────
PDF / CSV / JSON ──→ Extract Claude (MCP)
PostgreSQL ──→ Normalize GPT-4o (REST)
MySQL / MongoDB ──→ Index ──→ Gemini (REST)
Neo4j ──→ Store (Postgres) LangChain / SDK
SAP IDoc XML ──→ Instantly queryable Your Agent
API / Webhooks ──→ Cursor IDE
NDJSON stream ──→Key features
- 10+ data sources — file upload, database pull, real-time push
- Instant availability — data is queryable the moment it's pushed
- Full-text search + structured filters — every field in your data is automatically filterable
- MCP native — Claude calls
qanatix_search()directly via Streamable HTTP - REST API — works with any LLM or application
- Python SDK —
pip install qanatix— typed client with sync + async - Self-service collections — upload your data, fields are auto-detected
- Tenant isolation — every query scoped to your tenant_id
- Self-hostable — Docker Compose on your infra, air-gapped support
- EU Cloud — managed deployment in Frankfurt
- Response caching — 30s response cache for repeated queries
- QANATIX Open — public data hub, free to query, no auth needed
Quick links
- Quickstart — zero to first query in 5 minutes
- Python SDK — pip install qanatix
- QANATIX Open — free public data hub (no auth needed)
- Authentication — API keys and scopes
- Data Import — connect your data sources
- Search — how search works
- MCP Integration — connect to Claude
- Self-hosted — deploy on your infra
- API Reference — all endpoints