RAG engine for secure document intelligence
Query your files securely with grounded, traceable source citations.
RAG Pipeline Trace
> What is our log retention period?
"...Log data is retained for 90 days. After this retention period, audit logs are permanently deleted..."
Ingestion & Retrieval Pipeline
Every document is split, embedded, and indexed with user isolation. Access your database through raw REST endpoints.
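User isolation at search time can be pictured with a small in-memory stand-in for the vector index. This is only a sketch under stated assumptions: the `index` dictionary, the `scoped_top_k` helper, the per-user namespace keys, and the three-dimensional toy vectors are all hypothetical and are not the service's actual storage layout or API.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def scoped_top_k(index, namespace, query_vec, document_ids, k=3):
    """Rank chunks from one namespace only, restricted to the given documents."""
    allowed = set(document_ids)
    scored = [
        (cosine(query_vec, record["vector"]), record["id"])
        for record in index.get(namespace, [])  # other namespaces are never read
        if record["document_id"] in allowed
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Hypothetical toy index: one namespace per user (an assumption for illustration).
index = {
    "user_a": [
        {"id": "c1", "document_id": "doc_policy_v4", "vector": [1.0, 0.0, 0.0]},
        {"id": "c2", "document_id": "doc_policy_v4", "vector": [0.6, 0.8, 0.0]},
        {"id": "c3", "document_id": "doc_other", "vector": [1.0, 0.0, 0.0]},
    ],
    "user_b": [
        {"id": "c4", "document_id": "doc_policy_v4", "vector": [1.0, 0.0, 0.0]},
    ],
}

hits = scoped_top_k(index, "user_a", [1.0, 0.0, 0.0], ["doc_policy_v4"], k=2)
print(hits)
```

Because the lookup starts from a single namespace key, a query issued for `user_a` cannot surface `user_b`'s chunk even when the document ID and vector are identical.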
Pipeline Stages
Multipart binary upload stream
FastAPI ingests PDF, DOCX, or TXT uploads as file buffers
Text extraction & layout parsing
Extract metadata, pages, structural headers, and layouts
Recursive character chunking
500-token sliding windows with a 10% (50-token) overlap between adjacent chunks
Vector representation
Generate 1024-dimensional embeddings via Gemini API
Namespace vector search
Pinecone top-k cosine similarity filtered by document scope
Context-grounded LLM inference
LLM answers the query grounded strictly in the matching document chunks
Source verification
Verify token offsets and map citations directly to source pages
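The chunking stage above can be sketched as a plain sliding window. This is a minimal illustration, not the engine's implementation: the `chunk_tokens` helper is hypothetical, whitespace splitting stands in for a real tokenizer, and the recursive separator-based splitting named in the stage title is left out for brevity.

```python
def chunk_tokens(tokens, window=500, overlap_ratio=0.10):
    """Split a token list into fixed-size windows that overlap by a fraction."""
    if window <= 0:
        raise ValueError("window must be positive")
    if not 0 <= overlap_ratio < 1:
        raise ValueError("overlap_ratio must be in [0, 1)")

    overlap = round(window * overlap_ratio)  # 50 tokens for the defaults
    step = window - overlap                  # each window starts 450 tokens later

    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + window])
        if start + window >= len(tokens):
            break  # the last window already reached the end of the input
    return chunks


# Whitespace splitting is a stand-in for a real tokenizer (assumption).
tokens = ("word " * 1200).split()
chunks = chunk_tokens(tokens)
print([len(chunk) for chunk in chunks])
```

With the defaults, each chunk repeats the final 50 tokens of the previous one, so a sentence that straddles a boundary still appears whole in at least one chunk.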
import httpx

client = httpx.Client(base_url="https://api.simplify.ai/v1")

# Scope vector search to document namespaces
response = client.post(
    "/chat/query",
    headers={"Authorization": "Bearer sk_live_9a2f"},
    json={
        "query": "What is the log retention period?",
        "document_ids": ["doc_policy_v4"],
        "response_mode": "rag_mode",
        "parameters": {
            "temperature": 0.2,
            "max_tokens": 1024
        }
    }
)
payload = response.json()
print(f"Answer: {payload['content']}")
print(f"Sources: {len(payload['citations'])} cited.")

Console Interface
Monitor vector spaces, verify source citations, and manage isolated document namespaces.
Based on the uploaded board_deck_example.pdf:
- QoQ revenue expansion reached 18%, bringing consolidated revenue to $12.4M.
- Enterprise RAG contract expansions accounted for 72% of new recurring growth.
- Operational hosting overhead decreased by 14% via vector similarity cache optimization.
"...consolidated Q4 revenue reached $12.4M, representing an 18% growth quarter-over-quarter..."
"...recurring growth expansion vectors were heavily anchored in enterprise client accounts..."