NS-DocNavigator — intelligent document retrieval and knowledge system for manufacturing, engineering and quality teams.
Turn thousands of pages of manuals, SOPs, QMS documents, engineering specs and training material into instant, AI-powered answers — with page-level source citations and deployment on your own infrastructure.
A seven-minute walkthrough of how DocNavigator turns technical documents into cited, searchable AI answers.
Most organisations accumulate documentation faster than they can organise it. The result is hours lost in manuals, knowledge locked in PDFs and legacy formats, inconsistent interpretation, weak traceability, and slow onboarding for new staff.
NS-DocNavigator is not just a chatbot over files. It builds a searchable vector + graph knowledge base, then answers with cited evidence.
Upload documents, batch ingest files, fetch URLs, or crawl SharePoint/intranet sources. Docling parses text, tables and images from 30+ formats.
Semantic chunking, contextual retrieval, embeddings, image captioning and entity extraction build a LanceDB + FalkorDB knowledge layer.
Hybrid search, reranking, CRAG, multi-hop decomposition and graph context feed the LLM, producing answers with clickable citations.
Eight layered retrieval and reasoning techniques handle the failure modes that simple semantic search misses.
BM25 + dense vector search with Reciprocal Rank Fusion for exact part numbers and conceptual matches.
A cross-encoder reranks the shortlist by direct query-document relevance.
Checks relevance after rerank, reformulates and retries when the first pass misses.
Breaks complex questions into sub-questions across multiple documents and procedures.
Uses cause-and-effect relationships for troubleshooting and root-cause analysis.
FalkorDB maps components, procedures, specifications and their relationships.
Each chunk gets document-level context so isolated snippets remain meaningful.
Optional MCP tool calls let answers combine document knowledge with live system state.
PDF, DOCX, PPTX, XLSX, HTML, images, CSV, Markdown and legacy Office formats. Image-only PDFs trigger OCR; text PDFs skip OCR for speed.
Pages and figures are rendered, extracted and captioned by a vision model so schematics and illustrations are included in retrieval.
Single URL ingest, recursive crawl, NTLM authentication, dry-run preview, dead-link reports, scheduled refresh and pause/resume.
Group documents by plant, line, system or customer. Collection permissions overlay Admin, Editor and Viewer roles.
Export Q&A sessions with citations preserved for compliance hand-off, training material or offline reference.
Measure context precision, recall, faithfulness, answer relevancy and NS answer-completeness across evaluation runs.
The system is self-hosted, containerised and built around specialised stores for metadata, vectors and graph relationships.
Python 3.12+, FastAPI, Pydantic, asyncio, httpx and aiosqlite coordinate ingestion, retrieval, settings, authentication and streaming chat.
SQLite stores metadata, LanceDB stores vectors with Tantivy BM25 full-text indexing, and FalkorDB stores the knowledge graph.
BGE-M3 embeddings, BGE reranker and local vision models can run on GPU; CPU mode remains usable for smaller deployments and demos.
Built for teams that need fast answers without losing provenance or control of sensitive technical content.
Real UI from the current implementation: chat, document ingestion, provider settings, ingestion controls, retrieval controls and built-in help.
Ask natural-language questions and get cited answers with source documents, page references and follow-up prompts.
Upload files, fetch SharePoint/intranet URLs, crawl sites and manage processed documents by collection.
Configure LLM, embedding and vision providers, test connections and keep API keys masked in the UI.
Tune chunking, contextual retrieval, metadata enrichment, OCR, table recognition and parse timeouts.
Tune top‑k results, reranking, query expansion, knowledge graph retrieval, CausalRAG and corrective RAG.
Inline documentation covers overview, getting started, chat tips, query tips, settings and MCP tooling.
docker compose up starts the backend, MCP server and FalkorDBNS-DocNavigator is not isolated behind its own UI. It can expose its retrieval capability to other agents, and it can call external tools while answering.
Retrieval and chat endpoints are exposed through a stateless HTTP MCP endpoint at /mcp, protected by separate API-key authentication.
The generation loop can call configured MCP tools mid-answer, then continue reasoning with tool output included in the final response.
Useful for questions that combine manuals and procedures with live production state, tickets, analytics or internal systems.
Common questions about privacy, deployment, citations and real-world use.
No. The system builds a searchable vector + graph knowledge base, uses hybrid search and reranking, checks relevance, handles multi-hop queries, and cites its sources.
Yes. It is designed for self-hosted deployment. Documents and queries stay on your infrastructure unless you explicitly configure a cloud AI provider.
Yes. Answers include citations that link back to the document and page, with the supporting passage available for verification.
Yes. It supports single URL ingest, recursive web crawl, NTLM-authenticated SharePoint/intranet sources, scheduled refresh and dead-link reporting.
Looking for broader AI help? See AI Solutions or AI Agents for Manufacturing.
If your teams waste time searching manuals and SOPs, NS-DocNavigator is built to make that knowledge usable. Book a short call and we’ll discuss your documents, constraints, and a sensible pilot.
Prefer to start with quick wins? Try our Free Tools or browse all products.