Platform · Ingestion & Connectors

Ingest anything.

Code, schemas, APIs, docs. Multimodal ingestion. everything your enterprise produces, parsed, chunked, embedded, and linked to your ontology. AI from step one, not bolted on later.

What We Ingest

Code at function granularity.
Not line-level chunks.

tree-sitter reads source code and breaks it into structural components (functions, classes, methods) rather than treating it as raw text. AST-based chunking guarantees every chunk is a complete, meaningful unit.

9 Languages · AST-Level Parsing

🐍Python
🔵Go
Java
📘TypeScript
📒JavaScript
🦀Rust
⚙️C
⚙️C++
🟣C#

Plus: database schemas, API contracts (OpenAPI/Swagger, GraphQL), BPMN workflows, documentation (Confluence, Markdown). Every chunk gets a content-addressed SHA-256 ID.

Local AI First

AI from step one. Not bolted on later.

Embeddings

FastEmbed

BAAI/bge-small-en-v1.5. 384 dimensions. ONNX runtime. CPU-only. No cloud API calls.

NER · AI

GLiNER

Zero-shot NER. Technologies, organizations, standards. No training per entity type.

NER · Rules

spaCy EntityRuler

Hand-written patterns for known entities. Deterministic matching complementing GLiNER.

Two Ingestion Modes

Fast mode or enriched mode.
You choose.

Fast Mode

$0/chunk

Local math only. Steps 1–7. Parse, embed, link, propagate.

  • AST parsing + chunking
  • Local embedding + NER
  • HyDE concept linking
  • Hierarchy propagation
🧠

Enriched Mode

Default
$0.008/chunk

Full 8-step pipeline. 4-pass LLM enrichment.

  • Everything in Fast Mode
  • Contextual embedding
  • Semantic analysis
  • Cross-reference
  • Quality scoring
  • Smart model routing

Plug-and-Play Connectors

Your stack. No custom middleware.

Source Code

GitHubGitLabBitbucket

APIs

OpenAPI / SwaggerGraphQLgRPC / Protobuf

Databases

PostgreSQLMySQLMongoDBSnowflakeBigQuery

Infrastructure

TerraformKubernetesDockerCloudFormationAzure ARM

Project Management

JiraConfluenceLinear

Data Pipelines

dbtAirflowDagster

Deterministic Foundation

The foundation stays deterministic.

Humans own the concepts. Math does the linking.
Every AI-generated enrichment carries provenance.
Enterprise reliability without enterprise complexity.

Platform · Ingestion & Connectors

See ingestion in action.

Watch the platform ingest, parse, and link a live codebase. 9 languages, AST-level chunking, $0.008/chunk enriched.

This site uses cookies

We use essential cookies for the site to function and analytics cookies (Google Analytics) to understand how you use it. Analytics cookies are only activated with your consent. We do not track you across other websites. Your data is stored in the EU and processed in accordance with GDPR. Read our Privacy Policy

CoherenceCoherence