Features
Everything that ships in SynapCores Community Edition. Items marked Enterprise require an EE license.
Quick summary
SynapCores is a single-binary AI database. Community Edition is free and runs everything below on a single host. Enterprise Edition adds clustering, fine-grained RBAC/SSO, CDC, encryption-at-rest, and SLA-backed support.
- Graph + Vector + LLM in a single SQL/Cypher query
- 161 ready-to-run recipes shipped with the binary
- Bundled local LLM (Llama 3.2 1B GGUF) — no API keys required
- Multi-provider vision (OpenAI / Anthropic / Ollama LLaVA), configurable in the UI
- Filesystem Collections — drop a file in a folder, query it as SQL
- Built in Rust — single binary, no JVM, no Python runtime
1. SQL engine
1.1 SQL surface
- CREATE / DROP / ALTER / INSERT / UPDATE / DELETE / SELECT
- JOINs (INNER / LEFT / RIGHT / FULL / CROSS)
- Subqueries, CTEs (including recursive)
- UNION / INTERSECT / EXCEPT
- Window functions (ROW_NUMBER / RANK / LAG / LEAD / ...)
- GROUP BY ... HAVING, ORDER BY multi-column
- Stored procedures with control flow + error handling
- Triggers (BEFORE / AFTER on INSERT / UPDATE / DELETE)
- Views (regular and materialized)
- ACID transactions
1.2 Data types
Standard: INTEGER, BIGINT, REAL, DOUBLE, TEXT, VARCHAR, CHAR, BOOLEAN, JSON, JSONB, UUID, TIMESTAMP, DATE, TIME, DECIMAL, BYTEA. AI-native: VECTOR(dimensions), AUDIO, VIDEO, IMAGE, PDF.
1.3 Query optimizer
- Cost-based optimizer (default)
- Plan cache with literal-stripped hashing + PG-style hedging
- Result cache (ON by default)
- Predicate / projection pushdown
- Join reordering
- PK fast-path for point lookups (102× on oltp_point_select)
2. Vector database
2.1 Vector storage + search
-- Cosine distance
embedding <=> query_vector
-- Euclidean distance
embedding <-> query_vector
-- Inner product
embedding <#> query_vector- HNSW index (approximate NN, configurable M / ef)
- Flat index (exact NN for small collections)
- Cosine / euclidean / inner-product distance metrics
- Metadata filtering at query time
- MVCC-aware reads (transactional consistency)
2.2 Embedding generation
-- Bundled MiniLM (no external service)
SELECT EMBED('wireless headphones');
-- Or specify a different model
SELECT EMBED(text, 'minilm');
SELECT EMBED(text, 'bert-base');MiniLM is bundled and runs in-process. Other embedding models can be plugged via the AI service config (Ollama / OpenAI / Cohere / HuggingFace).
3. Graph database (new in v1.5)
SynapCores v1.5 ships a property-graph engine that runs Cypher inside the same query surface as SQL. You can mix structural pattern matches, vector similarity hops, and LLM scoring in a single query.
3.1 Cypher language
- MATCH / WHERE / RETURN / ORDER BY / LIMIT
- MERGE for upsert semantics
- WITH chaining + UNWIND
- UNION / UNION ALL
- Multi-statement bodies (recipe-style setup blocks)
- CALL <procedure> for graph algorithms
3.2 Hybrid pattern + similarity (SIMILAR_TO)
MATCH (seed:Complaint {id: $id})-[:SIMILAR_TO > 0.85]->(c:Complaint)
WHERE c.id <> seed.id
MATCH (c)-[:DESCRIBES]->(p:Product)<-[:SUPPLIES]-(s:Supplier)
WHERE s.audited_year = 2025
WITH s, p, c, llm_score('rate severity 0..1', c) AS severity
WHERE severity > 0.5
RETURN s.name, p.name, c.id, severity
ORDER BY severity DESC LIMIT 50;SIMILAR_TO is a synthetic edge type backed by HNSW — you ship vector hops as a regular Cypher relationship. llm_score() is a scalar that pipes the row through a configured LLM and parses the numeric score, usable in WHERE and ORDER BY.
3.3 Graph algorithms (GDS)
- PageRank
- Louvain (community detection)
- Label Propagation
- Triangle Count
- Single-source shortest path (more in v1.6)
3.4 Graph CDC + replication
Change data capture from a graph (replicate node / edge mutations out to a downstream sink) is Enterprise.
4. AI Chat + NL2SQL
4.1 Bundled local LLM
CE ships with llama-3.2-1b-instruct-q4_k_m.gguf for in-process inference via llama-cpp. No external API calls, no Ollama daemon, no API keys to start. Drop a different GGUF in the models directory to swap to a larger model (Phi-3-mini, Mistral-7B, etc.).
4.2 External provider plugins
- OpenAI (GPT-4 / GPT-4o / GPT-4o-mini)
- Anthropic (Claude 3 / 3.5 Sonnet / Haiku)
- Ollama (any locally-pulled model)
- Cohere
- HuggingFace inference API
- Native (in-process llama-cpp via GGUF)
4.3 NL2SQL
ASK 'Show me top 5 products by revenue last quarter'
ASK 'Find similar complaints to CX-9001 from suppliers we audited in 2025'Schema-aware: the planner injects the live table catalog into the LLM prompt. Falls back to a deterministic pattern matcher when no LLM is configured.
5. Vision / multimodal (new in v1.5)
Filesystem Collections and Media Gallery use a vision-capable LLM to describe images you ingest. The provider is configured from the Settings UI (or via env vars):
- OpenAI GPT-4o / GPT-4o-mini (~$0.001 / image)
- Anthropic Claude 3.5 Sonnet / Haiku
- Local Ollama LLaVA (free, slower)
- OCR (Tesseract bundled — works zero-config)
API key encrypted at rest with AES-256-GCM (key derived from AIDB_JWT_SECRET via HKDF-SHA256). Settings live at <data_dir>/system/vision.json, mode 0600.
6. Filesystem Collections
Drop a PDF, image, CSV, audio, or video into a watched folder and SynapCores ingests it automatically: text gets chunked + embedded, images get captioned + OCR'd, audio/video get transcribed (when a Whisper model is on the host).
- Drop-a-file-into-a-folder ingest pipeline
- PDF / Office / Markdown text extraction
- OCR for screenshots and scanned docs
- Image captioning via configured vision provider
- Whisper transcription for audio and video (drop the model file)
- Live progress events via WebSocket
- Re-process individual documents
- CREATE COLLECTION SQL syntax
7. Recipes + AutoML
7.1 Recipe library
161 ready-to-run recipes ship inside the binary, organized by category (graph, core foundations, image management, document processing, ML hello-world, security & surveillance, ...). Browse + execute from the bundled web UI.
7.2 AutoML
CREATE EXPERIMENT churn AS
SELECT customer_id, age, tenure, monthly_charges, churned
FROM customers
WITH (
task_type='binary_classification',
target_column='churned',
max_trials=50,
algorithms=['random_forest', 'xgboost', 'neural_network']
);
DEPLOY MODEL best_model FROM EXPERIMENT churn;
PREDICT churn_probability USING churn_model
AS SELECT * FROM new_customers;- Classification (binary + multi-class)
- Regression
- Time-series forecasting
- Clustering
- Anomaly detection
- Automated feature engineering
- Hyperparameter sweeps
8. Performance
Community Edition ships ten waves of optimizer + storage work as on-by-default behavior:
- Unified optimizer with PG-style plan-cache hedging (default ON)
- Result cache (default ON)
- Binary big-endian PK keys for point-select fast path
- Bincode V1 doc storage (replaces JSON)
- RocksDB range_iter for efficient range scans
- Audit-log O(1) hot path (no more O(n) per query)
- Lock-free executor reads (ArcSwap)
- 102× improvement on oltp_point_select vs. v1.4 baseline
Reproducible benchmark harness ships in aidb-bench; PG / MariaDB / MySQL comparison numbers are published in the release notes.
9. Multi-tenancy
Tenant isolation is Enterprise. CE binaries actively license-lock the tenant_isolation.enabled config flag — setting it in CE has no effect (ignored at config-load with a warning). Single-tenant deployments — laptops, single VMs, dedicated single-team hosts — are CE's sweet spot.
10. Clustering / replication / CDC
Multi-node clustering (Raft consensus + replication) and inbound CDC from MySQL/Postgres binlog are Enterprise. CE is single-host by design. Backup and restore are in CE; scheduled backups + PITR are Enterprise.
11. Security
- JWT auth (CE)
- API key auth (CE)
- Argon2 password hashing (CE)
- TLS via rustls (CE)
- Encrypted vision-provider keys (CE — AES-256-GCM)
Fine-grained RBAC, SSO/SAML, LDAP, encryption-at-rest for the whole datastore, and audit-unlimited are Enterprise.
12. Developer experience
- Single binary — no JVM, no Python runtime
- REST API + WebSocket + native socket protocol
- Bundled React web UI on the same port
- Bundled docs (161 recipes + reference)
- Cypher and SQL on the same connection
- Prepared statements with $1 / $2 PG-style placeholders
Where to next
Download CE to try everything above on your laptop. For clustering, RBAC/SSO, or paid SLA-backed support, contact sales.