MatrixOne

Git for Data

The Cloud-Native, Hyper-Converged Database

Snapshot·Branch·Merge·Time Travel·Rollback

The industry's first database to bring Git-style version control to data. Unify transactional, analytical, vector, and full-text workloads in a single system — MySQL compatible, AI-native, cloud-native.

MySQL CompatibleAI-NativeCloud-Native

Try It Free GitHub

Why It Matters

Git for Data is what lets Agents explore — safely

An autonomous agent is only as trustworthy as the data layer beneath it. Git for Data gives every agent an isolated branch to experiment in, and gives you snapshot, time-travel, and rollback so exploration never turns into risk.

Time Travel · Rollback

Production Data

RAG Agent

isolated branch

Fine-tune Agent

isolated branch

ETL Agent

isolated branch

Zero-Risk · Snapshot · Immutable Audit

Free to Explore

Every agent forks production data at zero cost and experiments inside a fully isolated sandbox — run wild, break nothing, and merge back only what works.

Branch per AgentZero-Cost CloneNo InterferenceSafe Merge

Safe by Design

Snapshot before every step, time-travel to any past state, roll back in one click, and keep an immutable audit trail — agents act on real data with zero risk.

SnapshotTime TravelInstant RollbackImmutable Audit

Powering Every Data Application

Ingest multimodal data, AI-driven processing, run hybrid workloads

Business Systems

OA, ERP, CRM, BI Analytics

IoT Applications

Real-time monitoring, tracing, prediction

AI Agents Applications

Smart assistants, doc AI, multimodal search

MatrixOne

AI-Driven Processing

Multimodal Data Ingestion

Cost Optimization

Performance Optimization

Hybrid Workloads

Flexible Deployment Options

Edge DevicesData CentersPrivate CloudPublic Cloud

One Database for Everything

Say goodbye to 4 databases, multiple ETL jobs, hours of data lag, and sync nightmares

MySQL

Transactions

ClickHouse

Analytics

Elasticsearch

Full-Text Search

Pinecone

Vector Search

MatrixOne

OLTPOLAPVectorSearch

Manage Data Like Code

The first database with Git for Data — every data change is traceable, reversible, and collaborative

Instant Snapshots

Zero-copy snapshots in milliseconds, no storage explosion

Time Travel

Query data as it existed at any point in history

Branch & Merge

Test migrations and transformations in isolated branches

Instant Rollback

Restore to any previous state without full backups

Complete Audit Trail

Track every data change with immutable history

Built for the AI Era

Built-in vector search, full-text search, and Agent data sandbox — no external vector databases needed

Vector + Full-Text Hybrid Search

Built-in IVF/HNSW vector indexes and full-text search engine. Supports billions-scale vector retrieval, directly powering RAG applications and semantic search.

IVF/HNSW IndexesHybrid QueriesBillions-Scale

AI Agent Data Sandbox

Git for Data branching naturally provides isolated data environments for AI Agents — one branch per agent, no interference, safe merging.

Branch IsolationZero-Cost CloneSafe Merge

Deep MOI AI Integration

As the core data engine of MatrixOne Intelligence, deeply integrated with MatrixPipeline, MatrixGenesis, and other AI components — providing a unified data foundation for enterprise AI applications.

MOI Data EngineAI PipelineUnified Foundation

vector_search.py

from matrixone import Client
client = Client()
client.connect(database='demo')

# Vector search
query = [0.2, 0.3, 0.4, 0.25, 0.35]
results = client.query(
    Article.title,
    Article.embedding.l2_distance(query)
).filter(
    Article.embedding.l2_distance(query) < 0.1
).execute()

Key Features

Built for Consolidation, Scale, and Intelligence

OLTP App

OLAP / BI App

MatrixOneHTAP

Workload Isolation

TP Zone

AP Zone

Shared Storage

Three-Layer Cloud-Native Architecture

Disaggregated storage-compute design — each layer scales independently

Compute Layer

•CN Compute Nodes
•Resource Group Isolation
•Auto-Scaling
•Hot Data Cache

Transaction Layer

•TN Transaction Nodes
•Raft Shared Log
•Conflict Detection
•HAKeeper

Storage Layer

•S3 Object Storage
•Unlimited Scaling
•Hot/Cold Separation
•Low Cost

Kubernetes

Compute Layer

CN Compute Nodes

Hot Data Cache

Transaction Layer

TN Transaction Nodes

Logtail

HAKeeper

Raft Shared Log

Storage Layer

S3 Object Storage

Unlimited Scaling

Hot/Cold Separation

S3 Object Storage

Flexible Deployment Options

From single-node to distributed, from private cloud to public cloud

Standalone

Primary-replica architecture for small to medium workloads

ReplicationCDC Sync

Object Storage

Single node + S3 object storage, balancing simplicity and elasticity

S3 StorageHot/Cold Tiering

K8s Distributed

Fully distributed deployment with unlimited horizontal scaling

K8s NativeAuto-ScalingMulti-Tenant

Get Started with MatrixOne

Start your AI data journey today

Community Edition

Free forever. Latest features, fast iteration, community-driven.

Download

Enterprise Edition

Professional support, dedicated solutions, expert SLA response.

Free Trial