Data Index Documentation
What is Data Index?
Data Index is a read-only query service for Serverless Workflow 1.0.0 runtime execution data. It provides a GraphQL API for querying workflow instances and task executions from Quarkus Flow applications.
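As a rough illustration of what querying the GraphQL API involves, the sketch below builds the JSON body of a standard GraphQL-over-HTTP POST request. The root field `workflowInstances`, its `limit` argument, and the selected fields (`id`, `name`, `status`) are assumptions made for the example; check the actual schema in the GraphQL API page before relying on them.

```python
import json

# Hypothetical query: the root field, argument, and selected fields below are
# illustrative assumptions, not taken from the Data Index schema.
QUERY = """
query RecentInstances($limit: Int) {
  workflowInstances(limit: $limit) {
    id
    name
    status
  }
}
"""


def build_graphql_request(query: str, variables: dict) -> bytes:
    """Serialize a GraphQL POST body: a JSON object with `query` and `variables`."""
    return json.dumps({"query": query, "variables": variables}).encode("utf-8")


body = build_graphql_request(QUERY, {"limit": 10})
```

The resulting bytes would be sent with `Content-Type: application/json` to whatever GraphQL endpoint your deployment exposes. Because the service is read-only, only queries (not mutations) are expected to be meaningful.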
How It Works
Data Index captures workflow execution events and makes them queryable via GraphQL:
Quarkus Flow App
↓ (structured logging to stdout)
FluentBit DaemonSet
↓ (tail container logs)
Storage Backend (PostgreSQL or Elasticsearch)
↓ (normalization)
GraphQL API
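The flow above can be sketched in a few lines: an application writes one JSON object per line to stdout, and a downstream step parses each tailed line into a queryable record. This is a minimal sketch, not the actual Quarkus Flow log format or FluentBit configuration; the field names (`eventType`, `instanceId`, `timestamp`) and both function names are hypothetical.

```python
import json
import sys
from datetime import datetime, timezone
from typing import Optional


def emit_event(event_type: str, instance_id: str, stream=sys.stdout) -> str:
    """Write one workflow event as a single JSON line, as a structured logger would."""
    line = json.dumps({
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "eventType": event_type,    # hypothetical field name
        "instanceId": instance_id,  # hypothetical field name
    })
    stream.write(line + "\n")
    return line


def normalize(line: str) -> Optional[dict]:
    """Parse a tailed log line; return None for lines that are not workflow events."""
    try:
        record = json.loads(line)
    except json.JSONDecodeError:
        return None  # ordinary, non-JSON log output
    if not isinstance(record, dict) or "eventType" not in record:
        return None  # JSON, but not an event in the assumed shape
    return {
        "instance_id": record.get("instanceId"),
        "event_type": record["eventType"],
        "timestamp": record.get("timestamp"),
    }
```

Keeping each event on a single line matters for this kind of pipeline, since a log tailer typically treats every line as one record.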
Storage Backends
Data Index supports two production-ready storage backends:
| Backend | Best For | Status |
|---|---|---|
| PostgreSQL (MODE 1) | Most deployments: ACID transactions, simple operations, <50K workflows/day | ✅ Production Ready |
| Elasticsearch (MODE 2) | High throughput (100K+ workflows/day), full-text search, advanced analytics | ✅ Production Ready |
Both backends provide:
- Same GraphQL API (backend-agnostic queries)
- Same data model (WorkflowInstance, TaskExecution)
- Production-grade reliability and performance
Choose based on your requirements; see the Decision Matrix for a detailed comparison.
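The guidance in the table can be written down as a small rule of thumb. The function below is only a sketch of those stated thresholds (under 50K workflows/day for PostgreSQL, 100K+ or full-text search for Elasticsearch); the function name and the `"undecided"` result for the range the table does not cover are assumptions, and the Decision Matrix remains the authoritative comparison.

```python
def suggest_backend(workflows_per_day: int, needs_full_text_search: bool = False) -> str:
    """Map the table's rough thresholds to a backend suggestion."""
    if workflows_per_day < 0:
        raise ValueError("workflows_per_day must be non-negative")
    # Full-text search and 100K+ workflows/day are listed under Elasticsearch.
    if needs_full_text_search or workflows_per_day >= 100_000:
        return "elasticsearch"
    # Under 50K workflows/day is listed under PostgreSQL.
    if workflows_per_day < 50_000:
        return "postgresql"
    # The table gives no guidance between 50K and 100K workflows/day.
    return "undecided"
```

For example, `suggest_backend(20_000)` suggests PostgreSQL, while `suggest_backend(20_000, needs_full_text_search=True)` suggests Elasticsearch regardless of volume.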
Quick Links
- Getting Started - Install and verify Data Index
- Quarkus Flow Apps - Structure and deploy your workflows
- GraphQL API - Query workflow data
- Architecture - Understand how it works
What Data Index Does NOT Do
Data Index is a read-only query service. It does NOT:
System Requirements
Core:
- Kubernetes cluster (or KIND for local development)
- FluentBit 2.0+
- Quarkus Flow applications with structured logging enabled
Storage backend (choose one):
- PostgreSQL 13+ (MODE 1)
- Elasticsearch 7.10+ (MODE 2)
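A pre-install check against these minimum versions might look like the sketch below. The component keys and function names are hypothetical, and the parser only handles plain dotted numeric versions such as `7.10.2`; how you obtain the version string from each component is left to your environment.

```python
# Minimum versions from the requirements above, as integer tuples.
MINIMUMS = {
    "fluentbit": (2, 0),
    "postgresql": (13,),
    "elasticsearch": (7, 10),
}


def parse_version(text: str) -> tuple:
    """Turn a dotted numeric version such as '7.10.2' into (7, 10, 2)."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(component: str, version: str) -> bool:
    """Compare numerically, so that 7.9 is correctly treated as older than 7.10."""
    minimum = MINIMUMS[component]
    parsed = parse_version(version)
    # Pad the shorter tuple with zeros so '13' and '13.0' compare as equal.
    width = max(len(parsed), len(minimum))
    padded_parsed = parsed + (0,) * (width - len(parsed))
    padded_minimum = minimum + (0,) * (width - len(minimum))
    return padded_parsed >= padded_minimum
```

Comparing integer tuples rather than strings avoids the common mistake of ranking `7.9` above `7.10`.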
Next Steps
Ready to get started? Check out the Getting Started Guide.
Already have Data Index running? Learn how to structure and deploy your Quarkus Flow applications.