Apache Texera (Incubating)

Apache Texera is a powerful and scalable workflow system for big data analytics and visualization.

Texera Features

Collaborative Data Science

Work together in real time from any browser. Share data and workflows, edit simultaneously, and track every change with built-in version control.

Graphical Workflow Interface

Build workflows visually with drag-and-drop tools. See live status updates and interact with executions as they run.

Expandable AI/ML Access

Use AI and ML operators with the integrated editor. Supports Python, R, and Java/Scala for flexible development.

Unified Data Access

Connect to databases, cloud storage, and APIs. Combine different data sources easily in one streamlined workflow.

Interactive Visualization

Watch results appear as you work. Monitor performance, explore outputs, and share interactive charts instantly.

Integration & Extensibility

Extend Texera with custom plugins or connect to various tools. Designed to grow with your data ecosystem.

Apache Incubator
License: Apache 2.0

Users: 332
Projects: 86
Workflows: 2,481
Executions: 51K
Workflow Versions: 357K
Deployments: 7
Largest Deployment: 100 nodes, 400 cores

apache/texera

Collaborative Machine-Learning-Centric Data Analytics Using Workflows

Scala · 208 stars · 109 forks