Proton OSS V3 – Fastest Vectorized Streaming Processing SQL Engine

22 hours ago 1

Website Demo Apache 2.0 License

The Timeplus company logo.

Fastest SQL pipeline engine for stream processing, analytics, observability and AI

    Release  YouTube  Slack  follow on LinkedIn  X 

🚀 The fastest SQL pipeline engine in a single C++ binary, for stream processing, analytics, observability and AI. A simple, fast and efficient alternative to ksqlDB and Apache Flink, powered by ClickHouse engine.

🔥 SQL for everything : Native source/sink (Kafka, ClickHouse, MySQL, Postgres, S3/Iceberg etc.), Append-only or mutable stream, Multi-stream JOINs, Incremental Materialized View, Alert, Task, UDF in Python/JS etc.

⚡ No JVM. No ZooKeeper. Zero dependencies. Just speed, control and scale.

Timeplus Architecture

Get started in seconds

curl https://install.timeplus.com/oss | sh
  • Apache Flink or ksqlDB alternative. Timeplus Proton provides powerful stream processing functionalities, such as streaming ETL, tumble/hop/session windows, watermarks, incremental materialized views maintenance, CDC and data revision processing. In contrast to pure stream processors, it also stores queryable analytical/row based materialized views within Proton itself for use in analytics dashboards and applications.

  • Fast. Timeplus Proton is written in C++, with optimized performance through SIMD. For example, on an Apple MacBookPro with M2 Max, Timeplus Proton can deliver 90 million EPS, 4 millisecond end-to-end latency, and high cardinality aggregation with 1 million unique keys.

  • Lightweight. Timeplus Proton is a single binary (<500MB). No JVM or any other dependencies. You can also run it with Docker, or on an AWS t2.nano instance (1 vCPU and 0.5 GiB memory).

  • Powered by the fast, resource efficient and mature ClickHouse. Timeplus Proton extends the historical data, storage, and computing functionality of ClickHouse with stream processing. Thousands of SQL functions are available in Timeplus Proton. Billions of rows are queried in milliseconds.

  • Best streaming SQL engine for Kafka or Redpanda. Query the live data in Kafka or other compatible streaming data platforms, with external streams.

See our architecture doc for technical details and our FAQ for more information.

How is it different from ClickHouse

ClickHouse is an extremely performant Data Warehouse built for fast analytical queries on large amounts of data. While it does support ingesting data from streaming sources such as Apache Kafka, it is itself not a stream processing engine which can transform and join streaming event data based on time-based semantics to detect patterns that need to be acted upon as soon as it happens. ClickHouse also has incremental materialized view capability but is limited to creating materialized view off of ingestion of blocks to a single table. Proton uses ClickHouse as a table store engine inside of each stream (alongside a Write Ahead Log and other data structures) and uses to unify real-time and historical data together to detect signals in the data. In addition, Proton can act as an advanced data pre-processor for ClickHouse (and similar systems) where the bulk of the data preparation and batching is done ahead of ingestion. See Timeplus and ClickHouse for more details on this.

Timeplus Proton empowers you to build a wide range of real-time applications and data pipelines. Common use cases include:

  • Streaming ETL & Data Preparation: Efficiently ingest data from sources like Kafka, perform in-flight transformations (filtering, enrichment, masking), and route it to downstream systems, including data warehouses like ClickHouse, other Kafka topics, or analytical stores.

  • Real-time Analytics & Dashboards: Continuously transform and aggregate high-volume streaming data (e.g., user activity, IoT sensor data, application logs) to populate live dashboards, enabling immediate operational insights and data-driven decisions.

  • Real-time Monitoring & Alerting: Define complex event patterns and continuous queries to monitor key performance indicators (KPIs), detect anomalies or threshold breaches in real-time, and trigger immediate alerts or automated actions.

  • Personalization & Recommendation Engines: Analyze streaming user interaction data (clicks, views,purchases) to update user profiles dynamically and serve personalized content or product recommendations with low latency.

  • Log Analytics & Observability: Process and analyze application and system logs as they are generated to gain insights into system behavior, troubleshoot issues faster, and improve overall observability.

2-minute short video👇. Check out the full video at YouTube.

DataTalksProtonHandbrake.mp4
curl https://install.timeplus.com/oss | sh

Once the proton binary is available, you can run proton server to start the server and put the config/logs/data in the current folder proton-data. Then use proton client in the other terminal to start the SQL client.

For Mac users, you can also use Homebrew to manage the install/upgrade/uninstall:

brew install timeplus-io/timeplus/proton
docker run -d --pull always -p 8123:8123 -p 8463:8463 --name proton d.timeplus.com/timeplus-io/proton:latest

Please check Server Ports to determine which ports to expose, so that other tools can connect to Timeplus, such as DBeaver.

The Docker Compose stack demonstrates how to read/write data in Kafka/Redpanda with external streams.

Don't want to setup by yourself? Try Timeplus in Cloud

SQL is the main interface. You can start a new terminal window with proton client to start the SQL shell.

Note

You can also integrate Timeplus Proton with Python/Java/Go SDK, REST API, or BI plugins. Please check Integrations

In the proton client, you can write SQL to create External Stream for Kafka or External Table for ClickHouse.

For example, you can read from AWS MSK and write the data to ClickHouse for the following SQL:

-- Read from AWS MSK using IAM Role CREATE EXTERNAL STREAM aws_msk_stream ( device string, temperature float ) SETTINGS type='kafka', brokers='prefix.kafka.us-west-2.amazonaws.com:9098', topic='topic', security_protocol='SASL_SSL', sasl_mechanism='AWS_MSK_IAM'; -- Write to ClickHouse CREATE EXTERNAL TABLE ch_aiven SETTINGS type='clickhouse', address='abc.aivencloud.com:28851', user='avnadmin', password='..', secure=true, table='events'; -- Setup a long-running materialized view to write aggregated data to ClickHouse CREATE MATERIALIZED VIEW mv_msk2ch INTO ch_aiven AS SELECT window_start as timestamp, device, avg(temperature) as avg_temperature FROM tumble(aws_msk_stream, 10s) GROUP BY window_start, device;

If you don't have immediate access to Kafka or ClickHouse, you can also run the following SQL to generate random data:

-- Create a stream with random data CREATE RANDOM STREAM devices( device string default 'device'||to_string(rand()%4), temperature float default rand()%1000/10); -- Run the streaming SQL SELECT device, count(*), min(temperature), max(temperature) FROM devices GROUP BY device;

You should see data like the following:

┌─device──┬─count()─┬─min(temperature)─┬─max(temperature)─┐ │ device0 │ 2256 │ 0 │ 99.6 │ │ device1 │ 2260 │ 0.1 │ 99.7 │ │ device3 │ 2259 │ 0.3 │ 99.9 │ │ device2 │ 2225 │ 0.2 │ 99.8 │ └─────────┴─────────┴──────────────────┴──────────────────┘

To see more examples of using Timeplus Proton, check out the examples folder.

To access more features, such as sources, sinks, dashboards, alerts, and data lineage, try Timeplus Enterprise locally.

What features are available with Timeplus Proton versus Timeplus Enterprise?

Timeplus ProtonTimeplus Enterprise
Deployment
  • Single-node Docker image
  • Single binary on Mac/Linux
  • Single node, or
  • Cluster
  • Kubernetes-based self-hosting
Data sources
  • Random streams
  • External streams to Apache Kafka, Apache Pulsar, Confluent Cloud, Redpanda
  • External streams to another Timeplus Proton or Timeplus Enterprise deployment
  • External tables to ClickHouse
  • Streaming ingestion via REST API (compact mode only)
  • Everything in Timeplus Proton
  • WebSocket and HTTP Stream
  • NATS
  • CSV upload
  • Streaming ingestion via REST API (with API key and flexible modes)
  • Hundreds of connectors from Redpanda Connect
Data destinations (sinks)
  • External streams to Apache Kafka, Apache Pulsar, Confluent Cloud, Redpanda
  • External streams to another Timeplus Proton or Timeplus Enterprise deployment
  • External tables to ClickHouse
  • Everything in Timeplus Proton
  • Slack
  • Webhook
  • Hundreds of connectors from Redpanda Connect
Support
  • Community support from GitHub and Slack
  • Enterprise support via email, Slack, and Zoom, with a SLA

The following drivers are available:

Integrations with other systems:

We publish full documentation for Timeplus Proton at docs.timeplus.com alongside documentation for Timeplus Enterprise.

We also have a FAQ for detailing how we chose Apache License 2.0, how Timeplus Proton is related to ClickHouse, and more.

We welcome your contributions! If you are looking for issues to work on, try looking at the issue list.

Please see the wiki for more details, and BUILD.md to compile Timeplus Proton in different platforms.

If you are using Timeplus Proton and would like your company logo displayed on our Home page, please email [email protected] with your request.

Please use GitHub Discussions to share your feedbacks or questions for Timeplus Proton.

For filing bugs, suggesting improvements, or requesting new features, open GitHub Issues.

To connect with Timeplus engineers or inquire about Timeplus Enterprise, join our Timeplus Community Slack.

Proton uses Apache License 2.0. See details in the LICENSE.

Read Entire Article