ScyllaDB Open Source

ScyllaDB Open Source has a rich set of new production-ready features, including Lightweight Transactions (LWT), DynamoDB API compatibility, Change Data Capture, offline installers and more.
Already the best high-performance NoSQL database for big data workloads, ScyllaDB builds upon the best attributes of Apache Cassandra and Amazon DynamoDB by improving performance, scalability, and cost-efficiency.

Note: ScyllaDB Open Source 5.X is also available and users are encouraged to upgrade to it.

DISCOVER SCYLLADB OPEN SOURCE 5.X

What’s New in ScyllaDB Open Source 4.X

Production Ready Features

AWS Graviton2 / Arm Support (4.6+)

ScyllaDB has been compiled and tested to run on Arm-based architectures to support a range of new instance types on AWS. For example, the Im4gn and Is4gen storage-optimized instances use Graviton2 processors and are now ready for production workloads. Meanwhile the low cost T4g burstable instance is appropriate for development of cloud-based applications. Since ScyllaDB is now compiled to run on any Aarch64 architecture, you can even run it on an Arm-based M1-powered Macintosh using Docker for local development.

Repair Base Node Operations (4.6+)

Repair-Based Node Operations (RBNO) use the same underlying implementation for repair and node operations such as bootstrap, decommission, removenode, and replace. Row-based repair is oriented towards small amounts of data, not an entire node’s worth. This resulted in smaller, more atomic updates that allowed users to begin, pause and resume operations. It also uses offstrategy compaction to compact SSTables efficiently without impacting the main workload.

Learn More

Service Level Properties (4.6+)

Service Levels allows the user to attach attributes to Rules and Users. These attributes apply to each session the user opens, enabling granular control of the session properties, like time out and shedding (overload handling). ScyllaDB Open Source supports two main properties: per service level timeouts and workload types.

Service Level Timeouts are useful when some workloads, like ETL, are less sensitive to latency than others.
Workload types provide more context for ScyllaDB to decide how to handle the sessions. For instance, if a coordinator node receives requests with a rate higher than it can handle, it will make different decisions depending on the declared workload type.

Reverse Queries Improvements (4.6+)

A reverse query is a query SELECT that uses a reverse order compared to the one used in the table schema. If no order was defined, the default order is ascending (ASC). Improving the performance of reverse query is an ongoing process, with the following updates in ScyllaDB 4.6:

The internal layer for managing queries now supports reversed queries natively. This lays the groundwork for reversed reads in memtables, cache, and SSTables, so that reversed queries will perform efficiently.
A new SSTable reader can now read partitions in reversed order. This is a step towards supporting reversed reads of large partitions.
Memtables now efficiently support reversed reads, which makes reversed reads with BYPASS CACHE now more efficient, especially with memory consumption.

Guardrails (4.6+)

ScyllaDB is a very powerful tool, with many features and options. In many cases, these options, such as experimental or performance-impacting features, or a combination of them, are not recommended to run in production. Guardrails are a collection of reservations that make it harder for the user to use non-recommended options in production. A few examples:

Prevent users from using SimpleReplicationStrategy.
Warn or prevent usage of DateTieredCompactionStrategy, which has long since been deprecated in favor of TimeWindowCompactionStrategy.
Disable Thrift, a legacy interface, by default
Ensure that all nodes use the same snitch mode

ScyllaDB administrators can use our default settings or customize guardrails for their own environment and needs.

Improvements to Alternator (4.5+)

The Amazon DynamoDB-compatible interface has been updated to include a number of new features:

Support for Cross-Origin Resource Sharing (CORS)
Fully supports nested attribute paths
Support attribute paths in ConditionExpression, FilterExpression Support, and ProjectionExpression

Timeout per Operation (4.4+)

There is now new syntax for setting timeouts for individual queries with “USING TIMEOUT”. The new Timeout per Operation allows you to define the timeout in a more granular way. Conversely, some queries might have tight latency requirements, in which case it makes sense to set their timeout to a small value. Such queries would get time out faster, which means that they won’t needlessly hold the server’s resources. You can use the new TIMEOUT parameters for both queries (SELECT) and updates (INSERT, UPDATE, DELETE).

I/O Scheduler 2.0 (4.4+)

The Seastar I/O scheduler is used to maximize the requests throughput from all shards to the storage. Till now, the scheduler was running in a per shard scope: each shard runs its own scheduler, balanced between its I/O tasks, like reads, updates and compactions. This works well when the workload between shards is approximately balanced; but when, as often happened, one shard was more loaded, it could not take more I/O, even if other shards were not using their share. I/O scheduler 2.0 included in ScyllaDB 4.4 fixes this. As storage bandwidth and IOPS are shared, each shard can use the whole disk if required.

Change Data Capture (CDC) (4.3+)

This feature allows you to track changes made to a base table in your database for visibility or auditing. Changes made to the base table are stored in a separate table that can be queried by standard CQL. Our CDC implementation uses a configurable Time To Live (TTL) to ensure it does not occupy an inordinate amount of your disk.

Learn More

Binary Search in SSTable Promoted Index (4.2+)

Prior to ScyllaDB Open Source 4.2, lookups in the promoted index were done by scanning the index linearly, so that the lookup took O(n) time. This is inefficient for large partitions, consuming a great deal of CPU and I/O. Now the reader scans the SSTable promoted index with a binary search, reducing search time to O(log n). In our testing, searches were conducted 12x faster, CPU utilization was only 1/10th (10%), and disk I/O was reduced to 1/20th (5%) the prior rate.

DynamoDB API Compatibility (Project Alternator) (4.0+)

Our Amazon DynamoDB API implementation, known as Project Alternator, enables you to connect applications/services built for the DynamoDB API to ScyllaDB without changing your client code. This gives your team multi-cloud, multi-vendor flexibility to improve system resilience and include a disaster recovery plan in your playbook.

Learn More

Lightweight Transactions (LWT) (4.0+)

ScyllaDB LWTs allow stronger consistency guarantees using the Paxos consensus algorithm. They ensure requests on a distributed database are processed in a strict, linearized (serial) method, in a process known as ‘Compare and Set.’. They are also called ‘Conditional Updates,’ because they can test the databases’ existing values before submitting the update. This provides atomic consistency for single keys, allowing updates to be performed in order on a global basis. They can also be used for batches, to ensure all conditions are met before submitting a batch update.

Learn More

ScyllaDB Operator

Kubernetes has become the go-to technology for the cloud devops community. It allows for the automated provisioning of cloud-native containerized applications, supporting the entire lifecycle of deployment, scaling, and management. ScyllaDB Operator is our extension to enable the management of ScyllaDB clusters. It currently supports deploying multi-zone clusters, scaling up or adding new racks, scaling down, and monitoring ScyllaDB clusters with Prometheus and Grafana.

Learn More

Experimental Features

Alternator Streams (4.3+)

Based on CDC, the following DynamoDB API stream operations are now supported in ScyllaDB:

DescribeStream
GetRecords
GetShardIterator
ListStreams

Learn More

User Defined Functions (UDF) in Lua (3.3+)

User Defined Functions (UDF) in Lua adds the ability for teams to build server-side scripts that can run complex transforms such as aggregations, sums, averages, minimums, and maximums. This allows developers to simplify their database queries and the use of multiple large queries with large payloads.

Learn More

New with ScyllaDB Open Source 3.x

See what we introduced previous to ScyllaDB 4.0. Read More

Resources

Introducing ScyllaDB Open Source 4.0

Learn the details about ScyllaDB’s newest features and capabilities. Get the most out of our highly performant open source NoSQL database.

Learn More

Introducing ScyllaDB 4.0

Join our co-founders, CEO Dor Laor and CTO Avi Kivity for an overview of ScyllaDB 4.0.

Learn More

Alternator: Open Source DynamoDB-compatible API

Our open source Amazon DynamoDB-compatible API allows you to run your database on any cloud or on premises.

Learn More

Lightweight Transactions in ScyllaDB versus Apache Cassandra

How to use lightweight transactions in ScyllaDB; learn the similarities and differences between ScyllaDB’s Paxos implementation and Cassandra’s.

Learn More

Change Data Capture - Track NoSQL Data updates in real time

CDC enables the user to track updates to tables in real time. Learn how they work under the hood, and why they are a vast improvement over Cassandra.

Learn More

Kubernetes Operator

Take the new lesson on how to deploy and manage ScyllaDB using the new Kubernetes Operator. Learn how to scale up and scale down clusters and use stateful sets effectively.

Learn More

ScyllaDB University

Get started on the path to ScyllaDB expertise

ScyllaDB Cloud

It’s easy to get started with our NoSQL DBaaS

Apache® and Apache Cassandra® are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Amazon DynamoDB® and Dynamo Accelerator® are trademarks of Amazon.com, Inc. No endorsements by The Apache Software Foundation or Amazon.com, Inc. are implied by the use of these marks.

Why ScyllaDB?

Is ScyllaDB right for me?

ScyllaDB University

ScyllaDB Blog

ScyllaDB Open Source

What’s New in ScyllaDB Open Source 4.X

Production Ready Features

AWS Graviton2 / Arm Support (4.6+)

Repair Base Node Operations (4.6+)

Service Level Properties (4.6+)

Reverse Queries Improvements (4.6+)

Guardrails (4.6+)

Improvements to Alternator (4.5+)

Timeout per Operation (4.4+)

I/O Scheduler 2.0 (4.4+)

Change Data Capture (CDC) (4.3+)

Binary Search in SSTable Promoted Index (4.2+)

DynamoDB API Compatibility (Project Alternator) (4.0+)

Lightweight Transactions (LWT) (4.0+)

ScyllaDB Operator

Experimental Features

Alternator Streams (4.3+)

User Defined Functions (UDF) in Lua (3.3+)

New with ScyllaDB Open Source 3.x

Resources

ScyllaDB University

ScyllaDB Cloud

Start scaling with the world's best high performance NoSQL database.