Shard-Per-Core Architecture

ScyllaDB runs one shard (thread) per CPU core with isolated memory and async I/O. This shared nothing architecture eliminates locking, providing linear scalability as you add cores/ or nodes.

Userspace I/O Schedulers

Userspace schedulers adjust I/O and CPU dispatch to maximize hardware utilization. This prevents background tasks from starving latency-sensitive queries.

ScyllaDB caches hot data directly in memory on each CPU core. It self-tunes based on access patterns and memory pressure to keep reads fast as workloads change.

Elastic Scaling Tablets

Tablets partition data into small chunks that dynamically rebalance across the cluster. This provides elastic scaling – helping you handle peaks with lower infrastructure costs.

Workload Prioritization

ScyllaDB lets you control how your workloads compete for system resources. This ensures latency-sensitive queries are fast, even with other heavy workloads running on the same cluster.

How is ScyllaDB different from other NoSQL databases (for example, Cassandra or DynamoDB)?

ScyllaDB was designed for efficiency, with the goal of delivering predictable low tail latency at scale. ScyllaDB’s close to the hardware design leverages a shard-per-core architecture and autotuning capabilities to maximize hardware utilization. This efficiency also translates to lower cost: less hardware is required to run similar workloads on ScyllaDB than with Cassandra or DynamoDB. Also, ScyllaDB runs anywhere; it’s available as a Database as a Service and can be deployed on any public cloud, or even on-premises. See NoSQL database comparisons and benchmarks.

Can ScyllaDB be used to replace a cache?

ScyllaDB’s design includes a built-in efficient cache layer . This internal cache ensures predictable low latency for workloads at scale. For this reason, many teams have replaced caching layers (e.g., Redis, ElasticCache, DAX) with ScyllaDB. In DynamoDB cases, ScyllaDB can replace the underlying database (through its support of the DynamoDB-compatible API) as well as replace the cache associated with it. ScyllaDB’s design helps reduce overall costs by making efficient use of infrastructure, as well as running anywhere. Why teams are replacing their external cache with ScyllaDB.

What workloads are the best fit for ScyllaDB?

ScyllaDB is designed for low latency at scale, including flexible scaling to meet growing needs. Workloads that have thousands to millions of operations per second, as well as multiple terabytes or petabytes of data, will get the greatest benefits from ScyllaDB. ScyllaDB is designed for applications that work with semi-structured or structured data and query that data with known/predictable patterns. High cardinality with evenly distributed access patterns is also helpful. Is ScyllaDB a good fit.

What makes ScyllaDB fast?

ScyllaDB’s performance-focused design relies on its shard-per-core architecture , enabling efficient CPU utilization with a shared-nothing approach. Seastar, the framework ScyllaDB is built upon, allows for maximizing concurrency and reducing the latency of operations. ScyllaDB also bypasses OS-level memory management, performing direct I/O operations and leveraging its internal cache. Since ScyllaDB is written in C++, it reduces the complexity usually associated with tuning JVM parameters and avoids Java’s garbage collection pauses. What Makes ScyllaDB So Fast?

What are the tradeoffs of using ScyllaDB vs other databases?

ScyllaDB is a NoSQL database designed for high performance, high throughput, and predictable low latency. Some of the tradeoffs are the lack of traditional relational database functionalities, such as arbitrary joins and ad-hoc querying. It does provide linearizable, single-partition transactions via lightweight transactions (Paxos) today—and Raft-based strong consistency for metadata (and soon for user tables)—but it does not yet support full ACID transactions spanning multiple partitions or tables. ScyllaDB also requires careful data modeling to ensure data is distributed according to its access patterns. On the other hand, its performance and scalability make it ideal for workloads that require predictable performance at scale. Read about database performance tradeoffs.

How does ScyllaDB store and distribute data (sharding, replication, architecture basics)?

ScyllaDB distributes data using a shard-per-core architecture . It attributes partitions to CPU virtual cores for efficient parallelism and contention reduction. By leveraging consistent hashing for data distribution, it ensures the load is evenly distributed across the cluster and simplifies scalability. It replicates data across nodes distributed across multiple availability zones, which provides fault tolerance and high availability on the cloud. Additionally, ScyllaDB tablets allow rapid scaling of clusters in response to traffic spikes and increasing demand. ScyllaDB Architecture Overview.

Is ScyllaDB free? Is ScyllaDB Open Source?

ScyllaDB is source available. You can access the full ScyllaDB Enterprise feature set for free, up to 10 TB of total storage and 50 vCPUs across all clusters. This includes community support only.

The Real-Time Database for AI

Serving 5 Million Features per Second

Tripadvisor uses ScyllaDB on AWS to power real-time ML personalization. At peak, they handle ~500K ops/sec with P99 latencies of 1-3 ms. Their feature store serves up to 5 million static features/sec and 0.5 million user features/sec.

Rebuilding AI Platform for 10X Growth

Freshworks uses ScyllaDB to power its AI-driven data platform (e.g., feature stores, model training caching layers, workflow automation, and customer data services). After migrating to ScyllaDB, they hit single-digit P99.99 latency and reduced storage 50%.

Recommendations for 325M Users

ShareChat uses ScyllaDB as the backbone of its ML feature store that recommends fresh social media content in 15+ different languages. They scaled from 1M to 1B features per second with 10X cost reduction and P99 latencies from 10-20ms.

Streaming with 5X Cost Reduction

ZEE5 uses ScyllaDB to power “Continue Watching” and recommendations across 190 countries. The platform maintains single-digit millisecond P99 latency while processing 1M+ concurrent requests and 1TB of daily state changes.

Real-Time AI

Is ScyllaDB right for me?

ScyllaDB University

ScyllaDB Blog

Scale Predictably

Extreme Scale. Extreme Performance.

Engineered for Efficiency

Operations Per Second

Of Vector Embeddings

Predictable P99 Latency

Why Teams Choose ScyllaDB

High Throughput

Low Latency

Lower Cost

Scalability

AI at Scale

API Compatibility

High Availability

No Babysitting

Deployment Flexibility

ScyllaDB is a strong fit if you:

ScyllaDB’s Sweet Spot

High-Performance NoSQL

Low Latency

High Throughput

Real-Time Vector Search

Fast Queries

Massive Scale

Designed for Modern Hardware

ScyllaDB Enterprise

ScyllaDB X Cloud

Customer Success with Massive Scale

Serving 5 Million Features per Second

Rebuilding AI Platform for 10X Growth

Recommendations for 325M Users

Streaming with 5X Cost Reduction

Ready to Get Started?

Join thousands of customers already scaling their AI pipelines with ScyllaDB

Trending Resources for ScyllaDB at Scale

Frequently Asked Questions