Indexing is a useful tool that provides more types of queries on your tables. In principle, columns we wish to be queryable should be declared when the table is created, as part of a table’s primary key. Secondary Indexing is a neat way of making other columns queryable, but it comes with a cost of additional storage space and processing power to maintain the secondary index data coherent with the primary index information.
In practical terms, how do you begin to plan your ultra-efficient Scylla topology? The shift can be intimidating for those coming from a background in Cassandra, or for those coming from a primary-replica architecture, or from a plain old monolithic database implementation. In this post we’ll layout a checklist for designing a Scylla cluster that incorporates both horizontal and vertical scaling.
The I3en family is a class of instances clearly targeted at storage-intensive workloads, but CPUs are still needed to process that data. The largest of the I3en, i3en.24xlarge, ships with 48 cores of the Xeon Platinum 8175M, clocking at 2.50GHz.
In this post we introduce the new Scylla workload prioritization mechanism, explaining the vision behind developing this feature and how it is implemented, and most importantly, we show you test results of how it performs in a real-world setting.
In this article, we will compare Scylla Cloud and Google Cloud Bigtable. We show that Scylla Cloud is one-fifth the cost of Cloud Bigtable under optimal conditions (perfect uniform distribution) and that when applied to Zipfian distribution, the difference grows to 25x.
So you heard about Scylla and its superior performance. Maybe you have experience with Apache Cassandra, and are wondering what parts of that experience will you reuse and what you may have to learn anew. Or maybe you’re coming from a totally different background and want to know how to make Scylla fit best into your application environment. In this article we will cover in detail ten basic principles that help users succeed with Scylla. Some of them are also applicable to Apache Cassandra, and some stand in contrast to Cassandra recommendations. Free your mind, and read on! 1. Monitor […]
Scylla Open Source 3.0 ships with a new format for on-disk representation, SSTable 3.0. In this article, we will discuss some of the benefits that emerge from the adoption of this format and the scenarios in which they apply. We will discuss the differences between the old and new formats, and demonstrate use cases in which the new format has significant advantages, and others where the advantages are much smaller. This is truly a situation of “Your Mileage May Vary.” For example, in one test result below, we were able to show a 53% reduction in table size. Other use […]
Scylla Open Source 3.0 is a landmark release for ScyllaDB: Materialized Views and Secondary Indexes are production-ready, and Scylla Open Source 3.0 can now read and write the Cassandra 3.x SSTable (“mc”) format. In addition, Scylla Open Source 3.0 provides a variety of performance improvements to existing functionality. In this article we will explore the nature of a pair of those performance improvements, and the scenarios in which Scylla users can expect to see a significant performance gain. Streaming When one Scylla node needs to transfer data to another, it undertakes a process called streaming. This happens when a new […]
“And now for our main event! Ladies and gentlemen, in this corner, weighing in at 34% of the cloud infrastructure market, the reigning champion and leader of the public cloud…. Amazon!” Amazon has unparalleled expertise at maximizing scalability and availability for a vast array of customers using a plethora of software products. While Amazon offers software products like DynamoDB, it’s database-as-a-service is only one of their many offerings. “In the other corner is today’s challenger — young, lightning quick and boasting low-level Big Data expertise… ScyllaDB!” Unlike Amazon, our company focuses exclusively on creating the best database for distributed data […]
Scylla 2.3 was just recently released, and you can read more about that here. Aside from the many interesting feature developments like improved support for materialized views and hardware enablement like native support for AWS i3.metal baremetal instance, Scylla 2.3 also delivers even more performance improvements on top of our already industry-leading performance. Most of the performance improvements center around three pillars: Improved CPU scheduling, with more work being tagged and isolated The result of a diligent search for latency-inducing events, known as reactor stalls, particularly in the Scylla cache and in the process of writing SSTables A new, redesigned […]