We came up with a new compaction approach, named Incremental Compaction, that considerably reduces space overhead with a hybrid technique that combines properties from both Size-Tiered and Leveled compaction strategies. It is exclusively available in newer Scylla Enterprise releases (2019.1.4 and above).
Today we are releasing a new data integrity testing suite for the open source community. Those who will have the most direct utility for this software will be those testing Scylla and Cassandra databases, or, more broadly, other CQL-compliant databases.
For you to get the most out of your big data applications, let’s explore the effects of concurrency in distributed databases and provide you with tools to correctly configure your infrastructure for maximum performance, including client-side parallelism and timeout settings.
In the first part of this blog we’ve learned a bit about compression theory and how some of the compression algorithms work. In this part we focus on practice, testing how the different algorithms supported in Scylla perform in terms of compression ratios and speeds.
In this two-part blog we’ll focus on the problem of storing as much information as we can in the least amount of space as possible. This first part will deal with the basics of compression theory and implementations in Scylla.
A question we keep getting from customers is: “How can I monitor Scylla with Datadog?” And more generally “How do you integrate Datadog and Prometheus?” Find out how to enable this on your Scylla cluster.
ScyllaDB’s Glauber Costa explores cgroups and systemd, and how these can be used to define slices which can be used to protect database performance.
This article presents the extensions done to the Scylla Migrator to also support data movement between an existing DynamoDB installation and Scylla.
About Scylla’s Alternator Project Alternator is an open source project that gives Scylla compatibility with Amazon DynamoDB™. Our goal is that any application written for Amazon DynamoDB could be run, unmodified, against Scylla with Alternator enabled. Originally, Scylla began as a re-implementation of Apache Cassandra, and it has since proven to be a solid database engine with key performance and TCO benefits over Cassandra. However, we always considered Cassandra to be just a starting point. Now a 5-year old project, Scylla is able to scale to hundreds of machines, petabytes of data and many regions and availability zones. Scylla can […]
In this article we will explore one IoT/time-series classical scenario in which knowledge of how the cache operates can mean the difference between a fully cached workload that will be fast, and a fully storage-bound workload that will of course perform much worse.