ScyllaDB’s Glauber Costa explores cgroups and systemd, and how these can be used to define slices which can be used to protect database performance.
This article presents the extensions done to the Scylla Migrator to also support data movement between an existing DynamoDB installation and Scylla.
About Scylla’s Alternator Project Alternator is an open source project that gives Scylla compatibility with Amazon DynamoDB™. Our goal is that any application written for Amazon DynamoDB could be run, unmodified, against Scylla with Alternator enabled. Originally, Scylla began as a re-implementation of Apache Cassandra, and it has since proven to be a solid database engine with key performance and TCO benefits over Cassandra. However, we always considered Cassandra to be just a starting point. Now a 5-year old project, Scylla is able to scale to hundreds of machines, petabytes of data and many regions and availability zones. Scylla can […]
In this article we will explore one IoT/time-series classical scenario in which knowledge of how the cache operates can mean the difference between a fully cached workload that will be fast, and a fully storage-bound workload that will of course perform much worse.
In our latest Summer Tech Talks series webinar ScyllaDB Field Engineer Juliana Oliveira guided virtual attendees through a series of best practices on data modeling for Scylla.
Repair is one of several anti-entropy mechanisms in Scylla. It is used to synchronize data across replicas. In this post, we introduce a new repair algorithm coming with Scylla Open Source 3.1 that improves performance by operating at the row-level, rather than across entire partitions.
Holden Karau is an open source developer advocate at Google. In her talk, Holden provided an overview of Spark, how it can fail and, based on those different failures, she outlined a number of strategies for how pipelines can be recovered.
With continued and growing interest in Apache Spark, we had two speakers present at Scylla Summit 2018 on the topic. This is the first of a two-part article, covering the talk by ScyllaDB’s Eyal Gutkind. The second part covers the talk by Google’s Holden Karau. With business demanding more actionable insight and ROI out of their big data, it is no surprise that analytics are a common workload on Scylla. Nor is it a surprise that Spark is a perennial favorite on the Scylla Summit agenda, and our annual gathering last year proved to be no exception. The focus was […]
In this post we introduce the new Scylla workload prioritization mechanism, explaining the vision behind developing this feature and how it is implemented, and most importantly, we show you test results of how it performs in a real-world setting.
Anyone who’s tried to build such a solution knows that one of the chief difficulties is encompassing the sheer number and complexity of existing data sources. In order to deliver a true solution, we need to be able to bring this disparate data together. A graph data system, built with JanusGraph and backed by the power of Scylla, is a great fit for solving this problem.