mParticle and ScyllaDB attended The NoSQL & NewSQL Database Meetup at the AWS Loft in NYC. Yuan Ren from mParticle gave a nice presentation that explained their journey from Apache Cassandra to Scylla and how they process 50 billion monthly messages with full availability and performance.
Scylla & Spotinst – A Reliable and powerful combination Spot instances can save you a lot of cash but what if you have a stateful service such as a NoSQL database? The main challenge is that every node in the cluster must sometimes maintain its entire state (IP, Data, and other configurations). This blog describes how a Scylla cluster can be used on AWS’s EC2 Spot without losing consistency with the help of SpotInst’s prediction technology and advanced stateful features. What is Scylla Scylla is an open-source distributed NoSQL database. It was designed to be compatible with Apache Cassandra while […]
To get a better picture of how Apache Cassandra is used, we surveyed 70 expert Apache Cassandra users who actively use Apache Cassandra for in-production workloads. Their workloads represent more than 200 clusters running 600TB of combined data on Apache Cassandra.
I was pleased to sit down with Glauber Costa of ScyllaDB to learn more about his upcoming talk at Scylla Summit 2017. At Scylla Summit 2017, participants will join NoSQL developers and users from start-ups and the enterprise for two days of sharing ideas, hearing innovating use cases, and getting real-life tips and tricks from your peers and NoSQL idols. Let’s begin the interview and learn more about what Glauber will be presenting at the Summit.
Gocqlx is an extension to the Go Scylla / Apache Cassandra driver Gocql. It aims to boost developer productivity while not sacrificing query performance. It’s inspired by Sqlx, a tool for working with SQL databases, but it goes beyond what Sqlx provides. For this blog post, we will pretend we’re a microblogging service and use the following schema: Gocql is a very popular Cassandra driver for the Go programming language. Usually working with it looks more or less like this (source: Gocql README): At first glance, it looks ok but there are some problems: Gocql does not provide you with […]
Visit us at AWS Summit New York on August 14th: Demos, fun games, and cool swag! When: August 14 7:30 AM – 7:00 PM Where: Javits Center Register to attend Going to AWS Summit New York? Be sure to stop by and visit ScyllaDB at booth #214 to see a demo, pick up a t-shirt, and play our fun game to win cool swag, including Scylla the Sea Monster.
Raphael S. Carvalho is a computer programmer here at ScyllaDB who loves open source software and kernel programming. He worked on the Syslinux project to bring new file system support and also worked on MultiFS to allow multiple file systems to co-exist. For his Scylla work, he has been mostly working on SSTable compaction handling and recently developed the support for the Time Window Compaction Strategy on Scylla. This strategy is a considerably better alternative to the DateTieredCompactionStrategy. Raphael has a passion for making products and solutions better with his programming experience. You can learn more about Raphael in his […]
The combination of a database and full-text search analytics becomes unavoidable these days. In this blog post, I will demonstrate a simple way to analyze data from a database with analytics software by using Scylla and Elasticsearch together to perform a simple data mining exercise that gathers data from Twitter. This demonstration will use a series of Docker containers that will run a Scylla and Elasticsearch cluster and a Node.js app that will feed data from Twitter into both platforms. This demo can be run on a laptop or production Docker server. To get started, let’s go over the […]
By default, Scylla SSTables will be compressed when they are written to disk. As mandated by the file format, data is compressed in chunks of a certain size – 4kB if not explicitly set. The size of the chunk is one of the parameters for the compression property to be set at table creation. Chunk-based compression presents trade-offs that users may not be aware of. In this post, I will try to explore what those trade-offs are and how to set them correctly for maximum benefit. As trade-offs imply different results for different loads, we will focus on single-partition read […]
Originally published on The New Stack on July 28th, 2017. Recently AWS unleashed a managed cache solution, Amazon DynamoDB Accelerator (DAX), in front of it’s database. This blog will discuss the pros and cons of external database caches.
Apache®, Apache Cassandra®, are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. No endorsement by The Apache Software Foundation is implied by the use of these marks.