May10

Faster and better: What to expect running Scylla on AWS i3 instances

Amazon recently unveiled a new class of machines—the AWS i3 family. Targeted at I/O intensive applications and featuring up to 15TB of fast storage, these machines offer unprecedented power with a great balance between I/O and CPU. At a lower price than the previous i2 family, we expect the i3 family to become the default class for NoSQL workloads.

This article will cover i3 instances and provide information about the status of Scylla support for the hardware. Although we don’t yet officially provide i3 AMIs, customers are already running them in production with positive results. Scylla’s native architecture takes advantage of the vast resources available on i3 instances. 

We also invite you to register for our upcoming webinar, ‘How to Monitor and Size Different Workloads in AWS i3 Instances’ to learn more and see a live demo with a dashboard featuring an i3 cluster with different data models and workloads.

Need help and guidance setting your Scylla system to run on i3 instances? You can join our Slack channel and ask for help!

Disk I/O

The i3 family brings Non-Volatile Memory Express (NVMe) drives as local ephemeral storage. The performance of these drives is among the best in the public cloud offering claiming up to 3.3 million IOPS at a 4 KB block and up to 16 GB/second of sequential disk throughput.

Network

The i3.16xlarge instance, the larger instance in the i3 family, claims to have up to 20Gbps of network throughput and a decent number of network queues. However, Kernel support is needed to take advantage of the large throughput and network queues. The standard CentOS kernel that ships with Scylla AMIs does not support the i3.16xlarge instance’s network devices, so the kernel has to be replaced with one of the official kernels provided by AWS.

CPU

The i3 family is equipped with powerful Intel processors. This allows up to 64 vCPUs per instance, compared to the largest legacy of i2 family, which allows only 32 vCPUs. Scylla’s shard-per-core, lockless architecture takes advantage of every vCPU, resulting in a much higher overall performance. During the recent AWS Summit in San Francisco, we demonstrated a simple key-value schema yielding more than one million operations per second in a single server—1,087,000 to be precise—with i3 instances (see Figure 1), using 100% of all vCPUs.

Monitor in AWS i3 Instance
Figure 1: Demo dashboard showing Scylla executing north of one million operations a second (1.087M) in a single server for a simple key-value schema. The instance is i3.16xlarge.

Cost

As described above, Scylla deployments on i3 instances will benefit from the I/O, CPU and Memory abundance. The benefits translate to bottom-line margins. The i3 instance family is cheaper on a per-instance comparison by more than 50% from its predecessor i2 family. Users can reduce their cost of operations by more than 75%, taking advantage of the lower cost and better efficiency Scylla brings to the database deployments.  

Scylla on i3 hardware

Users running i3 in production are fully supported, but our automatic configuration will require manual tuning. Scylla’s upcoming 1.8 version, scheduled for release this summer, will fully support i3 instances out of the box

A kernel change is necessary for users who want to run i3 in production before the 1.8 release and are using ScyllaDB-provided AMIs. The CentOS default kernel does not properly support i3’s disks and network cards, and newer AMIs will use the AWS official kernel instead. The network interrupt affinities need to be manually configured, and also the Scylla I/O configuration may have to be manually tweaked.

Ready to start running i3 instances even sooner? Our preview AMIs automatically apply the needed changes. Get them now in these selected regions:

  • us-west-1: ami-78173118
  • us-east-1: ami-3f295c29

What’s next?

The i3 family of instances is still in early stages and Scylla won’t fully support them until the 1.8 release this summer. Scylla’s currently released versions require manual tuning and configuration to run properly in those boxes. However, once the tuning is done, the performance results are significant.

We are presenting a webinar, ‘How to Monitor and Size Workloads on AWS i3 Instances’ on May 18th at 10:00 AM Pacific. Register now so you can join us to learn how to ensure Scylla fully leverages the great resources in the i3 family and effectively navigate the Scylla monitoring system and identify bottlenecks. You’ll also see a live demonstration with a dashboard featuring an i3 cluster with different data models and workloads.

Eyal GutkindAbout Eyal Gutkind

Eyal Gutkind is a solution architect for Scylla. Prior to Scylla Eyal held product management roles at Mirantis and DataStax. Prior to DataStax Eyal spent 12 years with Mellanox Technologies in various engineering management and product marketing roles.Eyal holds a BSc. degree in Electrical and Computer Engineering from Ben Gurion University, Israel and MBA from Fuqua School of Business at Duke University, North Carolina.

Glauber CostaAbout Glauber Costa

Glauber Costa (Lord Glauber I of Sealand) is a Principal Architect at
ScyllaDB. He shares his time between the engineering department
working on upcoming Scylla features and helping customers succeed.

Before ScyllaDB, Glauber worked with Virtualization in the Linux
Kernel for 10 years, with contributions ranging from the Xen
Hypervisor to all sorts of guest functionality and containers.

Tags: AMIs, AWS, hardware, i3, monitoring