Products
- Quick Links
  
  Real-Time AI
  
  ScyllaDB powers real-time AI with low latency, high throughput, and billion-scale data.
  Learn More
  
  Is ScyllaDB right for me?
  
  ScyllaDB is purpose-built for data-intensive apps that require high throughput & predictable low latency.
  Learn More
- Products
- Compare
- Related Technologies
Developers
Customers
Resources
- Featured Resource
  
  ScyllaDB University
  
  Level up your skills with our free NoSQL database courses.
  Take a Course
  
  ScyllaDB Blog
  
  Our blog keeps you up to date with recent news about the ScyllaDB NoSQL database and related technologies, success stories and developer how-tos.
  Read the Blog
- Resource Center
- Events
- Compare
Pricing
Contact Us
Chat Now
Get Started
Sign In
Search
Button Links

See all blog posts

Seastar: The future<> is Here

By Alexander Gallego

January 4, 2018

Bluesky Reddit X LinkedIn

Subscribe to Receive Blog Updates

Subscription Categories

All Posts
Events
Integrations
News
Performance
Releases
ScyllaDB
Seastar
Tutorials
User Stories
Developers Blog

Bluesky Reddit X LinkedIn

oreilly-book-ads

This is a cross-post from https://www.alexgallego.org/concurrency/smf/2017/12/16/future.html.

On June 8, 2016, Avi Kivity came to NYC to present ScyllaDB. During his search for a quick open desk to do some work, I volunteered open spaces we had at Concord¹. We talked lock-free algorithms, memory reclamation techniques, threading models, Concord and distributed streaming engines, even C vs C++. Five hours later I was convinced that Seastar was the best systems framework I’d ever come across.

I’ve now been using Seastar for almost two years and I haven’t changed my mind.

The future<> is all about concurrency

For the truly impatient, the future<> is here².

In 1978 news³, T. Hoare prophetically said the future was about computers getting more cores and not increasing in clock speed. In 2004 Herb Sutter coined the same trend as The Free Lunch is Over⁴. The seastar::future<> is a tool to take advantage of multi-core, multi-socket machines – a way to structure your software to grow gracefully with your hardware. There are many other tools that fit this new modality, from lock-free algorithms and to co-routines, to channels3, not to mention actor-style message passing, among many other paradigms like full-on distributed programming languages^{5 6 7 8}.

“Instead of driving clock speeds and straight-line instruction throughput ever higher, they are instead turning en masse to hyperthreading and multicore architectures” – Herb Sutter

seastar::future<>’s are for concurrent software construction. In addition, their design makes them composable. You can take any two futures and chain them together via .then() operator and yield a new future⁹. Although you can combine, mix, map-reduce, filter, chain, fail, complete, generate, fulfill, sleep, expire futures, etc, they are fundamentally about program structure. Such program structure can execute in parallel, but doesn’t have to. When you have concurrent structure, parallelism is a free variable¹⁰. That is to say, you can turn up or down the number of simultaneous execution units/cores/threads without changing your program. In this paradigm, you worry about correct program structure and someone else worries about the execution.

seastar

Seastar is an intrusive building block. Once you start composing Seastar-driven asynchronous building blocks, you have to go out of your way – really – to build anything synchronous, and that’s powerful. Structurally, Seastar has the same effect as actor frameworks like Akka⁵, Orleans⁶, or even languages like Pony⁷ or Erlang⁸ have. Once you have an actor, they spread virally through your system making everything an actor.

Philosophically, actor frameworks and distributed languages differ from Seastar. While the former try to give the programmer higher abstractions and a runtime to hide machine details like IO or CPU scheduling, Seastar takes the opposite approach. It gives you – the wise programmer – abilities to tune and control almost every part of the future<> runtime. This includes IO shares scheduling, CPU shares scheduling, in addition to batteries included approach when it comes to taking advantage of hardware for dealing with filesystems, networking, DMA, etc.

Both approaches, however, are intrinsically safe. The programmer worries about correctness and construction while the frameworks worry about efficient execution. Counter to general wisdom, it is actually faster and more scalable than the synchronous approach. While the machine does more work, it is executing your code simultaneously. This simultaneity is the key to finishing work sooner.

At its core, from the project site, Seastar promises:

Shared-nothing design¹¹
High-performance networking¹²
Futures and promises¹³
Message passing¹⁴

… but it is much more, so let’s get technical and find out how Seastar executes these concurrency building blocks.

Enter Seastar… at your own risk, you might not come back

In a past life, I helped build Concord.io with facebook’s folly::futures¹⁵, and wangle¹⁶ for networking and async execution. While these libraries enabled us to deliver high-performance code using similar primitives, their use of asynchronous operations is not as pervasive as that of Seastar. They are libraries and not frameworks, which is the first distinction. That is, you can use the parts of the libraries that you need without needing to include or use the rest. You can tick your own clocks, your own IOEventLoops, your own CPU Scheduling, your own syscall() thread pool, etc. Seastar, on the contrary, tells you that you have to operate within their framework. It is not possible to take parts of Seastar and use them on your code base without the IO subsystem or the CPU subsystem.

While this decision seems like a disadvantage, it is actually an enforcer of asynchronicity – very much like actors. It is front and center to everything you do. This is a good thing.

No locks, atomics, cache polluting primitives

Seastar takes one extreme approach to data locality. It uses almost no locks, atomics, or in any way implicit memory sharing with other cores. Your view into any application starts with a seastar::distributed<T> type. This means a copy of the T is thread local.

They, of course, cover all the basics for high-performance applications:

Small type optimizations (although seastar::small_set<T> and seastar::small_map<K,V> are missing).
Non thread safe non-polymorphic shared pointer (local to core) via seastar::lw_shared_ptr<T>¹⁷
Non-thread safe polymorphic shared pointer (local to core) via seastar::shared_ptr<T>
String with small type optimizations¹⁸ nor atomics like the libc++¹⁹
Move only bag-o’-bytes²⁰
Circular buffers
Linux DAIO
and many many more!

A mental model

Figure 1: Seastar Mental Model. Everything in Seastar happens in a `thread_local’ (per hyper-thread) with the exception of explicit cross-core communication. As with all mental models, this is a simplification and omits details.

IRL Impact

I’ve been using Seastar for a year and a half on a project called smf²¹ and it has been eye-opening.

smf is a set of libraries and utilities (like boost:: for C++ or guava for java) designed to be the building blocks of your next distributed systems.

Current benchmarks in microseconds make smf’s RPC (Seastar-backed through DPDK) the lowest tail latency system I’ve tested – including gRPC, Thrift, Cap n’Proto, etc. What matters, however, is not that I’ve managed to build a fast RPC, but the fact that doing it with Seastar was no more work than doing the same thing with facebook::folly and facebook::wangle, boost::asio, or libevent.

In addition to the RPC, smf has its own Write Ahead Log (WAL).

It is a write-ahead log modeled after an Apache Kafka-like interface or Apache Pulsar. It has topics, partitions, etc. It is designed to have a single reader/writer per topic/partition.

Current benchmarks in milliseconds ==> 41X faster than Apache Kafka

These massive gains should be expected of many server-side applications.

Want to learn more about Seastar and smf? Please check out my talk from ScyllaDB Summit 2017 or visit the main Seastar and smf page.

References

concord – my previous startup.
future header file.
csp – t hoare.
herb sutter – free lunch is over.
akka – actor framework for the jvm.
orleans – actor framework by Microsoft.
pony – actor language.
erlang – distributed programming lang.
continuations docs.
parallelism is a free variable.
seastar shared nothing.
seastar networking.
seastar promises.
seastar message passing.
facebook’s folly::futures.
facebook wangle.
non-thread-safe shared ptr.
seastar::sstring – string with small type optimization.
std::basic_string.
seastar::temporary_buffer.
smf – the fastest RPC in the west.

Seastar C++Benchmarks smf performance

About Alexander Gallego

View all posts by this author

Previous Post Next Post

Related Posts

Offloading I/O to Dedicated Cores: An Asymmetric io_uring Backend for Seastar and ScyllaDB

ScyllaDB Vector Search Benchmark: 10M Vectors on a Compact Cluster

The Great Stream Fix: Interleaving Writes in Seastar with AI-Powered Invariants Tracing

Start scaling with the world's best high performance NoSQL database.

Product

Product Overview
Architecture
ScyllaDB Enterprise
ScyllaDB Cloud
Vector Search
Benchmarks
Solutions
Release Notes
Install
Tools
Pricing

Resources

Documentation
Online Training
NoSQL Guides
Resource Center
Events
Customer Support
Customer Portal
Community Projects
Blog

Company

About Us
Team
Customers
Partners
News
Media Kit
Careers
Contact Us

Privacy Policy
Terms of Use
Trust Center
Legal Center
Cookie Policy

2026 ©ScyllaDB | ScyllaDB, and ScyllaDB Cloud, are registered trademarks of ScyllaDB, Inc.