Menu

Products
- Quick Links
  
  Real-Time AI
  
  ScyllaDB powers real-time AI with low latency, high throughput, and billion-scale data.
  Learn More
  
  Is ScyllaDB right for me?
  
  ScyllaDB is purpose-built for data-intensive apps that require high throughput & predictable low latency.
  Learn More
- Products
- Compare
- Related Technologies
Developers
Customers
Resources
- Featured Resource
  
  ScyllaDB University
  
  Level up your skills with our free NoSQL database courses.
  Take a Course
  
  ScyllaDB Blog
  
  Our blog keeps you up to date with recent news about the ScyllaDB NoSQL database and related technologies, success stories and developer how-tos.
  Read the Blog
- Resource Center
- Events
- Compare
Pricing
Contact Us
Chat Now
Get Started
Sign In
Search
Button Links

See all blog posts

Zenly Discusses Going from Elasticsearch to ScyllaDB at ScyllaDB Summit 2017

By Phillip Tribble

October 6, 2017

Bluesky Reddit X LinkedIn

Subscribe to Receive Blog Updates

Subscription Categories

All Posts
Events
Integrations
News
Performance
Releases
ScyllaDB
Seastar
Tutorials
User Stories
Developers Blog

Bluesky Reddit X LinkedIn

Building the infrastructure for a mobile application that reaches millions of users is not an easy task and the folks at Zenly seem to know exactly what they are doing. I enjoy talking to experts in their field to learn how they do things and recently sat down with Jean-Baptiste Dalido of Zenly to learn more about his professional background and upcoming talk at ScyllaDB Summit 2017. Let’s begin.

Please tell us about yourself and what you do at Zenly?

I am the Head of Infrastructure at Zenly, previously Appgratis and Batch.com, so I have almost 10 years of experience in mobile and startup environments. I started with Zenly two and half years ago and built the infrastructure from the ground up. It was quite a new experience compared to my previous jobs because Zenly has such a particular use-case. Instead of having to sustain high traffic on your infrastructure that is easily overcome with cache farms, I was facing a new problem because everything is live. We cannot display an old position, even if it’s one minute old because we might lose the live effect that the app produces for the user, and at the same time we may lose credibility because the product seems to be unusable. The use case for Zenly is by far closer to online video games than any other mobile app with seconds as a metric instead of ms. Other than that, you’ll find me on a motorcycle riding the nice road of France.

How did you get started working with databases?

I started working with heavy loaded databases at Appgratis, handling data for 10 Million Daily Active Users (DAU). It was my first move over to distributed databases on bare metal. We benchmarked at the time multiple solutions from sharded (and non-distributed) Postgres instances, Riak, and of course Apache Cassandra. We went for the last one for one main reason: the amazing distributed network design (multiple clusters over regions), and the overall concepts behind the database. I think Apache Cassandra is the software that made me understand and love distributed systems.

What types of issues or problems did you face when managing massive Apache Cassandra Clusters?

My main issue with Apache Cassandra from over three startups has always been garbage collection (GC). Cassandra is a database that you need to handle cautiously. When you are designing the data model, you cannot do any mistakes on your data distribution. If you do so and one or more of your nodes slowly became unavailable, you’re in for a long night every time. GC is a deal-breaker for a database, the distributed nature of Apache Cassandra and the high number of nodes you’ll deploy might transform it into a relatively small/hidden issue. But most of the clusters you are going to deploy in real life will be small and less than 15 nodes. You need to have so much knowledge of the do’s and don’ts of Apache Cassandra to be proficient and the database itself will become the problem on small clusters and impact your uptime.

What will you be talking about at ScyllaDB Summit 2017?

Zenly’s journey from Elasticsearch to ScyllaDB in production and our experiences eight months after.

Can you please tell me more about your talk?

This talk is going to be an overview of the reasons we chose ScyllaDB, its deployment on Google Cloud, the performances we achieved, and the few hiccups we hit along the way. The real goal of the talk is to make people understand that ScyllaDB is perfectly ready to use in production and can cover so many use cases. The side effect of ScyllaDB’s performances is simple: it’s going to hide the fact you’ve done mistakes and still respond with incredible performances, becoming a human-production-ready usable database. And of course, if you have some experience of tuning your data models with Apache Cassandra and apply the same rules on ScyllaDB, you are entering a brand new world.

How many transactions does ScyllaDB process at Zen.ly?

We are right now handling over 600,000 requests per second on our clusters with very few nodes. Our clusters are split by functions to have a better overview of the load produced and to ensure the high availability of the application per feature. Everything at Zenly is built around the fact that anything can die instantly anywhere on the infrastructure and we apply the same principles for ScyllaDB. Of course, we’ve done quite a few benchmarks and we’ve seen easily a million request per second with around 10 nodes in our clusters.

How do you use Elasticsearch and how does it interact with or compliment ScyllaDB?

Elasticsearch has always been used as the main database at Zenly, handling everything from users to any data related to production. There are no interactions between the two. ScyllaDB completely replaced Elasticsearch as our main database. In fact, ScyllaDB is now the ONLY database in use at Zenly except for the cache which is still running with Redis, but it may not last long. ScyllaDB has proven to be a good choice for so many use cases. We had a discussion not so long ago on why we should not use ScyllaDB for a cache and why Redis was better. Contrary to Apache Cassandra where we had plenty of arguments to say it was crazy and useless but with ScyllaDB, we started thinking it was not a bad choice at all due to the design and the amazing performances we see every day. It’s still an open subject and I’ll be glad to have some discussions about it at the summit.

How can the people get in touch with you?

Feel free to reach out to me on Twitter.

Thank you very much, Jean-Baptiste. We can not wait to see your talk in person and learn more.

ScyllaDB Summit is taking place in San Francisco, CA on October 24-25. Check out the current agenda on our website to learn about the rest of the talks—including technical talks from the ScyllaDB team, the ScyllaDB roadmap, and a hands-on workshop where you’ll learn how to get the most out of your ScyllaDB cluster.

Apache®, Apache Cassandra®, are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. No endorsement by The Apache Software Foundation is implied by the use of these marks.

oreilly-book-ads

Events ScyllaDB Summit

About Phillip Tribble

View all posts by this author

Previous Post Next Post

Related Posts

First Look at the Monster SCALE Summit Agenda

A Tiny Peek at Monster SCALE Summit 2025

ScyllaDB User Tech Talks: Dor Laor’s Takeaways

Start scaling with the world's best high performance NoSQL database.

Product

Product Overview
Architecture
ScyllaDB Enterprise
ScyllaDB Cloud
Vector Search
Benchmarks
Solutions
Release Notes
Install
Tools
Pricing

Resources

Documentation
Online Training
NoSQL Guides
Resource Center
Events
Customer Support
Customer Portal
Community Projects
Blog

Company

About Us
Team
Customers
Partners
News
Media Kit
Careers
Contact Us

Privacy Policy
Terms of Use
Trust Center
Legal Center
Cookie Policy

2026 ©ScyllaDB | ScyllaDB, and ScyllaDB Cloud, are registered trademarks of ScyllaDB, Inc.