Menu

Products
- Quick Links
  
  Real-Time AI
  
  ScyllaDB powers real-time AI with low latency, high throughput, and billion-scale data.
  Learn More
  
  Is ScyllaDB right for me?
  
  ScyllaDB is purpose-built for data-intensive apps that require high throughput & predictable low latency.
  Learn More
- Products
- Compare
- Related Technologies
Developers
Customers
Resources
- Featured Resource
  
  ScyllaDB University
  
  Level up your skills with our free NoSQL database courses.
  Take a Course
  
  ScyllaDB Blog
  
  Our blog keeps you up to date with recent news about the ScyllaDB NoSQL database and related technologies, success stories and developer how-tos.
  Read the Blog
- Resource Center
- Events
- Compare
Pricing
Contact Us
Chat Now
Get Started
Sign In
Search
Button Links

See all blog posts

Making the Move: Migrating to ScyllaDB

By Eyal Gutkind

December 19, 2017

Bluesky Reddit X LinkedIn

Subscribe to Receive Blog Updates

Subscription Categories

All Posts
Events
Integrations
News
Performance
Releases
ScyllaDB
Seastar
Tutorials
User Stories
Developers Blog

Bluesky Reddit X LinkedIn

migrate

Update: We now have a full masterclass on NoSQL migration. You can watch it (free + on-demand) here: NoSQL Data Migration Masterclass. And we also have a new blog on the ScyllaDB Spark Migrator.

For the past two years, we have helped users build fast, resilient, and stable applications with ScyllaDB, an enterprise-grade database solution. During these two years, our early adopters migrated from a variety of database solutions, and while most of the migrations we successfully completed were Apache Cassandra (enterprise and open-source versions), we have seen users migrate from MongoDB, HBase, relational systems such as MySQL and Postgres, and key/value stores like Memcache and Redis.

Migration strategies differ between users and systems. In general, we can divide Apache Cassandra-to-ScyllaDB migrations into two main strategies, cold migration and hot migration.

Cold Migration

During the cold migration process, neither the legacy system nor the ScyllaDB system is operational. The cold migration strategy is easier on the operators as it lets the team stop the legacy database system at a point in time and restart it at the same point in time with the new ScyllaDB system. However, users are not able to use the system during the migration process which is a constraint very few organizations are willing to accept.

Hot Migration

The second strategy is a hot migration. Contrary to cold migration, in hot migration, both the legacy and ScyllaDB deployment are fully operational during the migration process. We describe the required steps for a hot migration in the following document.

What about other database migrations?

For a document-based solution, we work with the user to “serialize” the data, for example, converting a JSON data entry to a columnar one. The following is a simple example of one of the conversions, in which a user profile is converted from a document model to a columnar one.

The next model is a ScyllaDB data model to hold the same address information. We will need to create a user-defined type for addresses:

Create the table:

And insert the data:

Migrations from a different database architecture

Obviously, migrations from a different database architecture such as relational, require more attention to data models and data retrieval patterns. For example, some databases offer joins and aggregation functions. For joins, we recommend users to denormalize their data model and consider the queries deployed by the application. Here is an example of denormalizing a relational database model.

The relational data model and query look like the following:

Here is our sample data after insertion:

player_id	name
1	Cristiano Ronaldo
2	Lionel Messi
3	David Beckham

Game	Against	Goals	Game Type	Player
1	F.C. Barcelona	1	away	1
2	Atletico Madrid	2	home	1
3	F.C. Valencia	1	home	1
4	Malaga	2	away	3
5	Sevillia	1	away	2
6	Real Madrid	1	away	2

The following is a query we can deploy in a relational database scenario:

The outcome of the query will be:

Game	Against	Goals	Game Type	Player
2	Atletico Madrid	2	home	1
3	F.C. Valencia	1	home	1

The query retrieves, based on a player name, the games in which he or she scored, and whether it was a home or an away game.

The following example is one of the options to store the needed information in ScyllaDB and enable the query. Please note that arbitrary where clauses are not implemented with ScyllaDB. With Materialized Views, users can create different views of a table and query the information should they need a less granular access. You can read more about ScyllaDB’s implementation of Materialized Views here.

Here is the complete games info:

And the query to learn how Cristiano Ronaldo did for his home games:

For aggregation functions, some functions are available today in ScyllaDB, while others are in development. We also recommend using additional tools such as Spark or Presto.

As we demonstrated above, it is possible to migrate from different databases to ScyllaDB. In our recent user conference, we presented a talk about database migration. You can see the presentation and the slides on our website.

Also, check out our migration documentation, and if you are ready to get started, contact us to learn about the professional services we offer to help with your migration.

oreilly-book-ads

deployment Apache Cassandra Migration

About Eyal Gutkind

Eyal Gutkind is a solution architect for ScyllaDB. Prior to ScyllaDB Eyal held product management roles at Mirantis and DataStax. Prior to DataStax Eyal spent 12 years with Mellanox Technologies in various engineering management and product marketing roles.Eyal holds a BSc. degree in Electrical and Computer Engineering from Ben Gurion University, Israel and MBA from Fuqua School of Business at Duke University, North Carolina.

View all posts by this author

Previous Post Next Post

Related Posts

Apache Cassandra Performance Tuning: What We Learned

Scaling Performance Comparison: ScyllaDB Tablets vs Cassandra vNodes

ScyllaDB: Not Just a Faster Apache Cassandra

Start scaling with the world's best high performance NoSQL database.

Product

Product Overview
Architecture
ScyllaDB Enterprise
ScyllaDB Cloud
Vector Search
Benchmarks
Solutions
Release Notes
Install
Tools
Pricing

Resources

Documentation
Online Training
NoSQL Guides
Resource Center
Events
Customer Support
Customer Portal
Community Projects
Blog

Company

About Us
Team
Customers
Partners
News
Media Kit
Careers
Contact Us

Privacy Policy
Terms of Use
Trust Center
Legal Center
Cookie Policy

2026 ©ScyllaDB | ScyllaDB, and ScyllaDB Cloud, are registered trademarks of ScyllaDB, Inc.