Virtual Workshops
Twice-monthly interactive sessions with our NoSQL solution architects.
Join Our Next Session >

Scylla and Apache Spark

Scylla is the highly scalable, high performance NoSQL database that can keep up with the streaming analytics demands of Apache Spark

apache-spark-logo

Scylla is the fastest, most powerful and scalable NoSQL database. Apache Spark is the fastest, most powerful and scalable data analytics framework. Many users who deploy one deploy the other because they are entirely complementary technologies.

Hooking Up Spark and Scylla

Four part blog series providing a primer on how to use Spark and Scylla together: We provide all the open source code in Github for you to try this yourself.

Mastering the Scylla Spark Migrator

spark
+
parquet

The Scylla Spark Migrator is our workhorse engine written in Apache Spark, capable of taking data from multiple sources, including databases such as Apache Cassandra or DynamoDB, or big data file formats like Apache Parquet, and migrating them into Scylla or Scylla Cloud.

Scylla University Mascot

Scylla University

Get started on the path to Scylla expertise.

Live Test CTA

Live Test

Spin up a 3-node Scylla cluster to see our light-speed performance

Virtual Workshop

Interactive sessions with our NoSQL solution architects.