Alexys Jacob, CTO, Numberly
24:38
December 20, 2018
Many organizations struggle to balance traditional big data infrastructure with NoSQL databases. Other organizations do the smart thing and consolidate the two. This presentation explores Numberly’s experience migrating an intensive, join-hungry production workload from MongoDB and Hive to ScyllaDB. Using ScyllaDB, we were able to accommodate a join of billions of rows in seconds, while also dramatically reducing operational and development complexity by using a single database for our hybrid analytical use case. As a bonus, we’ll cover benchmarks for Dask (a flexible parallel computing library for analytic computing) and Spark, highlighting their differences and the lessons we learned along the way.
Holden Karau, Developer Advocate, Google
23:58
Glauber Costa, Principal Architect, ScyllaDB
27:01
Takahiro Iwase, Engineer, Yahoo! Japan
Murukesh Mohanan, DevOps Engineer, Yahoo! Japan
17:56
Eyal Gutkind, Head of Solution Architects, ScyllaDB