Grab is a powerhouse in Southeast Asia. Its mobile app services cover a broad swath of everyday needs, from acting as a mobile wallet to arranging affordable ridesharing and food and package delivery. Imagine if Apple Pay, Lyft, and DoorDash were all bundled in one app. Grab has exploded in use across Southeast Asia, spreading from its origins in Malaysia and Singapore to Cambodia, Vietnam, Thailand, Myanmar, Indonesia, and the Philippines. The audience at last fall’s Scylla Summit had a chance to hear about Grab’s use of Scylla in detail, directly from Grab’s Engineering Lead, Aravind Srinivasan. Aravind first presented the […]
The Internet is not just connecting people around the world. Through the Internet of Things (IoT), it is also connecting humans to the machines all around us and directly connecting machines to other machines. In this blog post we’ll share an emerging machine-to-machine (M2M) architecture pattern in which MQTT, Apache Kafka and Scylla all work together to provide an end-to-end IoT solution. We’ll also provide demo code so you can try it out for yourself. IoT Scale IoT is a fast-growing market, already known to be over $1.2 trillion in 2017 and anticipated to grow to over $6.5 trillion […]
To quote Bob Dylan, “the times they are a changin’.” Microsoft loves Linux, IBM buys Red Hat, RedisLabs changes their module license to Commons Clause, Mongo invents Server Side Public License (SSPL) and moves from AGPL, AWS open sources Firecracker and releases a Kafka service, and the hot news from Friday, Confluent changes its license for components of the Confluent Platform from Apache 2.0 to the Confluent Community License. A few weeks ago I wrote about MongoDB’s SSPL, which is similar to Confluent’s new license. You could say the Confluent Community License is to the Apache license as MongoDB SSPL […]
Spark Structured Streaming with Scylla Hello again! Following up on our previous post on saving data to Scylla, this time we’ll discuss using Spark Structured Streaming with Scylla and see how streaming workloads can be written into ScyllaDB. This is the fourth part of our four-part series. Make sure you check out all the prior blogs! Our code samples repository for this post contains an example project along with a docker-compose.yaml file with the necessary infrastructure for running it. We’re going to use the infrastructure to run the code samples throughout the post and run the project itself, […]
Scylla Summit 2018 was quite an event! Your intrepid reporter tried to keep up with the goings-on, live-tweeting the event from opening to close. If you missed my Tweetstream, you can pick it up here: And here we go! "Four years ago, Dor and Avi had an idea for a better database." #ScyllaSummit pic.twitter.com/lCM08NVDJd (Peter Corless 🌎☮, @PeterCorless, November 6, 2018) It’s impossible to pack two days and dozens of speakers into a few thousand words, so I’m going to give just the highlights and will embed the SlideShare links for a selected few talks. However, feel free to check out […]
In the run-up to Scylla Summit 2018, we’ll be featuring our speakers and providing sneak peeks at their presentations. This interview in our ongoing series is with two speakers holding a joint session: Scylla and KairosDB in Smart Vehicle Diagnostics. The first part of the talk will feature Brian Hawkins speaking on the time-series database (TSDB) KairosDB, which runs atop Scylla or Cassandra. He’ll then turn the session over to Bin Wang of Faraday Future (FF), who will discuss his company’s use case in automotive real-time data collection. Brian, Bin, thank you for taking the time to speak with me. […]
In the run-up to Scylla Summit 2018, we’ll be featuring our speakers and providing sneak peeks at their presentations. This interview in our ongoing series is with Hojjat Jafarpour of Confluent. His presentation at Scylla Summit is entitled Scalable Stream Processing with KSQL, Kafka and Scylla. Hojjat, before we get into your talk, tell us a little about yourself. What do you like to do for fun? I’m a Software Engineer at Confluent and the creator of KSQL, the Streaming SQL engine for Apache Kafka. In addition to challenging problems in scalable data management, I like traveling and outdoors. We are so […]
Scylla came out on top and is now in production at Grab. In this post, you can learn more about Grab’s journey to Scylla and see what they liked about it.
It is time to start processing data gathered from applications more efficiently. Applications typically gather large amounts of data over time, in different data types and from different sources such as IoT devices and microservices applications. Traditional data warehouses use ETL (Extract, Transform, Load) strategies, which are batch-driven at specific time intervals, and each component talks to every other component through messaging queues. This creates a management nightmare: custom scripts move data from sources to destinations as one-offs, there are many single points of failure, and there is no way to analyze the data in real time. A more […]
Let’s talk about a financial use case where streaming and near-real-time messaging are used through Kafka and Scylla. We will model a system that allows subscribers to follow stock prices for the companies that interest them, similar to a simplified trading terminal. Our system follows an architectural pattern in which stock price updates are pushed to a Kafka topic, and subscribers consume messages that contain company stock information. These consumed messages are then stored in Scylla, where they can be used later for more sophisticated analysis (for example, using an engine like Spark).
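To make the flow concrete, here is a minimal sketch of the publish, consume, and store steps. It is not the code from the post: an in-process `queue.Queue` stands in for Kafka and a dictionary stands in for a Scylla table, so that it runs without any infrastructure. The names `publish_price`, `consume_and_store`, `stock_topic`, and `stock_table`, as well as the message fields, are all assumptions made for illustration.

```python
import json
import queue
from collections import defaultdict

# Stand-in for Kafka: an in-process FIFO queue (assumption, not a Kafka client).
stock_topic = queue.Queue()

# Stand-in for a Scylla table: symbol -> list of (timestamp, price) rows.
stock_table = defaultdict(list)


def publish_price(symbol, price, ts):
    """Producer side: serialize a price update and push it to the topic."""
    message = json.dumps({"symbol": symbol, "price": price, "ts": ts})
    stock_topic.put(message)


def consume_and_store(subscriptions):
    """Consumer side: drain the topic and store updates for subscribed symbols.

    Returns the number of updates that were stored.
    """
    stored = 0
    while True:
        try:
            raw = stock_topic.get_nowait()
        except queue.Empty:
            break  # nothing left to consume
        update = json.loads(raw)
        if update["symbol"] in subscriptions:
            stock_table[update["symbol"]].append((update["ts"], update["price"]))
            stored += 1
    return stored


# Hypothetical symbols and prices, used only to exercise the sketch.
publish_price("ACME", 101.5, 1)
publish_price("INIT", 20.0, 2)
publish_price("ACME", 102.0, 3)
stored = consume_and_store({"ACME"})
```

In a real deployment the queue would be replaced by a Kafka producer and consumer, and the dictionary by writes to a Scylla table, most likely partitioned by symbol with the timestamp as a clustering key. That schema choice is also an assumption rather than something stated above.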