Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 171 posts
Connect Apache Spark and DataStax Astra - Business Platform Team
11/19/2020
In this blog, we discuss connecting Apache Spark and DataStax Astra. Also, a webinar recording is embedded below if you want to watch a live demo where we use Gitpod and Spark-Shell to connect to Data...
Apache Spark Streaming with Kafka and Cassandra I
11/18/2020
PrerequitesWe need to make sure Java is installed: $ java -version openjdk version "1.8.0_111" OpenJDK Runtime Environment (build 1.8.0_111-8u111-b14-2ubuntu0.16.04.2-b14) OpenJDK 64-Bit Server VM (bu...
Creating Data Pipeline with Spark streaming, Kafka and Cassandra
Reading Time: 3 minutes Hi Folks!! In this blog, we are going to learn how we can integrate Spark Structured Streaming with Kafka and Cassandra to build a simple data pipeline. Spark Structured Stre...
Cassandra Sink for Spark Structured Streaming
I've recently begun to use Spark and had to store the results produced by Structured Streaming API in a Cassandra database.In this post, I provide a simple example of how to create and use Cassandra s...
Streaming Analytics with Spark, Kafka, Cassandra, and Akka - Databricks
This talk will address how a new architecture is emerging for analytics, based on Spark, Mesos, Akka, Cassandra and Kafka (SMACK). Popular architecture like Lambda separate layers of computation and d...
Spark Streaming, Kafka and Cassandra Tutorial - Instaclustr
11/4/2020
This tutorial builds on our basic “Getting Started with Instaclustr Spark and Cassandra” tutorial to demonstrate how to set up Apache Kafka and use it to send data to Spark Streaming where it is summa...
YugabyteDB Architecture: Diverse Workloads with Operational Simplicity - The Distributed SQL Blog
11/1/2020
DataStax-Examples/SparkBuildExamples
10/23/2020
These are template projects that illustrate how to build Spark Application written in Java or Scala with Maven, SBT or Gradle which can be run on either DataStax Enterprise (DSE) or Apache Spark. The...