Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 171 posts
mgubaidullin/infinity
9/4/2020
Infinity is a prototype of cloud-agnostic forecasting platform inspired by Amazon Forecast service.Project was created as a part of the DataStax Hackathon aka ✨ASTRAKATHON✨ and won the first p...
Expero Blog | Building a Distributed Data Ingestion Pipeline
8/26/2020
IntroductionOn a recent client engagement where we had to load and process data from several data sources, we were tasked with a broader mandate to develop a wholesale data loading strategy for a suit...
polynote/polynote
8/25/2020
Polynote is an experimental polyglot notebook environment. Currently, it supports Scala and Python (with or without Spark), SQL, and Vega.For more information, see Polynote's websiteCurrent notebook ...
Polynote | The polyglot Scala notebook
Polynote | The polyglot Scala notebook Learn to love your notebooks. Try it! Polynote is a different kind of notebook. It supports mixing m...
Spark and Cassandra for Machine Learning: Testing - Business Platform Team
8/21/2020
Testing is how we guess at the efficacy of our machine learning models out in the real world. The basics may seem obvious, but specific test metrics can help you emphasize performance on the parts of ...
Spark and Cassandra For Machine Learning: Cross-Validation - Business Platform Team
Cross-validation is a collection of methods for repeated training and testing of our machine learning models. We do it in order to learn more than simple testing can tell us. These tests can help us t...
Spark and Cassandra: Doing SQL and Joins on Cassandra Tables - Business Platform Team
In this blog post, we will introduce Spark, a unified analytics engine for large-scale data processing, and discuss how to use it to do SQL on a NoSQL database like Cassandra. We will also give you a ...
polomarcus/Spark-Structured-Streaming-Examples
8/5/2020
Stream the number of time Drake is broadcasted on each radio. And also, see how easy is Spark Structured Streaming to use using Spark SQL's Dataframe APIRun the ProjectStep 1 - Start containersStart t...