Showing posts with the label Scala

Real-time Anomaly detection on Azure

Anomaly detection is a very powerful pattern, used about 70% of the time. Azure Stream Analytics has built-in ML-based anomaly detection. It is based on an unsupervised learning model, i.e. the model does not come with any pre-training; it starts learning with no o…


Delta Lake in Apache Spark - Basics

Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark and big data workloads. Delta Lake offers: ACID transactions, scalable metadata handling, time travel (data versioning), open format, unified batch and streaming sou…


Learn Apache Spark and Scala with examples

First, if you have not installed Apache Spark yet, follow the guide here to install Spark on Windows 10. Next, download Bible.txt from here. We will use the Bible text for the analysis. I have placed the file at C:\install. // Example 1: Simple word count for bible …
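
The excerpt above is cut off before its first example, so the following is only a rough sketch of what a simple word count can look like, not the post's own code. It uses plain Scala collections rather than the Spark API so that it runs without any dependencies; `WordCountSketch`, `wordCount`, and `topN` are hypothetical names introduced here for illustration.

```scala
// Hypothetical sketch: word count with plain Scala collections (not Spark).
object WordCountSketch {

  // Count how often each word occurs, ignoring case and punctuation.
  def wordCount(text: String): Map[String, Int] =
    text
      .toLowerCase
      .split("\\W+")        // split on runs of non-word characters
      .filter(_.nonEmpty)   // drop the empty token that leading punctuation can produce
      .groupBy(identity)    // word -> all of its occurrences
      .map { case (word, occurrences) => word -> occurrences.length }

  // Return the n most frequent words, ties broken alphabetically.
  def topN(counts: Map[String, Int], n: Int): Seq[(String, Int)] =
    counts.toSeq.sortBy { case (word, count) => (-count, word) }.take(n)
}
```

The same split, group, and count shape carries over to a Spark job, where the text would be read as a distributed dataset instead of a single string. Here, `wordCount` could be applied to the contents of the downloaded file once it has been read into a `String`.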


How To Install Apache Spark On Windows 10

Step1: Apache Spark requires Java 8, so make sure you have it before you start Step2.

C:\>java -version
java version "1.8.0_231"
Java(TM) SE Runtime Environment (build 1.8.0_231-b32)
Java HotSpot(TM) Client VM (build 25.231-b32, mixed mode) …
