Anomaly Detection is a very powerful pattern and mostly, 70% time used. Azure Stream Analytics is having built-in ML-based Anomaly detection. It is based on the Un-supervised learning model i.e. model does not come with any pre-training, it starts learning with no o…
Read moreDelta Lake is an open-source storage layer that brings ACID transactions to Apache Spark and big data workloads. Delta Lake offers: ACID Transactions Scalable Metadata Handling Time Travel (data versioning) Open Format Unified Batch and Streaming Sou…
Read moreFirst if you not have installed Apache Spark yet, you need to follow here to install spark on windows 10. Now Download Bible.txt from here . We will be using bible for analysis here. I have placed this at C:\install // Example 1: Simple word count for bible …
Read moreStep1: Apache Spark requires Java 8, so make sure you have it before you start Step2. C:\>java -version java version "1.8.0_231" Java(TM) SE Runtime Environment (build 1.8.0_231-b32) Java HotSpot(TM) Client VM (build 25.231-b32, mixed mode) …
Read more