Hands-On Data Analysis with Scala Perform Data Collection, Processing, Manipulation, and Visualization with Scala.

This book will help you perform effective data analysis with Scala using practical examples. You will come across different challenges and their effective solutions for a variety of data processing tasks - be it data exploration, data manipulation, or real-time data analysis using Apache Spark.

Bibliographic Details
Main Author: Gupta, Rajesh
Format: eBook
Language:English
Published: Birmingham Packt Publishing, Limited, 2019.
Subjects:
Online Access:EBSCOhost
Перейти в каталог НБ ТГУ
LEADER 04678cam a2200457Mi 4500
001 koha001014109
003 OCoLC
005 20250222070023.0
006 m d
007 cr |n|---|||||
008 190525s2019 enk o 000 0 eng d
035 |a koha001014109 
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d CHVBK  |d OCLCQ  |d YDX  |d UKAHL  |d OCLCQ  |d N$T 
019 |a 1100452515 
020 |a 1789344263 
020 |a 9781789344264  |q (electronic bk.) 
050 4 |a QA76.73.S67  |b .L674 2019 
082 0 4 |a 005.133  |2 23 
049 |a MAIN 
100 1 |a Gupta, Rajesh.  |9 453787 
245 1 0 |a Hands-On Data Analysis with Scala  |b Perform Data Collection, Processing, Manipulation, and Visualization with Scala. 
260 |a Birmingham  |b Packt Publishing, Limited,  |c 2019.  |9 911099 
300 |a 1 online resource (288 pages) 
588 0 |a Print version record. 
505 0 |a Cover; Title Page; Copyright and Credits; Dedication; About Packt; Contributors; Table of Contents; Preface; Section 1: Scala and Data Analysis Life Cycle; Chapter 1: Scala Overview; Getting started with Scala; Running Scala code online; Scastie; ScalaFiddle; Installing Scala on your computer; Installing command-line tools; Installing IDE; Overview of object-oriented and functional programming; Object-oriented programming using Scala; Functional programming using Scala; Scala case classes and the collection API; Scala case classes; Scala collection API; Array; List; Map 
505 8 |a Overview of Scala libraries for data analysisApache Spark; Breeze; Breeze-viz; DeepLearning; Epic; Saddle; Scalalab; Smile; Vegas; Summary; Chapter 2: Data Analysis Life Cycle; Data journey; Sourcing data; Data formats; XML; JSON; CSV; Understanding data; Using statistical methods for data exploration; Using Scala; Other Scala tools; Using data visualization for data exploration; Using the vegas-viz library for data visualization; Other libraries for data visualization; Using ML to learn from data; Setting up Smile; Running Smile; Creating a data pipeline; Summary; Chapter 3: Data Ingestion 
505 8 |a Data extractionPull-oriented data extraction; Push-oriented data delivery; Data staging; Why is the staging important?; Cleaning and normalizing; Enriching; Organizing and storing; Summary; Chapter 4: Data Exploration and Visualization; Sampling data; Selecting the sample; Selecting samples using Saddle; Performing ad hoc analysis; Finding a relationship between data elements; Visualizing data; Vegas viz for data visualization; Spark Notebook for data visualization; Downloading and installing Spark Notebook; Creating a Spark Notebook with simple visuals; More charts with Spark Notebook 
505 8 |a Box plotHistogram; Bubble chart; Summary; Chapter 5: Applying Statistics and Hypothesis Testing; Basics of statistics; Summary level statistics; Correlation statistics; Vector level statistics; Random data generation; Pseudorandom numbers; Random numbers with normal distribution; Random numbers with Poisson distribution; Hypothesis testing; Summary; Section 2: Advanced Data Analysis and Machine Learning; Chapter 6: Introduction to Spark for Distributed Data Analysis; Spark setup and overview; Spark core concepts; Spark Datasets and DataFrames; Sourcing data using Spark; Parquet file format 
505 8 |a Avro file formatSpark JDBC integration; Using Spark to explore data; Summary; Chapter 7: Traditional Machine Learning for Data Analysis; ML overview; Characteristics of ML; Categories or types of ML; Decision trees; Implementing decision trees; Decision tree algorithms; Implementing decision tree algorithms in our example; Evaluating the results; Using our model with a decision tree; Random forest; Random forest algorithms; Ridge and lasso regression; Characteristics of ridge regression; Characteristics of lasso regression; k-means cluster analysis 
500 |a Natural language processing for data analysis 
520 |a This book will help you perform effective data analysis with Scala using practical examples. You will come across different challenges and their effective solutions for a variety of data processing tasks - be it data exploration, data manipulation, or real-time data analysis using Apache Spark. 
653 0 |a Data mining. 
653 0 |a Scala (Computer program language) 
653 0 |a SQL. 
655 0 |a EBSCO eBooks  |9 905790 
655 4 |a Electronic books.  |9 899821 
856 4 0 |3 EBSCOhost  |u https://www.lib.tsu.ru/limit/2023/EBSCO/2117000.pdf 
856 |y Перейти в каталог НБ ТГУ  |u https://koha.lib.tsu.ru/cgi-bin/koha/opac-detail.pl?biblionumber=1014109 
910 |a EBSCO eBooks 
999 |c 1014109  |d 1014109 
039