Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud

by Robert Ilijason
  • Length: 291 pages
  • Edition: 1
  • Publisher: Apress
  • Publication Date: 2020-07-10
  • ISBN-10: 1484257804
  • ISBN-13: 9781484257807
  • Sales Rank: #1898915
Description

Analyze vast amounts of data in record time using Apache Spark with Databricks in the cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fraction of what classical analytics solutions cost, while getting the results you need faster.

This book explains how the confluence of these pivotal technologies gives you enormous power at low cost when working with huge datasets. You will begin by learning how cloud infrastructure makes it possible to scale your code to large numbers of processing units, without having to pay for the machinery in advance. From there you will learn how Apache Spark, an open source framework, can put all those CPUs to work on data analytics. Finally, you will see how services such as Databricks provide the power of Apache Spark without you having to know anything about configuring hardware or software. By removing the need for expensive experts and hardware, your resources can instead be allocated to actually finding business value in the data.

This book guides you through some advanced topics such as analytics in the cloud, data lakes, data ingestion, architecture, machine learning, and tools, including Apache Spark, Apache Hadoop, Apache Hive, Python, and SQL. Valuable exercises help reinforce what you have learned.

What You Will Learn

  • Discover the value of big data analytics that leverage the power of the cloud
  • Get started with Databricks using SQL and Python in either Microsoft Azure or AWS
  • Understand the underlying technology, and how the cloud and Apache Spark fit into the bigger picture
  • See how these tools are used in the real world
  • Run basic analytics, including machine learning, on billions of rows at a fraction of the cost, or for free

Who This Book Is For

Data engineers, data scientists, and cloud architects who want or need to run advanced analytics in the cloud. It is assumed that the reader has data experience, but perhaps minimal exposure to Apache Spark and Azure Databricks. The book is also recommended for people who want to get started in the analytics field, as it provides a strong foundation.
