最新消息:网盘下载利器JDownloader--|--发布资讯--|--解压出错.密码问题--|--百度盘链接打开页面有问题的请将Http改成Https

Data Just Right: Introduction to Large-Scale Data & Analytics by Michael Manoochehri-P2P

Making Big Data Work: Real-World Use Cases and Examples, Practical Code, Detailed Solutions

Large-scale data analysis is now vitally important to virtually every business. Mobile and social technologies are generating massive datasets; distributed cloud computing offers the resources to store and analyze them; and professionals have radically new technologies at their command, including NoSQL databases. Until now, however, most books on “Big Data” have been little more than business polemics or product catalogs. Data Just Right is different: It’s a completely practical and indispensable guide for every Big Data decision-maker, implementer, and strategist.

Michael Manoochehri, a former Google engineer and data hacker, writes for professionals who need practical solutions that can be implemented with limited resources and time. Drawing on his extensive experience, he helps you focus on building applications, rather than infrastructure, because that’s where you can derive the most value.

Manoochehri shows how to address each of today’s key Big Data use cases in a cost-effective way by combining technologies in hybrid solutions. You’ll find expert approaches to managing massive datasets, visualizing data, building data pipelines and dashboards, choosing tools for statistical analysis, and more. Throughout, the author demonstrates techniques using many of today’s leading data analysis tools, including Hadoop, Hive, Shark, R, Apache Pig, Mahout, and Google BigQuery.

Coverage includes:

  • Mastering the four guiding principles of Big Data success—and avoiding common pitfalls
  • Emphasizing collaboration and avoiding problems with siloed data
  • Hosting and sharing multi-terabyte datasets efficiently and economically
  • “Building for infinity” to support rapid growth
  • Developing a NoSQL Web app with Redis to collect crowd-sourced data
  • Running distributed queries over massive datasets with Hadoop, Hive, and Shark
  • Building a data dashboard with Google BigQuery
  • Exploring large datasets with advanced visualization
  • Implementing efficient pipelines for transforming immense amounts of data
  • Automating complex processing with Apache Pig and the Cascading Java library
  • Applying machine learning to classify, recommend, and predict incoming information
  • Using R to perform statistical analysis on massive datasets
  • Building highly efficient analytics workflows with Python and Pandas
  • Establishing sensible purchasing strategies: when to build, buy, or outsource
  • Previewing emerging trends and convergences in scalable data technologies and the evolving role of the Data Scientist

Data Just Right: Introduction to Large-Scale Data & Analytics by Michael Manoochehri-P2P
English | ISBN: 0321898656 | 2013 | 256 pages | PDF/EPUB | 6.7 MB/7.3 MB


Download uploaded
http://ul.to/i8onnhpf

Download nitroflare
http://nitroflare.com/view/65B685362989CF9/0321898656Data.pdf

Download 城通网盘
http://page88.ctfile.com/fs/pax154432404

Download 百度云
http://pan.baidu.com/s/1c2BNg9i

您必须 登录 才能发表评论!

网友最新评论 (1)

  1. 大规模数据分析现在已经对每个企业都是至关重要的。移动和社会性技术正在生成海量数据;分布式云计算提供了存储和分析的资源;专业人士拥有着包括NoSQL数据库这样的技术。然而,直至目前,大多数讲”大数据“的书籍还只是停留在企业论战或产品目录层面。本书则不同。这是一本对于每个大数据决策制定者、实施者以及战略家来说完全实用且必不可缺的指南。 Michael Manoochehri,前Google工程师和数据黑客,为那些在有限资源和时间条件下,需要实用的能够部署的解决方案的专业人士编写了本书。来源于其自身的经验,他会帮助你集中于开发应用程序,而非基础架构,因为那才是你能够发掘最多价值的地方。 Manoochehri会为你以综合了混合解决方案技术的方式,讲解如何处理每天关键的大数据使用案例。你将会学习到管理海量数据、可视化数据、开发数据管道和仪表盘、选择统计分析工具等专业方法。贯穿本书,作者会使用时下领先的数据分析工具,包括Hadoop、Hive、Shark、R、Apache Pig、Mahout以及Google BigQuery。 主要内容:掌握大数据成功的四个原则,避免常见错误;强调协作,避免孤立数据;有效经济地存储和分项海量数据;适用于未来增长;利用Redis开发NoSQL Web应用,收集众包数据;通过Hadoop、Hive和Shark对海量数据运行分布式查询;利用Google BigQuery开发数据仪表盘;通过高级的可视化谈了大数据;
    wilde(特殊组-翻译)8个月前 (07-30)