最新消息:网盘下载利器JDownloader--|--发布资讯--|--站务--|--解压出错.密码问题--

Oreilly – Data Science at the Command Line

教程/Tutorials killking 0评论

Oreilly - Data Science at the Command Line

English | .MP4 | h264, yuv420p, 1280x720, 29.97 fps(r) | aac, 44100 Hz, mono | 392 MBGenre: E-learning

We data scientists love to create exciting data visualizations and insightful statistical models. However, before we get to that point, usually much effort goes into obtaining, scrubbing, and exploring the required data. 
The command line, although invented decades ago, is an amazing environment for performing such data science tasks. By combining small, yet powerful, command-line tools you can quickly explore your data and hack together prototypes. New tools such as GNU Parallel, jq, and Drake allow you to use the command line for today's data challenges. Even if you're already comfortable processing data with, for example, R or Python, being able to also leverage the power of the command line can make you a more efficient data scientist.

We will make use of the Data Science Toolbox, which is a free, open-source virtual environment that allows everybody to get started with data science in minutes. The Data Science Toolbox runs not only on Linux, but also on Mac OS X and Microsoft Windows, so everybody can participate with this hands-on webcast. In about two hours we will cover the following subjects:

Essential concepts of the *nix command line;
Setting up the Data Science Toolbox;
Integrating the command line with IPython and R;
Filters such as cut, grep, sed, and awk;
Scraping websites using curl, scrape, xml2json, and jq;
Managing your data science workflow using Drake;
Parallelizing and distributing data-intensive pipelines using GNU Parallel;
Turning existing Python, R and Java code into reusable command-line tools; and
Creating data visualizations and statistical models.

Oreilly - Data Science at the Command Line

Download uploaded
http://uploaded.net/file/1t1mqts2/DataScienceAtTheCommandLine.part1.rar
http://uploaded.net/file/t67kq48l/DataScienceAtTheCommandLine.part2.rar
http://uploaded.net/file/9l34nmy4/DataScienceAtTheCommandLine.part3.rar
http://uploaded.net/file/cj24bxep/DataScienceAtTheCommandLine.part4.rar

Download rapidgator
http://rg.to/file/4f27ad57cc554a30152d325dbdb3a06c/DataScienceAtTheCommandLine.part1.rar.html
http://rg.to/file/16f3a9960f28e2d8aba3879ea67b3e99/DataScienceAtTheCommandLine.part2.rar.html
http://rg.to/file/8bb70a1c4414057639306cf6f882b89e/DataScienceAtTheCommandLine.part3.rar.html
http://rg.to/file/298923238312a42b15fb3f392a89fb55/DataScienceAtTheCommandLine.part4.rar.html

Download 百度云

以下隐藏内容只提供VIP赞助会员

sorry! The following hidden content sponsorship VIP members only.

您必须 登录 才能发表评论!

网友最新评论 (1)

  1. Oreilly – 命令行的数据科学 我们这些数据科学家喜欢用直观的统计模型创建令人激动的数据可视化。但是在实现这一目标之前,大量的精力都耗在了获取、清洗以及探索所需的数据。 命令行尽管是十多年前发明的东东了,但是其仍旧是一个令人喜欢的用于执行数据科学任务的环境。通过与小巧但是强大的命令行工具,你可以很快地探索数据并尝试原型。如GNU Parallel、jq和Drake这样的新工具会允许你使用命令行用于数据的挑战。即使你已经喜欢了使用R或Python处理数据,你也能够利用命令行从而令你成为更有效率的数据科学家。 我们还将使用免费开源的Data Science Toolbox,其可视化环境能够令每个人在几分钟内就开始处理数据。Data Science Toolbox不仅允许在Linux上,也可以允许在Mac OS X和Windows。这样每个人都可以参与到本教程的学习中来。 主要内容:基本的*nix命令行;建立Data Science Toolbox;与IPython和R集成;如cut, grep, sed, and awk的过滤器;使用curl, scrape, xml2json, and jq抓取网站;使用Drake管理数据科学工作流程;使用GNU Parallel并行和分散丰富的数据管道;将已有Python、R和Java代码转化为可复用的命令行工具;创建数据可视化和统计模型;
    wilde(特殊组-翻译)2年前 (2015-02-09)