Data Science at the Command Line

Author : Jeroen Janssens
Publisher : "O'Reilly Media, Inc."
Total Pages : 207
Release : 2014-09-25
ISBN-10 : 1491947802
ISBN-13 : 9781491947807
Rating : 4/5 (802 Downloads)

Book Synopsis: Data Science at the Command Line by Jeroen Janssens

Data Science at the Command Line was written by Jeroen Janssens and published by "O'Reilly Media, Inc." on 2014-09-25. It runs 207 pages and is available in PDF, EPUB, and Kindle formats.

Book excerpt: This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, whether you’re on Windows, OS X, or Linux, author Jeroen Janssens introduces the Data Science Toolbox, an easy-to-install virtual environment packed with over 80 command-line tools. Discover why the command line is an agile, scalable, and extensible technology. Even if you’re already comfortable processing data with, say, Python or R, you’ll greatly improve your data science workflow by also leveraging the power of the command line. With this book, you will:

Obtain data from websites, APIs, databases, and spreadsheets
Perform scrub operations on plain text, CSV, HTML/XML, and JSON
Explore data, compute descriptive statistics, and create visualizations
Manage your data science workflow using Drake
Create reusable tools from one-liners and existing Python or R code
Parallelize and distribute data-intensive pipelines using GNU Parallel
Model data with dimensionality reduction, clustering, regression, and classification algorithms


Books Related to Data Science at the Command Line

Data Science at the Command Line
Language: en
Pages: 207
Authors: Jeroen Janssens
Categories: Computers
Type: BOOK - Published: 2014-09-25 - Publisher: "O'Reilly Media, Inc."

This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how

Data Science at the Command Line
Language: en
Pages: 270
Authors: Jeroen Janssens
Categories: Computers
Type: BOOK - Published: 2021-08-17 - Publisher: "O'Reilly Media, Inc."

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll le

Cleaning Data for Effective Data Science
Language: en
Pages: 499
Authors: David Mertz
Categories: Mathematics
Type: BOOK - Published: 2021-03-31 - Publisher: Packt Publishing Ltd

Think about your data intelligently and ask the right questions. Key Features: Master data cleaning techniques necessary to perform real-world data science and mac

Hands-On Data Science with the Command Line
Language: en
Pages: 121
Authors: Jason Morris
Categories: Computers
Type: BOOK - Published: 2019-01-31 - Publisher: Packt Publishing Ltd

Big data processing and analytics at speed and scale using command line tools. Key Features: Perform string processing, numerical computations, and more using CLI

Python Data Science Handbook
Language: en
Pages: 609
Authors: Jake VanderPlas
Categories: Computers
Type: BOOK - Published: 2016-11-21 - Publisher: "O'Reilly Media, Inc."

For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources e