Exploratory Data Analysis

60 %
40 %
Information about Exploratory Data Analysis

Published on February 15, 2014

Author: thinrhino



Talk given by me at Gnunify 2014 on Exploratory Data Analysis

Exploratory Data Analysis Aditya Laghate Twitter: @thinrhino 1

Who am I? • A pseudo geek • Freelance software consultant • Wildlife photographer Twitter: @thinrhino 2

Agenda • • • • Data gathering Data cleaning Usage of classic unix tools Data analysis Twitter: @thinrhino 3

Data Gathering • Public data websites o o • Social websites o o • Blogs / websites /etc via scrapping Twitter: @thinrhino 4

Data cleaning • Eg: openrefine o OpenRefine (ex-Google Refine) is a powerful tool for working with messy data, cleaning it, transforming it from one format into another, extending it with web services, and linking it to databases like Freebase o Twitter: @thinrhino 5

Classic Unix Tools • sed /awk • Shell scripts • GNU parallel o Examples: o cat rands20M.txt | awk '{s+=$1} END {print s}’ o cat rands20M.txt | parallel --pipe awk '{s+=$1}END{print s}' | awk '{s+=$1} END {print s}’ o wc -l bigfile.txt o cat bigfile.txt | parallel {print s}' Twitter: @thinrhino --pipe wc -l | awk '{s+=$1} END 6

Data Analysis Twitter: @thinrhino 7

Questions @thinrhino Twitter: @thinrhino 8

Add a comment

Related presentations

Presentación que realice en el Evento Nacional de Gobierno Abierto, realizado los ...

In this presentation we will describe our experience developing with a highly dyna...

Presentation to the LITA Forum 7th November 2014 Albuquerque, NM

Un recorrido por los cambios que nos generará el wearabletech en el futuro

Um paralelo entre as novidades & mercado em Wearable Computing e Tecnologias Assis...

Microsoft finally joins the smartwatch and fitness tracker game by introducing the...

Related pages

Exploratory data analysis - Wikipedia, the free encyclopedia

In statistics, exploratory data analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often with visual methods.
Read more

1. Exploratory Data Analysis

1. Exploratory Data Analysis: This chapter presents the assumptions, principles, and techniques necessary to gain insight into data via EDA--exploratory ...
Read more

Exploratory Data Analysis - Johns Hopkins University ...

Exploratory Data Analysis from Johns Hopkins University. This course covers the essential exploratory techniques for summarizing data. These techniques are ...
Read more

1.1.1. What is EDA? - NIST

Approach: Exploratory Data Analysis (EDA) is an approach/philosophy for data analysis that employs a variety of techniques (mostly graphical) to
Read more

Exploratory Data Analysis Using R | Udacity

Exploratory Data Analysis Using R covers ways to summarize and visualize important characteristics of a data set. Find more on data analysis from Udacity.
Read more

Chapter 4 Exploratory Data Analysis - CMU Statistics

Chapter 4 Exploratory Data Analysis A rst look at the data. As mentioned in Chapter 1, exploratory data analysis or EDA" is a critical rst step in ...
Read more


Interactive Exploratory Data Analysis on your desktop. Exploratory Desktop provides an interactive and reproducible real data wrangling and analysis ...
Read more Exploratory Data Analysis (9780201076165 ...

Buy Exploratory Data Analysis on FREE SHIPPING on qualified orders
Read more

Exploratory Data Analysis (Addison-Wesley Series in ...

John W. Tukey - Exploratory Data Analysis (Addison-Wesley Series in Behavioral Science) jetzt kaufen. ISBN: 9780201076165, Fremdsprachige Bücher ...
Read more

Explorative Datenanalyse – Wikipedia

Tatsächlich kann die Begriffsverwirrung zwischen den zwei Typen der Analyse und der Anwendung ... Robust and Exploratory Data Analysis ISBN 0 ...
Read more