Data Literacy Training - case of CA Election 70


Published on January 25, 2014

Author: anjesh



This hands-on training was given to journalists in December 2013 under the OpenNepal banner.

Hands-on Training
Data: what and how? A case of CA Election 70
YoungInnovations, OpenNepal

Data → Story
● Find data
● Wrangle/clean up the data
● Merge data with others (if any)
● Filter and sort the data
● Analyze data
● Visualize data (story)

CA Election 2070
● What is data?
  - The candidates (age, gender, party)
  - The constituencies (VDC, ward, party)
  - The results (with votes, winner)
  - ...

Where to find it?
● The FPTP results data in XML

Finding data is not always this easy
● Scraping (requires programming knowledge)
  - Using the Chrome Scraper extension
● PDF conversion
● Manual transcription from PDF

Chrome Scraper Extension
● To install, search for “Chrome extension Scraper” in the Chrome browser

Scraper in Action

PDF to Text
● Online tools are available
● Linux has a different set of utilities
● PDF is still a big nuisance (though something is better than nothing)

PDF to Text es/ecn_report/constwisecandidatecount.pdf

PDF to Text
● Linux utility: pdftotext

CSV
● CSV: Comma-Separated Values
● Opens in MS Excel, OpenOffice, Google Spreadsheet
● Easy to work with

CA XML Data to CSV

XML to CSV?
● Online services are available
● Might need help from a technologist
● On Linux (there may be several ways), e.g.
  xml2 < FPTP-CA70.xml | 2csv FPTP DISTNAME CONST CANDIDATE AGE SEX PARTYNAME SYMBOLNAME TOTALVOTE STATUS > FPTP-CA70.csv
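Where xml2 is not at hand, the same kind of conversion can be sketched in Python with only the standard library. This is a rough sketch, not the method used in the training: it assumes each record is an <FPTP> element whose child elements carry the field names from the command above. The sample XML, its <RESULTS> root and the helper name xml_to_csv are made up for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Field names taken from the 2csv command above.
FIELDS = ["DISTNAME", "CONST", "CANDIDATE", "AGE", "SEX",
          "PARTYNAME", "SYMBOLNAME", "TOTALVOTE", "STATUS"]

# Made-up sample data; the layout of the real file is an assumption.
SAMPLE_XML = """<RESULTS>
  <FPTP>
    <DISTNAME>District A</DISTNAME><CONST>1</CONST>
    <CANDIDATE>Candidate One</CANDIDATE><AGE>45</AGE><SEX>F</SEX>
    <PARTYNAME>Party X</PARTYNAME><SYMBOLNAME>Sun</SYMBOLNAME>
    <TOTALVOTE>1200</TOTALVOTE><STATUS>Elected</STATUS>
  </FPTP>
  <FPTP>
    <DISTNAME>District B</DISTNAME><CONST>2</CONST>
    <CANDIDATE>Candidate Two</CANDIDATE><AGE>38</AGE><SEX>M</SEX>
    <PARTYNAME>Party Y</PARTYNAME><SYMBOLNAME>Tree</SYMBOLNAME>
    <TOTALVOTE>950</TOTALVOTE>
  </FPTP>
</RESULTS>"""


def xml_to_csv(xml_text):
    """Return CSV text with one row per <FPTP> record and no header row."""
    root = ET.fromstring(xml_text)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for record in root.iter("FPTP"):
        # A missing field (STATUS in the second record) becomes an empty cell.
        writer.writerow([record.findtext(name, default="").strip()
                         for name in FIELDS])
    return out.getvalue()


csv_text = xml_to_csv(SAMPLE_XML)
print(csv_text)
```

Like the xml2 pipeline, this sketch writes no header row, so the column names still have to be added afterwards.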

OpenNepal
● Repository of datasets
  - Data in CSV, XML or JSON format
● Request a dataset
● Request help with converting from one format to another, scraping data, ...
● The OpenNepal community (Google Group) is very vibrant

CA Results CSV data
● Converted from XML

Processing/Cleaning CSV: Basics
● Add header
● Sorting (by different fields)
● Filter
● Simple formulas

Add headers
● Insert a row at the top
● Add a header for each column
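The training does this step by hand in a spreadsheet. For larger files, the same step could be scripted roughly as below. The rows are made up, and the header names are borrowed from the field list in the XML conversion command; the helper name add_header is hypothetical.

```python
import csv
import io

# Column names borrowed from the field list used in the XML conversion.
HEADER = ["DISTNAME", "CONST", "CANDIDATE", "AGE", "SEX",
          "PARTYNAME", "SYMBOLNAME", "TOTALVOTE", "STATUS"]

# Made-up rows standing in for a headerless results file.
RAW_CSV = ("District A,1,Candidate One,45,F,Party X,Sun,1200,Elected\n"
           "District B,2,Candidate Two,38,M,Party Y,Tree,950,\n")


def add_header(csv_text, header):
    """Return the CSV text with a header row inserted at the top."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(header)   # the new first row
    writer.writerows(rows)    # the original data, unchanged
    return out.getvalue()


with_header = add_header(RAW_CSV, HEADER)
print(with_header)
```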

Sorting
● Sort by age
  - Ascending, descending
● Find the age of the youngest winning candidate
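Outside a spreadsheet, the sorting exercise might look like the sketch below. The candidates are made up, and the STATUS value "Elected" is an assumption about how winners are marked in the real data.

```python
# Made-up candidates; "Elected" as the winner marker is an assumption.
candidates = [
    {"CANDIDATE": "Candidate One", "AGE": "45", "STATUS": "Elected"},
    {"CANDIDATE": "Candidate Two", "AGE": "29", "STATUS": ""},
    {"CANDIDATE": "Candidate Three", "AGE": "33", "STATUS": "Elected"},
    {"CANDIDATE": "Candidate Four", "AGE": "61", "STATUS": ""},
]

# Values read from a CSV are text, so convert AGE to a number before sorting;
# otherwise "100" would sort before "29".
by_age_asc = sorted(candidates, key=lambda c: int(c["AGE"]))
by_age_desc = sorted(candidates, key=lambda c: int(c["AGE"]), reverse=True)

# Youngest winner: keep only winners, then take the smallest age.
winners = [c for c in candidates if c["STATUS"] == "Elected"]
youngest_winner = min(winners, key=lambda c: int(c["AGE"]))

print(youngest_winner["CANDIDATE"], youngest_winner["AGE"])
```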

Filtering
● Filter the list down to winning female candidates
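A spreadsheet filter on two columns corresponds to a simple condition in code. The sketch below uses made-up rows; the codes "F" and "Elected" are assumptions, and the real file may use different values (possibly in Nepali).

```python
import csv
import io

# Made-up data; the SEX and STATUS codes are assumptions.
CSV_TEXT = """CANDIDATE,SEX,STATUS
Candidate One,F,Elected
Candidate Two,M,Elected
Candidate Three,F,
Candidate Four,F,Elected
"""

rows = list(csv.DictReader(io.StringIO(CSV_TEXT)))

# Keep only rows that satisfy both conditions, like stacking two filters.
winning_women = [r for r in rows
                 if r["SEX"] == "F" and r["STATUS"] == "Elected"]

for r in winning_women:
    print(r["CANDIDATE"])
```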

Some exercises
● Are there people who didn't receive a single vote?
● What are the highest and lowest numbers of votes among candidates who didn't win?
● Find the percentage of female and male candidates, and the percentage of winning female candidates.
● Try the above exercises for one district of your interest.
● Think of other things you can do with these basic skills.
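As a cross-check on the spreadsheet formulas, the first three exercises can be sketched in a few lines. The data is made up and the "F"/"M" and "Elected" codes are assumptions. "Percentage of winning female candidates" can be read two ways, so the sketch computes both: the share of winners who are women, and the share of female candidates who won.

```python
# Made-up candidates; the SEX and STATUS codes are assumptions.
candidates = [
    {"CANDIDATE": "Candidate One", "SEX": "F", "TOTALVOTE": 1200, "STATUS": "Elected"},
    {"CANDIDATE": "Candidate Two", "SEX": "M", "TOTALVOTE": 950, "STATUS": ""},
    {"CANDIDATE": "Candidate Three", "SEX": "F", "TOTALVOTE": 0, "STATUS": ""},
    {"CANDIDATE": "Candidate Four", "SEX": "M", "TOTALVOTE": 1500, "STATUS": "Elected"},
    {"CANDIDATE": "Candidate Five", "SEX": "F", "TOTALVOTE": 300, "STATUS": ""},
]


def percentage(part, whole):
    """Percentage rounded to one decimal; 0.0 when the whole is empty."""
    return round(100.0 * part / whole, 1) if whole else 0.0


# Exercise 1: candidates without a single vote.
zero_votes = [c["CANDIDATE"] for c in candidates if c["TOTALVOTE"] == 0]

# Exercise 2: highest and lowest vote counts among non-winners.
loser_votes = [c["TOTALVOTE"] for c in candidates if c["STATUS"] != "Elected"]
highest_losing = max(loser_votes)
lowest_losing = min(loser_votes)

# Exercise 3: shares by gender (this sample has only "F" and "M" rows).
women = [c for c in candidates if c["SEX"] == "F"]
men = [c for c in candidates if c["SEX"] == "M"]
winners = [c for c in candidates if c["STATUS"] == "Elected"]
winning_women = [c for c in winners if c["SEX"] == "F"]

female_share = percentage(len(women), len(candidates))
male_share = percentage(len(men), len(candidates))
women_among_winners = percentage(len(winning_women), len(winners))
female_win_rate = percentage(len(winning_women), len(women))

print(zero_votes, highest_losing, lowest_losing)
print(female_share, male_share, women_among_winners, female_win_rate)
```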

More questions
● How many parties have candidates in all 240 constituencies?
● How many male and female candidates are there in Nepali Congress?
● What is the male-female ratio in far-west districts?
● Which party has the highest number of female candidates?

Data Processing - PivotTable

PivotTable - more
● Breakdown of independent candidates

Let's look at the numbers again
● Sorted by total number of candidates
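What a pivot table computes can be sketched in plain Python: group the rows by one column, count by another, then sort by the row totals. The records below are made up, and pivot_counts is a hypothetical helper, not a spreadsheet or library function.

```python
from collections import defaultdict

# Made-up records; only the two columns used by the pivot are shown.
records = [
    {"DISTNAME": "District A", "SEX": "F"},
    {"DISTNAME": "District A", "SEX": "M"},
    {"DISTNAME": "District A", "SEX": "M"},
    {"DISTNAME": "District B", "SEX": "M"},
    {"DISTNAME": "District C", "SEX": "F"},
    {"DISTNAME": "District C", "SEX": "F"},
]


def pivot_counts(rows, row_key, col_key):
    """Count rows for every (row_key, col_key) pair, like a count pivot."""
    table = defaultdict(lambda: defaultdict(int))
    for row in rows:
        table[row[row_key]][row[col_key]] += 1
    return {name: dict(counts) for name, counts in table.items()}


pivot = pivot_counts(records, "DISTNAME", "SEX")

# Sort districts by total number of candidates, largest first.
ranked = sorted(pivot.items(),
                key=lambda item: sum(item[1].values()),
                reverse=True)

for district, counts in ranked:
    print(district, counts.get("F", 0), counts.get("M", 0), sum(counts.values()))
```

Swapping "DISTNAME" for "PARTYNAME" would give a per-party breakdown of the same kind.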

Visualization
● Bar graph of male and female candidates in the top few districts

What other visualizations are possible?

Geocoding
● Geocoding
  - The conversion of a human-readable location name into a numeric (or other machine-processable) location such as a longitude and latitude
  - Kathmandu => [geocoding] => {latitude: 27.70169, longitude: 85.3206}
● Online tools available for geocoding
  - Google Fusion Table
  - CartoDB
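The idea of geocoding can be illustrated with a toy lookup table. Real geocoding services search large place-name databases online; this offline sketch only knows the one coordinate pair given above, and the names GAZETTEER and geocode are made up for illustration.

```python
# Toy gazetteer holding only the coordinate pair from the slide.
GAZETTEER = {
    "kathmandu": (27.70169, 85.3206),
}


def geocode(place_name):
    """Return latitude/longitude for a known place name, else None."""
    key = place_name.strip().lower()   # tolerate stray spaces and case
    if key not in GAZETTEER:
        return None
    latitude, longitude = GAZETTEER[key]
    return {"latitude": latitude, "longitude": longitude}


print(geocode("Kathmandu"))
print(geocode("Unknown place"))
```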

Lat-long in Google Maps
● Put the lat-long (27.70169 85.3206) in the Google Maps search box

Services available for geocoding rch?format=xml&q=Kathmandu,Nepal

Problems with this CSV
● Unicode (non-English) district names
● Can't geocode them (currently only English names work)

Adding English district names
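One way to add the English names is a lookup table applied to every row, sketched below. It assumes the district names in the CSV are written in Devanagari; the three mappings are illustrative and the column name DISTNAME_EN is hypothetical. Names missing from the table are left blank so they can be filled in by hand.

```python
import unicodedata

# Illustrative mapping only; a full table would cover every district.
NEPALI_TO_ENGLISH = {
    "काठमाडौं": "Kathmandu",
    "ललितपुर": "Lalitpur",
    "भक्तपुर": "Bhaktapur",
}


def normalize(text):
    """Normalize Unicode so visually identical names compare equal."""
    return unicodedata.normalize("NFC", text.strip())


LOOKUP = {normalize(k): v for k, v in NEPALI_TO_ENGLISH.items()}


def add_english_names(rows):
    """Return copies of the rows with an extra DISTNAME_EN column."""
    result = []
    for row in rows:
        new_row = dict(row)
        # Unknown districts get an empty string for manual follow-up.
        new_row["DISTNAME_EN"] = LOOKUP.get(normalize(row["DISTNAME"]), "")
        result.append(new_row)
    return result


rows = [{"DISTNAME": "काठमाडौं"}, {"DISTNAME": " ललितपुर "}, {"DISTNAME": "अज्ञात"}]
translated = add_english_names(rows)
print([r["DISTNAME_EN"] for r in translated])
```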

Google Fusion Table
● (needs a Gmail account)

Imported data


Using filter in the map

Use of heatmap based on votes

Thank you
