Platfora - An Analytics Sandbox In A World Of Big Data

50 %
50 %
Information about Platfora - An Analytics Sandbox In A World Of Big Data
Technology

Published on March 14, 2014

Author: markginnebaugh

Source: slideshare.net

Description

As Big Data becomes the norm in dealing with data volume, variety, and velocity, it becomes increasingly harder for the Data Analyst to understand and work with data sets. To overcome this we introduce Platfora, a Hadoop backed data analysis framework which nicely complements more traditional data warehousing and BI solutions. This presentation covers ingestion of new data and building of data sets and visualizations,in a system that requires no more work than interacting with a graphical interface. You'll see examples of peer-to-peer lending and how insights on loan applicants and their risk profiles can be quickly revealed with no ETL development or demanding data transformation.

©2014 DesignMind. All Rights Reserved. An Analytics Sandbox in a World of Big Data Roberto Arnetoli roberto@designmind.com Vice President,Big DataSolutions Andrew Eichenbaum andrew@designmind.com Principal DataScience Consultant Platfora

2 ©2014 DesignMind. All Rights Reserved. DesignMind’s Expertise and Offering Power BI Applications Databases Data Warehousing Big Data BI & Data Visualization Information Sharing & CollaborationCloud Computing Data Science

3 ©2014 DesignMind. All Rights Reserved. Our Clients

4 ©2014 DesignMind. All Rights Reserved. Agenda  Big Data and Self-Service Analytics  Platfora  Case Study: Peer-2-Peer Lending  Demo  Conclusion and Questions

5 ©2014 DesignMind. All Rights Reserved. Big Data and Self-Service Analytics

6 ©2014 DesignMind. All Rights Reserved. What is Big Data?  Largedata sets  excessive retrievaland processing time  structured and unstructured collections BIG DATA

7 ©2014 DesignMind. All Rights Reserved.  volume velocity variety Volum e Velocity Variety SQL BIG DATA SQL vs. Big Data

8 ©2014 DesignMind. All Rights Reserved. We tend to structure data  we tend to prepare, transform and structuredata  severaladvantages - - - -  severalnon-trivial disadvantages - - - Traditional DataWarehouse Big Data Platform

9 ©2014 DesignMind. All Rights Reserved. For today’s Data Scientistsit issimply not enough! mailfeeds additional databases multimedia logs social geo e-commerce unstructured text web Traditional DataWarehouse Big Data Platform

10 ©2014 DesignMind. All Rights Reserved. mailfeeds additional databases ia social web Traditional DataWarehouse Big Data Platform For today’s Data Scientistsit issimply not enough!  self-serviceanalyticsplatform  ‘analyticssandbox’  significantly reduce timeand costs

11 ©2014 DesignMind. All Rights Reserved. DesignMind chooses Platfora  Microsoft Gold Data PlatformPartnerand SilverBI Partner ClouderaPartner PlatforaPartner  data analyticswinning solution maximize thevalueof their data makefact-based decisions Big Data Platform Traditional Data Warehouse Self-Service Analytics

12 ©2014 DesignMind. All Rights Reserved. Platfora

13 ©2014 DesignMind. All Rights Reserved. Platfora is an All in One Data Sandbox Ingest Select Explore

14 ©2014 DesignMind. All Rights Reserved. Platfora Easily Ingests Data  Delimited Text XML JSON Raw Text Avro 

15 ©2014 DesignMind. All Rights Reserved. Platfora MeansHands Off ETL    lenses

16 ©2014 DesignMind. All Rights Reserved. Platfora MeansHands Off ETL  Platfora ETLprocessbacked by Hadoop - Automaticcluster creation on multiple platforms(Amazon,Cloudera, Hortonworks) - Cluster sizesfrom one node to many  Automaticallyhandlesthe handoff of multiple filesof any size to the cluster  Scheduling available for data reprocessing or updates

17 ©2014 DesignMind. All Rights Reserved. Platfora Allows for Easy Data Exploration 

18 ©2014 DesignMind. All Rights Reserved. Typical Big Data Warehousing Stack  complexlinear process Data warehouse accesstools have no easy way to accessthe data from earlier stages Only way to get new data in is to reprocess the data at the Ingestion and Transformation levels Ingest Select Explore Transformation I n g e t s i o n

19 ©2014 DesignMind. All Rights Reserved. Big Data Warehousing Tools Pig  Transformation  Each step can be complexand need a knowledgeablesupport staff  Ingestion  BI Tools  data warehousing

20 ©2014 DesignMind. All Rights Reserved. Platfora Sits Parallel to the Traditional Stack  Ingest Select Explore Data Catalog VizboardsLenses Transformation I n g e t s i o n

21 ©2014 DesignMind. All Rights Reserved. Case Study: Peer-2-Peer Lending

22 ©2014 DesignMind. All Rights Reserved. What is P2P Lending   

23 ©2014 DesignMind. All Rights Reserved.  - - -  - - -

24 ©2014 DesignMind. All Rights Reserved. Completed Loans: Months to Last Payment  Loans can complete in two ways: Charge Off (Default) and Fully Paid  Normal loan durations are 36 and 60 months.  Early payoff and Charge Offs follow the same curve after two months of payments.  Loan Charge Off rate is approximately 16% for loans completed in the first the first 18 months.

25 ©2014 DesignMind. All Rights Reserved. Loan Stats: Average Revolving to Maximum Credit  When loans are in funding, can we find predictors of default?  We look at loan applicants total revolving credit (e.g. credit cards) vs the average revolving credit balance

26 ©2014 DesignMind. All Rights Reserved. Loan Stats: Average Revolving to Maximum Credit

27 ©2014 DesignMind. All Rights Reserved. Demo

28 ©2014 DesignMind. All Rights Reserved. Demo Notes  - -  - -  

29 ©2014 DesignMind. All Rights Reserved. Conclusion

30 ©2014 DesignMind. All Rights Reserved.

31 ©2014 DesignMind. All Rights Reserved.  Concluding Remarks  Quick Introduction to Platfora and its abilities - It is a data analytics sandbox that is complimentary to current ETL/Warehouse implementations - Allows data practitioners free range to access and use new data easily  Platfora can do a lot more than shown  Platfora is extensible: - UDFs allow access to almost any Java routine - Data ingestion can be scheduled

32 ©2014 DesignMind. All Rights Reserved. Questions

33 ©2014 DesignMind. All Rights Reserved. www.designmind.com

Add a comment

Related presentations

Presentación que realice en el Evento Nacional de Gobierno Abierto, realizado los ...

In this presentation we will describe our experience developing with a highly dyna...

Presentation to the LITA Forum 7th November 2014 Albuquerque, NM

Un recorrido por los cambios que nos generará el wearabletech en el futuro

Um paralelo entre as novidades & mercado em Wearable Computing e Tecnologias Assis...

Microsoft finally joins the smartwatch and fitness tracker game by introducing the...

Related pages

Platfora - An Analytics Sandbox In A World Of Big Data ...

As Big Data becomes the norm in dealing with data volume, variety, and velocity, it becomes increasingly harder for the Data Analyst to understand and work wit
Read more

Platfora Big Data Analytics | DesignMind

Platfora Big Data Analytics. Platfora gives enterprises blazing fast factual insights across all of their data ... An Analytics Sandbox In A World Of Big Data.
Read more

Platfora: An Analytics Sandbox In A World Of Big Data ...

About. Meet the DesignMind Team; Partners; People; Resources; White Papers + Datasheets; Services. Big Data Consulting Solutions; Business Intelligence ...
Read more

Big Data Discovery | Big Data Analytics | Platfora

Platfora's Big Data Discovery and Analytics ... Answer “how” and “why” with Big Data Discovery that combines data prep, behavioral analytics, ...
Read more

Platfora Big Data Analytics Blog

Platfora big data analytics blog. Platfora puts the power of big data into the ... to simplify how business users derive value from a world of big data.
Read more

What is a Data Sandbox (in Big Data)? - Definition from ...

A data sandbox, in the context of big data, is a scalable and developmental platform used to explore an organization's rich information sets through ...
Read more

Platfora | LinkedIn

Welcome to the Big Data Analytics Party, Oracle. Platfora’s take on Oracle’s Big Data Discovery ... An Analytics Sandbox In A World Of Big Data. 1,641 ...
Read more

Customer Analytics and Risk Management in Financial ...

Big Data Analytics is transforming how banks and financial institutions unlock insights, make more meaningful decisions, ... Sandbox herunterladen ...
Read more