An introduction to Apache Falcon

52 %
48 %
Information about An introduction to Apache Falcon

Published on December 26, 2013

Author: mikejf12



A short introduction to Apache Falcon, what is it and what is it used for ?
How can it help with Hadoop based data life cycle management ? What is it's
architecture and what are the benefits of using it ?

Apache Falcon ● What is it ? ● Benefits ● Architecture ● Example

Apache Falcon – What is it ? ● A data life cycle management framework ● Created for Hadoop ● Logic based in Falcon rather than apps ● Simplifies data management ● Developed by InMobi and HortonWorks ● Falcon can manage – Work flows – Replication – Provides data abstraction

Apache Falcon – What is it ? ● Falcon provides services – Data import / replication – Scheduling / coordination – Lifecycle policies – Cluster management – SLA Management ● An enterprise solution for data lifecycle management ● Currently an Apache incubator project

Apache Falcon – Benefits ● Reduce workflow / ETL development time ● Reduce costs ● No need to re implement functionality – – ● Already in Falcon Already tested Use a single Falcon configuration file to – Define replication points – Define data processing pipeline

Apache Falcon – Architecture

Apache Falcon – BI Example ● Falcon used to manage work flow ● Falcon used to manage Cluster data replication ● BI example – Staged and presented data replicated – Presented data visible for Reporting ● Analytics ● See next slide ..... ●

Apache Falcon – BI Example

Contact Us ● Feel free to contact us at – – ● We offer IT project consultancy ● We are happy to hear about your problems ● You can just pay for those hours that you need ● To solve your problems

Add a comment

Related presentations

Presentación que realice en el Evento Nacional de Gobierno Abierto, realizado los ...

In this presentation we will describe our experience developing with a highly dyna...

Presentation to the LITA Forum 7th November 2014 Albuquerque, NM

Un recorrido por los cambios que nos generará el wearabletech en el futuro

Um paralelo entre as novidades & mercado em Wearable Computing e Tecnologias Assis...

Microsoft finally joins the smartwatch and fitness tracker game by introducing the...

Related pages

An introduction to Apache Falcon - YouTube

A short introduction to Apache Falcon, what is it and what is it used for ? How can it help with Hadoop based data life cycle management ? What ...
Read more

Apache Falcon - Hortonworks

A framework for managing data life cycle in Hadoop clusters Apache™ Falcon ... Introduction Apache Falcon ... Apache, Hadoop, Falcon ...
Read more

Data Lab: Introduction to Apache Falcon

Apache Falcon simplifies complicated data management workflows into generalized entity definitions. Falcon makes it far easier to:
Read more

Introduction — Falcon 1.0.0 documentation

Introduction¶ Falcon is a minimalist, ... you agree to also license your source code under the terms of the Apache License, Version 2.0, as described above.
Read more

Mirroring Datasets between Hadoop clusters with Apache ...

Introduction. Apache Falcon is a framework to simplify data pipeline processing and management on Hadoop clusters. It provides data management services ...
Read more

Incremental Backup of Data from HDP to Azure using Falcon ...

Introduction. Apache Falcon simplifies the configuration of data motion with: replication; lifecycle management; lineage and traceability. This provides ...
Read more

Hortonworks Technical Preview for Apache Falcon

Hortonworks Inc. Page 4 Apache!Falcon!Introduction! Apache!Falconprovides!aframeworkfor!simplifyingthe!development!of!data management!applications!in ...
Read more