Big Data Warehousing Meetup: Cloudera Navigator

31 %
69 %
Information about Big Data Warehousing Meetup: Cloudera Navigator

Published on February 18, 2014

Author: CasertaConcepts



In our recent Big Data Warehousing Meetup, we discussed Data Governance, Compliance and Security in Hadoop.

As the Big Data paradigm becomes more commonplace, we must apply enterprise-grade governance capabilities for critical data that is highly regulated and adhere to stringent compliance requirements. Caserta and Cloudera shared techniques and tools that enables data governance, compliance and security on Big Data.

For more information, visit

Cloudera Navigator Patrick Angeles 1

Why You Need Cloudera Navigator 1 2 Many Users Working with the Data 3 2 Lots of Data Landing in Cloudera Enterprise Need to Effectively Control & Consume Data  Huge quantities  Many different sources – structured & unstructured  Varying levels of sensitivity  Administrators & compliance officers  Analysts & data scientists  Business users  Get visibility & control over the environment  Discover and explore data

Cloudera Navigator Data Management Layer for Cloudera Enterprise Audit & Access Control Ensuring appropriate permissions & reporting on data access for compliance CLOUDERA NAVIGATOR Audit & Access Control Discovery & Exploration Finding out what data is available and what it looks like Discovery & Exploration Lineage Lifecycle Mgmt. Enterprise Metadata Repository  Business metadata  Lineage metadata  Operational metadata Lineage Tracing data back to its original source CDH Lifecycle Management Migration of data based on policies 3 HDFS HBASE HIVE

Cloudera Navigator 1.0 Data Audit & Access Control Verify Permissions View which users and groups have access to files and directories IAM / LDAP SYSTEM Audit Configuration Configuration of audit tracking for HDFS, HBase and Hive Audit Dashboard Simple, queryable interface to view data access Information Export Export audit information for integration with SIEM tools 4 CLOUDERA NAVIGATOR 1.0 ACCESS SERVICE AUDIT LOG SERVICE VIEW PERMISSIONS HDFS AUDIT LOG CONFIG AUDIT LOG COLLECTION HBASE 3rd PARTY SIEM / GRC SYSTEM HIVE

Benefits of Cloudera Navigator 1.0 Control Visibility  Verify access permissions to files & directories  Report on data access by user and type Integration 5  Store sensitive data  Maintain full audit history  The first & only centralized audit tool for Hadoop  View permissions for LDAP/IAM users  Export audit data for integration with 3rd party SIEM tools

Navigator Subscription Data Management Layer for Hadoop Centralized audit management & access control 8x5 or 24x7 support CLOUDERA SUPPORT CLOUDERA NAVIGATOR CLOUDERA MANAGER CORE PROJECTS CLOUDERA MGR CLOUDERA NVGTR DATA AUDIT BASIC FEATURES IMPALA SEARCH ACCESS MGMT ADVANCED FEATURES CDH Optional add-on to Cloudera Enterprise subscription HBASE BACKUP & DR HBASE CORE PROJECTS IMPALA SEARCH Cloudera Enterprise 6 Navigator Subscription

Navigator 2.0 – Q1 2014 • Manage and explore your data with Cloudera Navigator 2.0 (Q1 2014) • • • Data Discovery (what data do we have?), Annotations/Tags Search, explore, define, and tag data sets. Important for: • • • • DBAs/Data Modelers Self-Service Business Analysts Data Scientists Data Lineage (where did the data come from? where is it used?) For files and tables, MR jobs, Hive queries, Impala queries, Pig scripts, Sqoop load/export. • Important for: • Risk and compliance audits. BI users facing 10K tables in HDFS. Which ones are relevant to the source data I need, or the table I’m looking at? • Data retention policies, where you need to purge not just the source data, but any data that’s been derived from it. • • 7

Navigator 2.0 - Lineage • • • • 8 Audit data access Verify access privileges Search meta data Visualize lineage



Add a comment

Related presentations

Presentación que realice en el Evento Nacional de Gobierno Abierto, realizado los ...

In this presentation we will describe our experience developing with a highly dyna...

Presentation to the LITA Forum 7th November 2014 Albuquerque, NM

Un recorrido por los cambios que nos generará el wearabletech en el futuro

Um paralelo entre as novidades & mercado em Wearable Computing e Tecnologias Assis...

Microsoft finally joins the smartwatch and fitness tracker game by introducing the...

Related pages

Boston Cloudera User Group -

Apache Kudu is a fast new columnar data store for the Hadoop ... Find a Meetup Group Start a ... Boston Cloudera User Group. Home; Members; Sponsors Photos;
Read more

Midwest Cloudera User Group (Chicago, IL) - Meetup

Start a new Meetup Group ... around the Cloudera Big Data platform and components including Cloudera Impala, Cloudera Manager, Cloudera Navigator, ...
Read more

Hadoop SQL Archives - Cloudera Engineering Blog

Cloudera Engineering Blog. Best ... for BI and SQL analytics on modern big data ... graduation_meetup. Cloudera hosted the Apache Sqoop ...
Read more

BDM Meeting | Big Data Montreal

... // ... Data Lineage – Overview of Cloudera Navigator, ... Full presentation on Big Data Warehousing at Shopify by ...
Read more

Big Data Montreal

... // ... Big ... Data Lineage – Overview of Cloudera Navigator, ...
Read more

Where to Find Cloudera Tech Talks (Through March 2014 ...

Cloudera Engineering Blog. Best practices ... update about where to find tech talks by Cloudera ... by to assist your meetup by providing ...
Read more


Sitemap. Sitemap. Sitemap × |. | ...
Read more