An introduction to Apache Accumulo

56 %
44 %
Information about An introduction to Apache Accumulo
Photos

Published on September 17, 2013

Author: mikejf12

Source: slideshare.net

Description

A short introduction to Apache Accumulo. What is it and
how does it relate to big table ? How does it use Hadoop,
Zookeeper and Thrift in its implementation ?

Apache Accumulo ● What is it ? ● Design ● Integrity ● Administration ● Squirrel www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Accumulo – What is it ? ● A key / value store ● A column oriented database ● Based on Google's Big Table ● Based on – Apache Hadoop – Apache Zoo Keeper – Apache Thrift ● Written in Java ● Licensed by Apache www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Accumulo – Design ● Has cell level security via column visibility ● Server side programming created via iterators ● Table based constraints written in Java ● Sharding can be used for parallel doc storage ● Large rows can be larger than memory size www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Accumulo – Integrity ● Zookeeper used to manage master fail over ● Write ahead logs written to each server ● Logical time managed for – Consistant transactions – Bulk data import ● Fate transactions ( Fault Tolerant Transactions ) – Transactions complete even after master failure ● Isolation – Transactions see a consistant view of data at row level www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Accumulo – Administration ● System monitoring and stats via web page ● System and table config stored in Zoo Keeper ● Table naming stored in Zoo Keeper via id's ● Follow threads of execution using tracing – Record time actions take place ● Accumulo can be used with Squirrel server – As next slide shows – Future presentation will cover Squirrel www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Accumulo – with Squirrel www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Accumulo – Data Management Internal Data Management ● Locality groups – Group columns within a single file ● Smart compaction – Smaller files merged with larger using definable ratio until all files merged ● Minor compaction – To avoid max files being reached in memory files merged with larger files ● Loading user created jars – Load Jars from HDFS using VFS www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Accumulo – Data Management On Demand Data Management ● Compactions – Force tablets ( table partitions ) to compact to a single file ● Tablet merging – Request tablet merging via shell ● Table cloning – Clone a table from an existing one, reference data / config ● Table import / export – Copy table / meta data to another cluster www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Accumulo – Screen Shot www.semtech-solutions.co.nz info@semtech-solutions.co.nz

Contact Us ● Feel free to contact us at – www.semtech-solutions.co.nz – info@semtech-solutions.co.nz ● We offer IT project consultancy ● We are happy to hear about your problems ● You can just pay for those hours that you need ● To solve your problems

Add a comment

Related presentations

Related pages

Apache Accumulo User Manual Version 1.7

Apache Accumulo User Manual ... Apache Accumulo 1.7. 1. Introduction; 2. Accumulo Design. 2.1 ... and can be found in the org.apache.accumulo.core ...
Read more

An introduction to Apache Accumulo - scribd.com

An introduction to Apache Accumulo - Free download as PDF File (.pdf), Text File (.txt) or view presentation slides online.
Read more

An introduction to Apache Accumulo - YouTube

A short introduction to Apache Accumulo. What is it and how does it relate to big table ? How does it use Hadoop, Zookeeper and Thrift in its ...
Read more

Apache Accumulo User Manual Version 1.6

Apache Accumulo User Manual Version 1.6 Apache ... Apache Accumulo is a highly scalable ... The introduction of multiple volume support in 1 ...
Read more

An introduction to Apache Accumulo - O'Reilly Media Free ...

An introduction to Apache Accumulo. How it Works, Why it Exists, and How it is Used. Date: This event took place live on August 05 2014. Presented by ...
Read more

How To Install Apache Accumulo Database on Ubuntu 14.04 ...

Introduction. Apache Accumulo is an open-source distributed NoSQL database based on Google's BigTable. It is used to efficiently perform CRUD (Create Read ...
Read more

An Introduction to Accumulo, an Apache NoSQL Solution ...

Join the DZone community and get the full member experience. The following 10-minute video will introduce you to Apache Accumulo, a distributed key/value ...
Read more

An introduction to Apache Accumulo - Video Dailymotion

A short introduction to Apache Accumulo. What is it and how does it relate to big table ? How does it use Hadoop, Zookeeper and Thrift in its ...
Read more