
Real Time Analytics with Cassandra

Information about Real Time Analytics with Cassandra
Technology

Published on March 15, 2014

Author: vagmi

Source: slideshare.net

Description

A recipe of Acunu style analytics with Cassandra

Real Time Analytics with Cassandra

Vagmi Mudumbai, @vagmi / @reducedata

What is Cassandra?

Based on Dynamo

Built by Facebook

Is both a Key Value Store and a Column Store

The CAP Theorem

Column Families

HashMap<RowKey,SortedMap<ColumnName, Value>>

id  name     email            country
1   Vagmi    me@vagmim.in     IN
2   Karthik  yeskarthik@blah  IN
3   MarkZ    mark@fb          US

Rowkey   1             2                3
name     Vagmi         Karthik          MarkZ
email    me@vagmim.in  yeskarthik@blah  mark@fb
country  IN            IN               US
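The nested-map view of a column family can be made concrete with a small in-memory model. This is only a sketch of the data structure, not of how Cassandra stores data; the `ColumnFamily` class and its `insert` and `get_slice` methods are hypothetical names chosen for illustration.

```python
from bisect import insort


class ColumnFamily:
    """Toy model of HashMap<RowKey, SortedMap<ColumnName, Value>>."""

    def __init__(self):
        # row key -> (sorted list of column names, dict of column name -> value)
        self.rows = {}

    def insert(self, row_key, column, value):
        names, values = self.rows.setdefault(row_key, ([], {}))
        if column not in values:
            insort(names, column)  # keep column names in sorted order
        values[column] = value

    def get_slice(self, row_key, start, end):
        """Return (column, value) pairs with start <= column <= end, in column order."""
        names, values = self.rows.get(row_key, ([], {}))
        return [(name, values[name]) for name in names if start <= name <= end]


users = ColumnFamily()
users.insert("1", "name", "Vagmi")
users.insert("1", "email", "me@vagmim.in")
users.insert("1", "country", "IN")

# Columns come back sorted by name, regardless of insertion order.
print(users.get_slice("1", "a", "f"))
# [('country', 'IN'), ('email', 'me@vagmim.in')]
```

Because the columns within a row are kept sorted, a range of column names can be read as one contiguous slice, which is the property the time-bucketed rows later in the deck rely on.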

The Problem

As a user, I want to view real time metrics and filter by dimensions like time, city, category, etc.

The wrong way

select sum(measure) from events where time between A and B and country='US' and device_platform='Android'

HashMap<RowKey,SortedMap<ColumnName, Value>>

Counters

create column family view_counts_hourly with comparator=UTF8Type and default_validation_class=CounterColumnType and key_validation_class=UTF8Type;

http://reducedata.com/, Chrome, 2014-03-14 15:30:00Z, IP, Cookie-Info

RowKey             20140101  20140102  20140103  20140104  ...  ...  20140628  ...  20150308
sid1#us            2553      2341      2342      3242      ...  ...  32342     ...  33423
sid1#us#chrome     1556      1532      1892      ...       ...  ...  ...       ...  ...
sid1#us#chrome#25  833       899       1200
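One way to read the counter table is that each incoming event increments one counter per row-key prefix, in the column for its day. The sketch below models that in memory, with nested dictionaries standing in for the counter column family. The names `counters`, `row_keys`, `record_view` and `total_views` are hypothetical, and treating "25" as just another dimension value is an assumption, since the slides do not say what it represents.

```python
from collections import defaultdict
from datetime import datetime, timezone

# row key -> {day column -> count}; an in-memory stand-in for a counter column family
counters = defaultdict(lambda: defaultdict(int))


def row_keys(site_id, dimensions):
    """Build one row key per prefix of the dimension list, joined with '#'."""
    keys = []
    parts = [site_id]
    for value in dimensions:
        parts.append(value)
        keys.append("#".join(parts))
    return keys


def record_view(site_id, dimensions, when):
    """Increment the day column of every prefix row for this event."""
    day = when.strftime("%Y%m%d")
    for key in row_keys(site_id, dimensions):
        counters[key][day] += 1


def total_views(key, first_day, last_day):
    """Sum the day columns of one row between two YYYYMMDD bounds, inclusive."""
    columns = counters.get(key, {})
    return sum(count for day, count in columns.items() if first_day <= day <= last_day)


when = datetime(2014, 3, 14, 15, 30, tzinfo=timezone.utc)
record_view("sid1", ["us", "chrome", "25"], when)
record_view("sid1", ["us", "firefox"], when)

print(row_keys("sid1", ["us", "chrome", "25"]))
# ['sid1#us', 'sid1#us#chrome', 'sid1#us#chrome#25']
print(total_views("sid1#us", "20140301", "20140331"))
# 2
```

With this layout a filtered sum becomes a single-row slice over day columns rather than a scan over raw events. The cost is write amplification, and prefix keys only answer filters on the leading dimensions in the order they were concatenated.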

But what about uniques?

Bitmaps to the rescue

1   0   1   0   1   1   0   0   0   1   1   0   1   0   0   1
u1  u2  u3  u4  u5  u6  u7  u8  u9  u10 u11 u12 u13 ... ... ...

UID: 1328abc2838fd283e282

Fast Hash Function: Murmur32

1   0   1   0   1   1   0   0   0   1   1   0   1   0   0   1
u1  u2  u3  u4  u5  u6  u7  u8  u9  u10 u11 u12 u13 ... ... ...
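The idea of hashing a user ID to a bit position and counting set bits can be sketched as follows. Two assumptions are worth stating loudly: `zlib.crc32` is used here only as a standard-library stand-in for the Murmur hash named on the slide, and the bitmap width `BITMAP_BITS` is an arbitrary illustrative value. The function names are hypothetical.

```python
import zlib

BITMAP_BITS = 1 << 16  # assumed bitmap width, chosen only for illustration


def bit_position(uid):
    """Map a user ID to a bit index (crc32 stands in for Murmur here)."""
    return zlib.crc32(uid.encode("utf-8")) % BITMAP_BITS


def mark_seen(bitmap, uid):
    """Return the bitmap with the user's bit set; a Python int serves as the bitmap."""
    return bitmap | (1 << bit_position(uid))


def unique_count(bitmap):
    """Count set bits, i.e. the number of distinct bit positions seen."""
    return bin(bitmap).count("1")


bitmap = 0
for uid in ["1328abc2838fd283e282", "1328abc2838fd283e282"]:
    bitmap = mark_seen(bitmap, uid)

# The same user seen twice still sets only one bit.
print(unique_count(bitmap))
# 1
```

Setting a bit is idempotent, which is what makes this usable for uniques where a plain counter is not. Two different IDs can hash to the same position, so the count is an estimate that can undercount, and the error grows as the bitmap fills.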

RowKey             20140101  20140102  20140103  20140104  ...  ...  20140628  ...  20150308
sid1#us            10101     10111     11100     11101     ...  ...  ...       ...  11101
sid1#us#chrome     ...       ...       ...       ...       ...  ...  ...       ...  ...
sid1#us#chrome#25  10101     11101     11100     ...       ...  ...  ...       ...  ...
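With one bitmap per row per day, uniques over a date range can be computed by OR-ing the daily bitmaps and counting the set bits. The sketch below uses the first four values from the sid1#us row as literal bit patterns; the `daily_bitmaps` dictionary and the `uniques_in_range` function are hypothetical names, and the five-bit width is only as wide as the values shown.

```python
from functools import reduce
from operator import or_

# Day column -> bitmap, taken from the sid1#us row
daily_bitmaps = {
    "20140101": 0b10101,
    "20140102": 0b10111,
    "20140103": 0b11100,
    "20140104": 0b11101,
}


def uniques_in_range(bitmaps, first_day, last_day):
    """OR together the bitmaps for the selected days and count the set bits."""
    selected = [bits for day, bits in bitmaps.items() if first_day <= day <= last_day]
    combined = reduce(or_, selected, 0)
    return bin(combined).count("1")


# Each user is counted once even if they appear on several days.
print(uniques_in_range(daily_bitmaps, "20140101", "20140103"))
# 5

# Summing per-day unique counts instead would count repeat visitors again.
print(sum(bin(bits).count("1") for bits in daily_bitmaps.values()))
# 14
```

This is the reason daily unique counts cannot simply be stored as counters and added up: the union has to be taken at the level of users, which the bitmaps preserve.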

But I do not have Big Data

Oh and we’re hiring (vagmi@reducedata.co)

Thanks @vagmi on Github / Twitter / Facebook
