Published on March 13, 2014
Don‘t Leave Your Data in the Dark – Optimize and Simplify Database Performance with DataStax
Today‘s Speaker – Robin Schumacher • VP Products • Database geek/DBA for longer than will admit • Author of many database articles and 3 database books • Has Ph.D., but only makes wife call him ―Dr.‖ • Prior stints directing product lines for MySQL, PostgreSQL, and DB tools for Oracle, SQL Server, Sybase, and DB2 • Husband of one, father of two, forced- against-his-will owner of one horse
Agenda 1. Today‘s Online World 2. How Relational Databases Fail 3. Why Cassandra for Online Applications? 4. Why DataStax Enterprise? 5. Increasing Performance with DataStax Enterprise 4.0 6. Key Takeaways 7. Questions
Always On Data is the new currency Dynamic experience1 2 3 Today‘s Online World
100% Uptime Slow is down Disaster avoidance 1. Always On
75% of executives say that technology failures are rising 11-24 days average time to recover from failure 45% of executives say failures are causing damage to brand or share Application Outages Hurt Business
2. Data is Your Most Valuable Asset
Customer-driven innovation Real-time personalization Agility 3. Dynamic Experience
Always on? Data Scale/Performance? Dynamic Experience? Slower time to market Lack of personalization Modern Architecture? Multi-DC/geo/cloud? No single point failure? High costs Success outgrows database How Relational Databases Fail Loss of revenue Loss of customers Linear performance? Elastically scalable? Simple to manage? Flexible schema? Easy to change online? Support all data types?
What is Apache Cassandra? Cassandra is an open source, NoSQL, distributed database built for modern, mission-critical online applications C A S S A N D R A
Cassandra – Unmatched Uptime, Performance, Flexibility One Application. More than a trillion transactions per day. No problem. (That‘s 10 million transactions every second!) C A S S A N D R A C A S S A N D R A C A S S A N D R A
Messaging Collections/ Product Catalogs Fraud detection Recommendation/ Personalization Internet of things/ Sensor data Common Cassandra Use Cases
Always on Data Scale/Performance Dynamic Experience Fast time to market Easy personalization Modern Architecture? Multi-DC/geo/cloud? No single point failure? Low costs Database > Success Why Cassandra for Online Apps Increase revenue Gain customers Linear performance? Elastically scalable? Simple to manage? Flexible schema? Easy to change online? Support all data types? DataStax Enables Organizations to Build Powerful Online Applications 100% Uptime Faster Time To Market Turn Data into $$
Outbrain Helps People Discover the Most Interesting, Relevant Content Out There • Outbrain delivers 90 billion recommendations on over 10 billion page views per month • Reach of 86% of the online U.S. population • Cassandra is Outbrain‘s primary data store • Cassandra chosen over HBase due to ease in spanning multiple data centers During Hurricane Sandy, we lost an entire data center. Completely. Lost it. Our application fail-over resulted in us losing just a few moments of serving requests for a particular region of the country, but our data in Cassandra never went offline. Always On?
World’s largest online marketplace, eBay uses DSE for fraud detection, messaging, and displaying social data on product pages • DataStax‘s scale out architecture enables eBay to deploy multiple DSE clusters across several different data centers using commodity hardware. 250TBs+ of storage—in DataStax Enterprise clusters; 1 40TB table. • Using DataStax, eBay cost-effectively processes massive amounts of data at very high velocities – 9 billion writes/5 billion reads per day “We have to be ready for disaster recovery all the time. It’s really great that Cassandra allows for active-active multiple data centers where we can read and write data anywhere.” -Jay Patel, Technical Architect at eBay Data Scale/ Performance
Cassandra Delivers Superior Scalability to the Largest Cloud Platform in the World - Netflix • Migrated from Oracle to Cassandra • 95% of data stored on Cassandra including viewing history of its 36 million customers • Super scalability, no single point of failure, 100% uptime Dynamic Experience
DataStax Delivers Cassandra to the Enterprise Expert Support, Consulting, Software Updates, Health Checks Developer IDE and Drivers In Memory Analytics Search Security Management Services Visual Management and Monitoring Tools Certified, Enterprise-Ready Cassandra
RELATIONAL DATABASES CQL SQL Visual management and development tools Various management tools DSE for search and analytics Analytic functions and full-text search Security Security Support, consulting & training 30-year-old ecosystem Legacy RDBMS to Modern NoSQL is Easy Automatic management services Built-in task management DATASTAX
Simplified Management A new, 10-node Cassandra (or Hadoop) cluster with OpsCenter running in 3 minutes…A new, 10-node DSE cluster with OpsCenter running on AWS in 3 minutes… Done1 2 3
New in-memory option built on a production-certified version of Apache Cassandra (2.0) • Enhanced enterprise search • Improved visual management and monitoring What‘s New in DataStax Enterprise 4.0
• Dial-the-performance flexibility based on data • Simple to use; transparent to developers • Ideal for use cases requiring fast writes and low latency (Web, financial, telecom) • Internal testing shows 10-100x improvements Same database cluster In-memory SSD‘s Spinning Disk Fastest response time for low- latency requests Very fast response times for ‗hot‘ data Good performance for large data volumes Increasing Performance with In-Memory
In-Memory for Increased Database Performance
• Certified Cassandra shortens development time • New developer-enabling features • Lightweight transactions • CQL improvements • Enhanced enterprise search delivers high throughput • Certified Solr 4.6 with more developer features • Faster internode communication has low latency even with thousands of concurrent requests Certified Cassandra 2.0 and Enterprise Search
• Improved visual interface to manage and control production workloads • Capacity planning analysis, with enhanced trend analysis and forecasting • Supports multiple data centers • Monitoring of in-memory tables • Enhanced drill-downs for faster troubleshooting on individual nodes Simplified Management with OpsCenter 4.1
“DSE has made the transition to an AWS-based infrastructure much easier than a traditional RDBMS would have.” ―DataStax Enterprise has become one of our highest available systems with the lowest cost of ownership.‖ Build Powerful Online Applications ―We have to be ready for disaster recovery all the time. It‘s really great that Cassandra allows for active-active multiple data centers where we can read and write data anywhere.‖ ―With Cassandra, we get better business agility, and we don‘t have to plan capacity in advance, we don‘t need to ask permission of other people to build things for us, and we don‘t worry about running out of space or power.‖
Founded in April 2010 OUR INVESTORS 400+ customers 25% of the Fortune 100 200+ employees 38 countries worldwide Powering more than 1 trillion transactions a day DATASTAX BY THE NUMBERS About DataStax
The Fastest Way from Data to Online Business Collect, store and manage data on premise or in the Cloud Increase developer productivity with Developer IDE, drivers and tools Visually monitor and automatically manage your environment Search and analyze data to optimize customer experience 100% data availability in a secure environment DataStax scales as your business grows
OS Cassandra DSE Standard DSE Pro DSE Max DATABASE MANAGEMENT / SERVICES Advanced Security Option ✔ ✔ ✔ In-Memory Option ✔ ✔ ✔ Automatic Management Services ✔ ✔ ✔ Enterprise Search ✔ ✔ Batch Analytics ✔ MANAGEMENT OpsCenter Basic Advanced Advanced Advanced SUPPORT AND PROFESSIONAL SERVICES Expert 24x7x365 Support ✔ ✔ ✔ Platform certification ✔ ✔ ✔ Certified service packs ✔ ✔ ✔ Hot fixes ✔ ✔ ✔ Bug escalation ✔ ✔ ✔ Health checks/Performance reviews ✔ ✔ ✔ Custom builds Option DataStax Subscriptions
Key Takeaways 1. New rules for a new online world • Today‘s online applications must be always available, data-driven and dynamic enough to adapt to changing user requirements 2. Relational databases are not built for today’s online applications • Outdated architecture, high costs and lack of agility drive online application failure and negative customer experiences 3. DataStax is the best database choice for today’s online world • Modern architecture for modern applications • Always on, maximum scale/performance, and able to serve dynamic apps • New in-memory speed, easier development, faster search, and simplified visual management
Next Steps 1. Try DataStax Enterprise/OpsCenter/DevCenter for free 2. Visit our knowledge base on Datastax.com for online documentation, tutorials, free online training, tech papers, and more 3. Join our community at PlanetCassandra.org
Questions? Thank You for Attending!
Webinar: Don't leave your data in the dark - Optimize and simplify database performance ... environment to get the most of your online data.
1. Don‘t Leave Your Data in the Dark – Optimize and Simplify Database Performance with DataStax . 2. Today‘s Speaker – Robin Schumacher • VP ...
Cassandra Community Webinar ... Don't like this video? ... Webinar: Don't leave your data in the dark ...
Webinar: Don't Leave Your Data in the Dark. Sonic Summit Stuck In The Past Upload. ABCs of Security in the Cloud Webinar. Building IoT Apps in the Cloud ...
This webinar is not going to reveal any hidden secrets or ... The number one reason your customers leave. ... Don't worry, we won't share your details with ...
Title: Don’t Leave Your Data in the Dark: Optimize and Simplify Database Performance with DataStax; Date: March 11, 2014; Time: 9am PT / 12pm ET / 16:00 GMT
... as well as the security of your data. In this comprehensive webinar, ... ones you don’t ... and firewalls can leave your enterprise open to data ...
Go now Archived Webinars Webinar | Macy’s: Why Your Database Decision Directly. DataStax. DOWNLOAD DATASTAX. ... Webinar | Don’t Leave Your Data in the ...