Data Science 101: Cassandra Tutorial for Beginners


Provided by our friends over at Edureka, Module 1 of their Apache Cassandra course below discusses the fundamental concepts of using a highly-scalable, column-oriented database to implement appropriate use cases.

Who might be interested in learning Cassandra?

  • A developer working with large-scale, high-volume websites.
  • An application architect or data architect who needs to understand the available options for high-performance, decentralized, elastic data stores.
  • A database administrator or database developer currently working with standard relational database systems who needs to understand how to implement a fault-tolerant, eventually consistent data store.
  • A manager who wants to understand the advantages (and disadvantages) of Cassandra and related columnar databases to help make decisions about technology strategy.
  • A student, analyst, or researcher who is designing a project related to Cassandra or other non-relational data store options.

Pre-requisites

The course materials assume no prior knowledge of Apache Cassandra or any other NoSQL database. Some familiarity with the Linux command line is essential, and minimal exposure to Java, database, or data-warehouse concepts is required.

Why Learn Apache Cassandra?

Apache Cassandra™, an Apache Software Foundation project, is an open-source, distributed NoSQL database management system. It was originally developed at Facebook and is used by many companies today. While many developers have embraced simpler NoSQL variants (like MongoDB and CouchDB), Cassandra is arguably at the forefront of NoSQL innovation, providing a level of reliability and fine-tuning not found in many competing offerings. When it comes to scaling, few systems match it; the best-known example is Facebook, which uses Cassandra to store petabytes of data.

Why should you Learn Cassandra?

  • Cassandra is designed to handle big data workloads across multiple data centers with no single point of failure, providing enterprises with extremely high database performance and availability.
  • The world's largest website (i.e., Facebook) runs on Cassandra.
  • Every day, hundreds of start-ups and large product companies choose Cassandra for their next-generation computing and data platforms. Companies using Cassandra include Facebook, Twitter, IBM, Cisco, Rackspace, Netflix, eBay, Reddit, @WalmartLabs, Zoho, and Digg.
  • Apache Cassandra is open-source, which means you can dig into its source code and change it to suit your own requirements.
  • The job market for Apache Cassandra is at its peak and is growing at a rate of 300%!

 

 
