Cassandra is one the popular and robust distributed database management system. It is known for providing high availability with no single point of failure. It has one of the awesome feature that is asynchronous replication between multiple nodes without requiring master nodes.

While working on Cassandra cluster, I was introduced with terms called Data Center where multiple nodes are member of it and atleast one member should be act as see.I was glad to see how asynchronously data was replicating across all nodes. So, we will discuss on clustering of cassandra in our upcoming post and also touch the area of backup and restore.

As a first step , in this post we will learn how to install Cassandra on CentOS 7 / RHEL 7 .

Requirements :

1. Cassandra version 2.x . (Here, x can be any value as per current stable release version)
2. Stable version of Java 7 or 8
3. Installed CentOS 7 / RHEL 7 on system.

Install Java

First install the Java on CentOS 7 / RHEL 7. Download the latest stable JDK package from oracle website.
Download the rpm file as per the architecture of your operating system.

We have download currently available stable jdk-8u45-linux-x64.rpm package. May be when you read this article there will be new stable release available, hence download the JDK package.

Install the jdk rpm package by using below given command.

rpm -ivh jdk-8u45-linux-x64.rpm

In case you have multiple Java installed in your system, you can set java environment of your selective java version. We have already written installation of Java and setting environment in our post. You can read from this link.

Check Java version installed on your CentOS 7 / RHEL 7 system

To check java version , use command below command. As per requirement of Cassandra 2.x installation it should be above java 7.x version.

java -version

Two methods to install Cassandra on CentOS 7 / RHEL 7

Use only one method as per you convenience. Do not opt for both method, follow only one method. Only because of tutorial purpose we are mentioning two alternate methods.

Method 1: By using source code – For this simply untar the source ball and start using it.

Method 2: By using yum command – By using yum command we will use datastax provided packages.

Method 1: By using source code

Download the latest stable release code from Apache foundation’s Cassandra site.

We are downloading current stable release version. Always use current stable release version.
Install wget utility in case not available in your system.

yum install wget 

wget -c http://apache.mirrors.hoobly.com/cassandra/2.0.14/apache-cassandra-2.0.14-bin.tar.gz

TIP: To make continue downloading we are using -c option with wget command.

Extract out the downloaded package. We are directly extracting to /opt

tar -xvzf apache-cassandra-2.0.14-bin.tar.gz -C /opt

Change to /opt and do ls command, you will see the extracted out package name

[root@localhost ~]# cd /opt/
[root@localhost opt]# ls
apache-cassandra-2.0.14
[root@localhost opt]#

Now you are ready to use the Cassandra. As per our practical we have extracted Cassandra package to /opt . Hence, absolute path is /opt/apache-cassandra-2.0.14/, it may be different in your case as per where you extracted and version of Cassandra source ball.

Installed Directory : /opt/apache-cassandra-2.0.14
Configuration Directory : /opt/apache-cassandra-2.0.14/conf/
Bin Directory for commands : /opt/apache-cassandra-2.0.14/bin/
Data Directory (Default): /var/lib/cassandra . (You can change it from conf/cassandra.yaml file)
Log Directory: /var/log/cassandra/

Start Cassandra (From Source Code)

# Here, we are defining variable CASSANDRA_PATH to store value of absolute path for Cassandra
CASSANDRA_PATH=/opt/apache-cassandra-2.0.14

# now run below given command, it help to run the Cassandra in background. For foreground process use -f option. The -p option is for creating pid file
$CASSANDRA_PATH/bin/cassandra -p $CASSANDRA_PATH/cassandra.pid

Once you run the Cassandra, it will by default create cassandra data directory and its path is /var/lib/cassandra/ . It has all data of keyspaces.

Stop Cassandra (From Source Code)

Because we have started Cassandra with -p option, which will create cassandra.pid file. It will helps us to quickly kill the cassandra running process.

CASSANDRA_PATH=/opt/apache-cassandra-2.0.14
kill -9  $(cat $CASSANDRA_PATH/cassandra.pid)

If you are advance user, you can use ps -ef|grep cassandra , to get the pid no. of running cassandra process. And kill the process with pid no.

Login into cqlsh

To login into Cassandra console or cqlsh, use the following command. In other words it is cassandra terminal, where you can create keyspaces, tables or in other words you can do database work.

CASSANDRA_PATH=/opt/apache-cassandra-2.0.14
$CASSANDRA_PATH/bin/cqlsh

See the below given reference, from our system. I also run the DESC KEYSPACES query.

[root@localhost ~]# $CASSANDRA_PATH/bin/cqlsh
Connected to Test Cluster at localhost:9160.
[cqlsh 4.1.1 | Cassandra 2.0.14 | CQL spec 3.1.1 | Thrift protocol 19.39.0]
Use HELP for help.
cqlsh> 
cqlsh> desc keyspaces;

system  system_traces

cqlsh> exit

Install Cassandra on CentOS 7 / RHEL 7 using yum command

You can also install the Cassandra with the help of yum command. Datastax provides the community Cassandra package.
(Note : Datastax known for providing enterprise level of Apache Cassandra sell and support. The datastax is not endorsed from Apache foundation)

Now we are going to install Cassandra 2.x version with yum command. Installation of Java is prerequisite, hence it should be installed.

Create yum repo file for Cassandra

Create a yum repo file which will get the packages from Datastax yum repository server.

vi /etc/yum.repos.d/cassandra.repo

Paste below given content in repo file.

[datastax] 
name = DataStax Repo for Apache Cassandra
baseurl = http://rpm.datastax.com/community
enabled = 1
gpgcheck = 0

Save and exit

Installing Cassandra with yum command

Now run yum command to install Cassandra 2.x on CentOS 7 / RHEL 7.

yum install dsc20 cassandra2

Once the all package are installed. You are ready to use Cassandra.
Given below are brief details of Cassandra directories.
Configuration Directory : /etc/cassandra/
Data Directory : /var/lib/cassandra
Log Directory : /var/log/cassandra/

Start/stop/status/enable cassandra with systemctl command

To start cassandra service:

systemctl start cassandra

To restart cassandra service:

systemctl restart cassandra

To stop cassandra service:

systemctl stop cassandra

To enable cassandra service to start at booting time:

systemctl enable cassandra

To disable cassandra service to do not start at booting time:

systemctl disable cassandra

To check running status of cassandra:

systemctl status cassandra

Connecting to cqlsh

To connect to cassandra terminal, use the command

cqlsh

See below given reference from our system.

[root@localhost ~]# cqlsh 
Connected to Test Cluster at localhost:9160.
[cqlsh 4.1.1 | Cassandra 2.0.14 | CQL spec 3.1.1 | Thrift protocol 19.39.0]
Use HELP for help.
cqlsh> 

Now your Cassandra is installed and ready to use.

Read Some More Articles

3 Comments

  1. Hi.

    Good article – helped me install Cassandra on CentOS 7.2. I tried running the docker container to no avail.

    I also need to point out that the yum command to install is now:
    yum install dsc20 cassandra20

    Please also prefix “Install Cassandra on CentOS 7 / RHEL 7 using yum command” with “Method 2:” for consistency sake.

    Awesome article nonetheless.

    1. Hello Anthony,

      Thank you for sharing your feedback,your observations and suggestion will surely help others.
      I am working in docker too but not written any post in blog.

      Regards
      Sharad

  2. Your tutorial is great, but I was searching for a tutorial on how to install Cassandra on a Debian server. Luckily, I found this blog post: ‘example.com’ and completed the installation.
    Thanks again.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.