Rook: The Storage Orchestrator
August 3, 2020 | Words by James Villareal
What is Rook?
Rook is a cloud-native, open-source storage orchestration platform for Kubernetes environments. It provides a way to automate the tasks of a storage administrator (deployment, provisioning, scaling migration, disaster recovery, monitoring and rescue management). You can use it for file, block, and object storage. It’s an incubating project of Cloud-Native Foundation.
Rook works with several storage providers
Ceph and EdgeFS are the most stable within the list of storage providers.
- CEPH (Stable): Ceph is a highly scalable distributed storage solution for block storage, object storage, and shared filesystems with years of production deployments.
- EdgeFS (Stable): EdgeFS is high-performance and fault-tolerant decentralized data fabric with access to object, file, NoSQL and block.
- CockroachDB (Alpha): CockroachDB is a cloud-native SQL database for building global, scalable cloud services that can survive disasters.
- Cassandra (Alpha): Cassandra is a highly available NoSQL database featuring lightning fast performance, tunable consistency, and massive scalability.
- NFS (Alpha): NFS allows remote hosts to mount filesystems over a network and interact with those filesystems as though they are mounted locally.
- Yugabyte DB (Alpha): YugaByteDB is a high-performance, cloud-native distributed SQL database which can recover from disk, node, zone, and region failures automatically.
One of the main benefits of Rook is that it interacts with data storage through Kubernetes via configuration files. You will no longer need to manually configure storage providers using direct commands.
For example, if you would like to create CephFS in a cluster, all you would have to do is create a yaml file.
- You can automate provisioning of storage in Kubernetes
- It eliminates running manual configuration commands for creating storage
- You can run software distributed systems (SDS) (such as Ceph) on top of Kubernetes
- Rook can work with several storage providers. Some are in alpha stage but Ceph and EdgeFS are stable
- Health checks for MONs (ceph-mon is the cluster monitor daemon for the Ceph distributed file system) with automatic failover
- Simple management of Ceph clusters, pools, and filesystem through Kubernetes objects
- Offers storage selections in one central place
Why is it needed?
- Simple and reliable automated resource management
- Hyper-scale or hyper-converge your storage clusters
- Efficiently distribute and replicate data to minimize loss
- Provision, file, block, and object with multiple storage providers
- Manage open-source storage technologies
- Easily enable elastic storage in your datacenter
- It is open-source software released under the Apache 2.0 license
- Optimize workloads on commodity hardware
Prerequisites for CephFS
You will need at least one unformatted disk.
- Raw device (no partitions or formatted file systems)
RAW Partition (no formatted filesystems)
Rook is made up of several pre-defined yaml files for each storage provider. The best way to run Rook is by cloning the git repository.
Getting the repository:
$ git clone https://github.com/rook/rook.git
Once you have cloned the repository, you can
cd into the Ceph directory to run all the pods needed for Rook.
$ cd rook/cluster/examples/kubernetes/ceph
First we need common.yaml , operator.yaml , cluster.yaml, and the toolbox.yaml.
This will basically create everything that is required by the Rook operator and the Ceph-Cluster. It provides:
$ kubectl create -f common.yaml
This deploys the operator. It provides:
kubectl create -f operator.yaml
This will take a few minutes. It provides:
kubectl create -f cluster.yaml
This will deploy a Pod which can we use to check the integrity of the cluster for example the Ceph health status. It provides:
- Health Monitoring
kubectl create -f toolbox.yaml ceph status
A storage class is also needed.
You can use the predefined storageclass.yaml file located at
kubectl create –f storageclass.yaml
Example PVC Claim for Block Storage:
apiVersion: v1 kind: PersistantVolumeClaim metadata: name: rook-ceph-block-pvc spec: storageClassName: rook-ceph-block accessModes: - ReadWriteOnce resources: requests: storage: 1Gi
A shared filesystem can be mounted with read/write permission from multiple pods. This may be useful for applications which can be clustered using a shared filesystem. To run a shared filesystem, we will need a storageclass.yaml and filesystem.yaml.
Create the filesystem by specifying the desired settings for the metadata pool, data pools, and metadata server in the CephFilesystem CRD. In this example we create the metadata pool with replication of three and a single data pool with replication of three. For more options, see the documentation on creating shared filesystems. This config file provides:
kubectl create -f rook/cluster/examples/kubernetes/ceph/csi/cephfs/filesystem.yaml
Before Rook can start provisioning storage, a StorageClass needs to be created based on the filesystem. This is needed for Kubernetes to interoperate with the CSI driver to create persistent volumes. This config file provides:
kubectl create -f csi/cephfs/storageclass.yaml
Verify a successful deployment
You can verify the status of CEPH by running a command on the rook-ceph-tool container.
kubectl exec -n rook-ceph rook-ceph-tools-<randomID> ceph status
Example PVC claim for CephFS
apiVersion: v1 kind: PersistentVolumeClaim metadata: name: cephfs-pvc spec: accessModes: - ReadWriteMany resources: requests: storage: 1Gi storageClassName: rook-cephfs
I mentioned the steps to get Rook running on a Kubernetes Cluster. You can customize to fit your needs and scale to your environment.
Should I run my database in a container?
The consensus says it’s usually not recommended to run a DB within a K8 cluster container. There are many factors to think about when deciding to run a DB within a container.
Scaling the database/data Store is one of the primary challenges we face when the workload is dynamic. Most data stores have Read Scalability, but what about Write Scalability? Most RDBMSs are yet to have the Write Scalability option as of today.
Ease of Installation and configuration
How many times do you really need to install database/data store on a production setup in a month? The answers are usually zero or at max a single digit. It’s pretty easy to get a container with a base image up and running and the post-install can be accomplished with an automation tool (ie. Ansible).
Rollup upgrade and patching
How many times do you usually update a db within a year? The answer is zero or maybe a few time a year.
There is a theory that running database/data store in container improves performance. Database performance mostly comes from I/O and Memory. If you are assigning an external volume to a container as the data store, you depend on persistence storage APIs like Ceph. Database vendors recommend local SSD for better performance.
It’s possible to save associated database licensing and VM costs when you use containers in database. Many RDBMS/Data store vendors still do not have a clear policy on licensing for usage of databases in Containers. For most databases, cost is based on underlying cpu cores. If this is the case, I don’t see much savings in going the container route.
This is where the container route is above the rest. You can just ship your container images with complete environment configuration to another environment very easily.
There are also some key challenges while running databases in containers such as persistent storage, maintenance of your DevOps environment along with your database maintenance, container security, skills (you need DevOps engineers along with DBAs or the DBAs need to re-skill).
So it appears running a DB within a container does have it’s advantages. You do get some benefits from the automation Kubernetes provides to keep the database application running. That said, it is important to remember that pods are transient, so the likelihood of database application restarts or failovers is higher. Also, some of the more database-specific administrative tasks—backups, scaling, tuning, etc.—are different due to the added abstractions that come with containerization.
Here is an example criteria for setting up a database environment:
- save data safety for 10 years or even more (important data)
- handle 100TB of data or more
- high availability
- scalability, add more storage without downtime and easy on adding new storage
- write operation are more frequent than read operations
DB storage comparison: Ceph or Cassandra
Here is a comparison of the two with pros and cons.
- + Very fast at writing operation which using memtable for improving writing operations
- + Easy setup on new nodes (without change any configurations)
- + Just like other databases, not much effort to connect to cluster from application
- + Monitor via JConsole
- + High availability
- - Hard to setup in the first place(That’s debatable, depends on who you’re asking)
- - Each node has different configurations. Admin has to register new nodes to cluster manually (ceph-deploy or automation tools)
- + Connect to storage via RGW API or natively with librados
- + Multi osds(object storage daemon)/host
- + Very good monitoring metrics about cluster
- + High availability
Both can save data for 10 years or more and can handle more than 100TB or more. Both can be configure for high availability. Ceph is built for storage and Cassandra is built to be a key/value database. Ceph is a storage and is highly scalable. OSD’s can be added on the fly and without much work. Cassandra is build specifically for Databases. After researching Cassandra, it appears to be faster in the Database realm.
In this article I used a Rook Operator to deploy all the processes needed to run a Ceph cluster within Kubernetes. I also gave a comparison on running a Database on Ceph and Cassandra. You can use the storage solutions provided by Ceph (object, block, filesystem) to persist the data of your workload in a safe way.
All that Rook requires to run is a Kubernetes cluster where the storage is to be configured. Rook’s goal is to make your life easier by simplifying storage configuration. Kubernetes has enabled applications to be cloud native, but application storage has lacked cloud native features from the start. Rook, in conjunction with Ceph and other storage systems, bridges that divide. What Kubernetes has done for streamlining deployment of containerized applications, Rook is doing for storage, bringing dynamic data storage orchestration features to modern data centers. Rook reduces the effort required of a data center’s storage team. The main benefits are health checks for MONs with automatic failover, simple management of Ceph clusters, pools, filesystem and RGW through Kubernetes objects as well as offering storage selection in one central place. Rook and Ceph can help teams build cloud native solutions for storage. Now you can use the storage solutions provided by Ceph (object, block, filesystem) to persist the data of your workload in a safe way.