Ozone

Slack Docker Pulls GitHub edit source

This guide describes how to configure Ozone as Alluxio’s under storage system.

Ozone is a scalable, redundant, and distributed object store for Hadoop. Apart from scaling to billions of objects of varying sizes, Ozone can function effectively in containerized environments such as Kubernetes and YARN.

Ozone supports two different schemas. The biggest difference between o3fs and ofs is that o3fs suports operations only at a single bucket, while ofs supports operations across all volumes and buckets and provides a full view of all the volume/buckets. For more information, please read its documentation:

Prerequisites

If you haven’t already, please see Prerequisites before you get started.

In preparation for using Ozone with Alluxio:


Follow the Ozone On Premise Installation to install a Ozone cluster.

Basic Setup

To use Ozone as the UFS of Alluxio root mount point, you need to configure Alluxio to use under storage systems by modifying conf/alluxio-site.properties. If it does not exist, create the configuration file from the template.

$ cp conf/alluxio-site.properties.template conf/alluxio-site.properties

Specify an Ozone bucket and directory as the underfs address by modifying conf/alluxio-site.properties.


Ozone HA Mode

o3fs

To make Alluxio mount Ozone in HA mode, you should configure Alluxio’s server so that it can find the OzoneManager. Please note that once set up, your application using the Alluxio client does not require any special configuration. In HA mode alluxio.dora.client.ufs.root needs to specify <OM_SERVICE_IDS> such as:

alluxio.dora.client.ufs.root=o3fs://<OZONE_BUCKET>.<OZONE_VOLUME>.<OM_SERVICE_IDS>/
alluxio.underfs.hdfs.configuration=/path/to/hdfs/conf/ozone-site.xml

ofs

alluxio.dora.client.ufs.root=ofs://<OZONE_MANAGER>/<OZONE_VOLUME>/<OZONE_BUCKET>/
alluxio.underfs.hdfs.configuration=/path/to/hdfs/conf/ozone-site.xml

<OM_SERVICE_IDS> can be found in ozone-site.xml. In the following example ozone-site.xml file, <OM_SERVICE_IDS> is omservice1:

<property>
    <name>ozone.om.service.ids</name>
    <value>omservice1</value>
</property>

Mount Ozone with Specific Versions

Users can mount an Ozone cluster of a specific version as an under storage into Alluxio namespace. Before mounting Ozone with a specific version, make sure you have built a client with that specific version of Ozone. You can check the existence of this client by going to the lib directory under the Alluxio directory.

When mounting the under storage at the Alluxio root with a specific Ozone version, one can add the following line to the site properties file (conf/alluxio-site.properties).

alluxio.underfs.version=<OZONE_VERSION>

Running Alluxio Locally with Ozone

Once you have configured Alluxio to Ozone, try running Alluxio locally to see that everything works.

Use the HDFS shell or Ozone shell to visit your Ozone directory o3fs://<OZONE_BUCKET>.<OZONE_VOLUME>/<OZONE_DIRECTORY> to verify the files and directories created by Alluxio exist. For this test, you should see files named like

<OZONE_BUCKET>.<OZONE_VOLUME>/<OZONE_DIRECTORY>/default_tests_files/BasicFile_CACHE_PROMOTE_MUST_CACHE

Advanced Setup

Supported Ozone Versions

Currently, the only tested Ozone version with Alluxio is 1.0.0, 1.1.0, 1.2.1.

Contributed by the Alluxio Community

Ozone UFS integration is contributed and maintained by the Alluxio community. The source code is located here. Feel free submit pull requests to improve the integration and update the documentation here if any information is missing or out of date.