Advanced Storage Options
This section introduces the multi-tier storage feature unique to TDengine Enterprise, which stores recent, frequently accessed data on high-speed media and old, infrequently accessed data on low-cost media, achieving the following objectives:
- Reduce storage costs -- Tiering data allows massive amounts of extremely cold data to be stored on cheap media, bringing significant economic benefits
- Improve write performance -- Each storage level supports multiple mount points, and the WAL (write-ahead log) can likewise be written in parallel across multiple level 0 mount points. This greatly improves write performance (tested at sustained writes of over 300 million data points per second) and achieves very high disk I/O throughput even on mechanical hard drives (tested at up to 2 GB/s)
- Easy maintenance -- Once the storage mount points for each level are configured, tasks such as data migration are performed automatically by the system without manual intervention, and storage expansion is more flexible and convenient
- Transparent to SQL -- Whether or not the queried data spans storage levels, a single SQL statement returns all of it, simple and efficient
The storage media involved in multi-tier storage are all local storage devices. In addition to local storage devices, TDengine Enterprise also supports using object storage (S3) to keep the coldest data on the cheapest media, further reducing storage costs while still allowing that data to be queried when necessary; where the data is stored remains transparent to SQL. Support for object storage was first released in version 3.3.0.0, and using the latest version is recommended.
Multi-Tier Storage
Configuration Method
Multi-tier storage supports 3 levels, with up to 128 mount points per level.
Tips Typical configuration schemes include: level 0 configured with multiple mount points, each corresponding to a single SAS hard drive; level 1 configured with multiple mount points, each corresponding to one or more SATA hard drives; level 2 configured with S3 storage or other inexpensive network storage.
The configuration method for TDengine multi-tier storage is as follows (in the configuration file /etc/taos/taos.cfg):
dataDir [path] <level> <primary>
- path: The folder path of the mount point.
- level: The storage medium level; valid values are 0, 1, and 2. Level 0 stores the newest data, level 1 the next newest, and level 2 the oldest; if omitted, the default is 0. Data flows between storage levels in the direction level 0 -> level 1 -> level 2. Multiple hard drives can be mounted at the same storage level, and data files at that level are distributed across all of its drives. Note that movement of data across storage levels is performed automatically by the system and requires no user intervention.
- primary: Whether this is the primary mount point, 0 (no) or 1 (yes); if omitted, the default is 1. Only one primary mount point is allowed in the configuration (level=0, primary=1). For example, the following configuration can be used:
dataDir /mnt/data1 0 1
dataDir /mnt/data2 0 0
dataDir /mnt/data3 1 0
dataDir /mnt/data4 1 0
dataDir /mnt/data5 2 0
dataDir /mnt/data6 2 0
Note
1. Multi-tier storage does not allow skipping levels. Valid configurations are: level 0 only; level 0 + level 1; and level 0 + level 1 + level 2. Configuring only level 0 and level 2 while omitting level 1 is not allowed.
2. Manually removing a mount disk that is in use is forbidden, and mount points currently do not support remote network disks.
Load Balancing
In multi-tier storage there is only one primary mount point, which holds the most important metadata in the system. The main directories of all vnodes on a dnode are also located on that dnode's primary mount point, which limits the dnode's write performance to the I/O throughput of a single disk.
Starting from TDengine 3.1.0.0, if a dnode is configured with multiple level 0 mount points, we distribute the main directories of all vnodes on that dnode evenly across all level 0 mount points, allowing these level 0 mount points to share the write load.
When network I/O and other processing resources are not the bottleneck, test results with an optimized cluster configuration show that the system's overall write capability scales linearly with the number of level 0 mount points: as level 0 mount points are added, write capability grows proportionally.
Same-Level Mount Point Selection Strategy
Generally, when TDengine needs to select a mount point from the same level to create a new data file, it uses a round-robin strategy for selection. However, in reality, each disk may have different capacities, or the same capacity but different amounts of data written, leading to an imbalance in available space on each disk. In practice, this may result in selecting a disk with very little remaining space.
To address this issue, a new configuration item, minDiskFreeSize, was introduced in version 3.1.1.0. When the available space on a disk is less than or equal to this threshold, that disk is no longer selected for generating new data files. The unit of this configuration item is bytes, and its value must be greater than 2 GB; mount points with less than 2 GB of available space are always skipped.
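For example, assuming the threshold is given in bytes as described above, the following taos.cfg line (value illustrative) would skip any mount point with 10 GB or less of free space when new data files are created:
minDiskFreeSize 10737418240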
Starting from version 3.3.2.0, a new configuration item, disable_create_new_file, has been introduced to prohibit the generation of new files on a given mount point. The default value is false, which means new files can be generated on every mount point by default.
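As an illustration only -- assuming the flag is supplied as an additional field appended to the dataDir entry, extending the format shown earlier -- the following would stop new data files from being created on the level 2 mount point /mnt/data6:
dataDir /mnt/data6 2 0 1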
Object Storage
This section describes how to use S3 object storage in TDengine Enterprise. This feature is based on the generic S3 SDK, with compatibility adaptations for various S3 platforms, allowing access to object storage services such as MinIO, Tencent Cloud COS, and Amazon S3. With the appropriate parameters configured, most of the colder time-series data can be stored in an S3 service.
Note When used in conjunction with multi-tier storage, data saved on each storage medium may, according to the configured rules, be backed up to remote object storage, with the local data files then deleted.
Configuration Method
In the configuration file /etc/taos/taos.cfg, add the following parameters for S3 access:
Parameter Name | Description |
---|---|
s3EndPoint | The COS service domain name in the user's region. Supports http and https; the bucket's region must match the endpoint's, otherwise access is not possible. |
s3AccessKey | Colon-separated user SecretId:SecretKey. For example: AKIDsQmwsfKxTo2A6nGVXZN0UlofKn6JRRSJ:lIdoy99ygEacU7iHfogaN2Xq0yumSm1E |
s3BucketName | Bucket name. The part after the hyphen is the AppId registered for the COS service. The AppId is specific to COS (AWS and Alibaba Cloud do not have one) and must be included as part of the bucket name, separated by a hyphen. Parameter values are all string types but need no quotes. For example: test0711-1309024725 |
s3UploadDelaySec | How long a data file must remain unchanged before being uploaded to S3, in seconds. Minimum: 1; maximum: 2592000 (30 days). Default: 60 |
s3PageCacheSize | Number of S3 page cache pages, in pages. Minimum: 4; maximum: 1024*1024*1024. Default: 4096 |
s3MigrateIntervalSec | The trigger cycle for automatic upload of local data files to S3, in seconds. Minimum: 600; maximum: 100000. Default: 3600 |
s3MigrateEnabled | Whether to perform S3 migration automatically. Default: 0, meaning automatic migration is off; set to 1 to enable it. |
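Putting these parameters together, a taos.cfg snippet might look like the following (the endpoint is an illustrative Tencent Cloud COS endpoint, and the access key and bucket name are the example values from the table above; substitute your own):
s3EndPoint https://cos.ap-beijing.myqcloud.com
s3AccessKey AKIDsQmwsfKxTo2A6nGVXZN0UlofKn6JRRSJ:lIdoy99ygEacU7iHfogaN2Xq0yumSm1E
s3BucketName test0711-1309024725
s3UploadDelaySec 60
s3MigrateIntervalSec 3600
s3MigrateEnabled 1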
Check Configuration Parameter Availability
After S3 has been configured in taos.cfg, the availability of the configured S3 service can be checked by running the taosd command with the checks3 parameter:
taosd --checks3
If the configured S3 service is inaccessible, this command outputs the corresponding error information.
Create a DB Using S3
After configuration, you can start the TDengine cluster and create a database using S3, for example:
create database demo_db duration 1d s3_keeplocal 3d;
After time-series data is written into the database demo_db, data older than 3 days will automatically be segmented and stored in S3 storage.
By default, the mnode issues an S3 data migration check command every hour. If there is time-series data that needs to be uploaded, it is automatically segmented and stored in S3 storage. The migration can also be triggered manually with the following SQL command:
s3migrate database <db_name>;
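For example, to trigger migration for the demo_db database created above:
s3migrate database demo_db;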
Detailed DB parameters are shown in the table below:
# | Parameter | Default | Min | Max | Description |
---|---|---|---|---|---|
1 | s3_keeplocal | 365 | 1 | 365000 | The number of days data is kept locally, i.e., how long data files are retained on local disks before becoming eligible for upload to S3. Default unit: days; also supports m (minutes), h (hours), and d (days) |
2 | s3_chunkpages | 131072 | 131072 | 1048576 | The size threshold for uploaded objects, in TSDB pages. Like tsdb_pagesize, it cannot be modified after the database is created |
3 | s3_compact | 1 | 0 | 1 | Whether to automatically perform a compact operation when TSDB data files are first uploaded to S3 |
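For example, the following statement (with illustrative values and a hypothetical database name) creates a database whose data files become eligible for S3 upload after 7 days, using the default chunk size and automatic compaction on first upload:
create database demo_db2 duration 1d s3_keeplocal 7d s3_chunkpages 131072 s3_compact 1;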
Estimation of Read and Write Operations for Object Storage
The cost of using object storage services is related to the amount of data stored and the number of requests. Below, we discuss the processes of data upload and download separately.
Data Upload
When TSDB time-series data exceeds the time specified by the s3_keeplocal parameter, the related data files are split into multiple file blocks, each with a default size of 512 MB (s3_chunkpages * tsdb_pagesize). Except for the last file block, which is retained on the local file system, the remaining file blocks are uploaded to the object storage service.
Upload Count = Data File Size / (s3_chunkpages * tsdb_pagesize) - 1
When creating a database, you can adjust the size of each file block through the s3_chunkpages parameter, thereby controlling the number of uploads for each data file.
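As a worked example under the default settings (s3_chunkpages = 131072, tsdb_pagesize = 4 KB, so each file block is 512 MB), a 2 GB data file is split into 2048 MB / 512 MB = 4 blocks; the last block stays on the local file system, so Upload Count = 4 - 1 = 3.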
Other types of files such as head, stt, sma, etc., are retained on the local file system to speed up pre-computed related queries.
Data Download
During query operations, if data in object storage needs to be accessed, TSDB does not download the entire data file. Instead, it calculates the position of the required data within the file and downloads only the relevant data into the TSDB page cache before returning it to the query execution engine. Subsequent queries check the page cache first; if the data is already cached, it is used directly, effectively reducing the number of downloads from object storage.
Multiple adjacent data pages are downloaded from object storage as a single data block to reduce the number of downloads. The size of each data page is specified by the tsdb_pagesize parameter when the database is created, with a default of 4 KB.
Download Count = Number of Data Blocks Needed for Query - Number of Cached Data Blocks
The page cache is a memory cache, so data must be re-downloaded after a node restart. The cache uses an LRU (Least Recently Used) eviction strategy: when cache space runs out, the least recently used data is evicted. The size of the cache can be adjusted through the s3PageCacheSize parameter; generally, the larger the cache, the fewer the downloads.
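For instance, to quadruple the default page cache of 4096 pages and reduce repeated downloads for frequently queried cold data, one might set the following in taos.cfg (the value is illustrative):
s3PageCacheSize 16384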
Azure Blob Storage
This section describes how to use Microsoft Azure Blob object storage in TDengine Enterprise. This feature is an extension of the 'Object Storage' feature discussed in the previous section and additionally depends on the S3 gateway provided by the Flexify service. With proper parameter configuration, most of the colder time-series data can be stored in the Azure Blob service.
Flexify Service
Flexify is an application in the Azure Marketplace that allows S3-compatible applications to store data in Azure Blob Storage through the standard S3 API. Multiple Flexify services can be used to establish multiple S3 gateways for the same Blob storage.
For deployment methods, please refer to the Flexify application page.
Configuration Method
In the configuration file /etc/taos/taos.cfg, add parameters for S3 access:
s3EndPoint http://20.191.157.23,http://20.191.157.24,http://20.191.157.25
s3AccessKey FLIOMMNL0:uhRNdeZMLD4wo,ABCIOMMN:uhRNdeZMD4wog,DEFOMMNL049ba:uhRNdeZMLD4wogXd
s3BucketName td-test
- Multiple items can be configured for s3EndPoint and s3AccessKey, but their counts must match; separate multiple items with ','. Only one item can be configured for s3BucketName
- Each {s3EndPoint, s3AccessKey} pair is considered to correspond to one S3 service; one service is selected at random each time an S3 request is initiated
- All S3 services are assumed to point to the same data source, and operations on the various S3 services are completely equivalent
- If an operation fails on one S3 service, it switches to another service; if all services fail, the last error code generated is returned
- At most 10 S3 services can be configured
Without Relying on Flexify Service
The user interface is the same as S3, but the configuration of the following three parameters is different:
# | Parameter | Example Value | Description |
---|---|---|---|
1 | s3EndPoint | https://fd2d01c73.blob.core.windows.net | Blob URL |
2 | s3AccessKey | fd2d01c73:veUy/iRBeWaI2YAerl+AStw6PPqg== | Colon-separated user accountId:accountKey |
3 | s3BucketName | test-container | Container name |
Here, fd2d01c73 is the account ID; the Microsoft Blob storage service supports only the HTTPS protocol, not HTTP.
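Putting the example values above together, a direct (Flexify-free) Azure Blob configuration in taos.cfg would look like this (account ID, key, and container name are the illustrative values from the table; substitute your own):
s3EndPoint https://fd2d01c73.blob.core.windows.net
s3AccessKey fd2d01c73:veUy/iRBeWaI2YAerl+AStw6PPqg==
s3BucketName test-container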