Update Video Tutorial: Wrangling Time Series Data with Basho Riak TS

Main Contents:

Video Tutorial: Wrangling Time Series Data with Basho Riak TS is an article under the topic Data Science Many of you are most interested in today !! Today, let’s InApps.net learn Video Tutorial: Wrangling Time Series Data with Basho Riak TS in today’s post !

Data Modeling

As Genova explained, Riak TS facilitates defining a schema with an SQL-like CREATE TABLE statement. Tables can include any number of columns of the usual data types such as integer, floating point, varchar, time stamp, and boolean.

“Every table must include a time-stamp column as well as a primary key definition, and the primary key uses time quantization to allow the user to specify the extent to which ordered ranges of data are partitioned together in a cluster,” said Genova.

DataModeling-query

“Primary keys are defined by a combination of the quantum function as well as a series ID,” said Genova. “Bucket type is created using the standard Riak admin command that folks that use KV would be familiar with, and the bucket type create statement includes the table definition as one of its properties.”

Writing Data

Demonstrating how to create and store a time series row with the Riak Java client, Genova said, “The columns in each row are required to be in the correct order as well as have the correct types based on the schema that was defined for the table. Time stamps are required to be in milliseconds, since the epoch.”

WritingData-query

Genova mentioned that in Riak 1.0, data validation will be server-side but client-side validation will be supported in future versions.

Reading Data

Riak TS facilitates reads through its support for an SQL-like query language, Genova explained. “Queries consist of standard SELECT statements that allow you to specify a time range, a series ID, as well as which subset of your columns that you’d like to return. You can use secondary fields optionally to filter the results set, and the standard set of logical operators apply to that, so operators like equals, not equals, greater than, less than, etc.”

Genova described two examples, the first example being a standard “SELECT from table” query, and the second example showed the selection of a single field and also filtering on a secondary field.

ReadingData-query

Reads are optimized primarily through the colocation and ordering of the primary data, according to Genova. “This allows the service queries from a tunable number of partitions, which allows the system to avoid the expensive coverage queries that would be necessary if you were to use secondary indexes to perform the same sort of query,” he said.

“We are also filtering data at the level of the storage back end. In the previous example we filtered on the temperature field,” he continued. “Rather than being at the coordinator after subqueries return their full result sets, the filtering of that data will happen at the level of the storage back end, which provides another level of efficiency and minimizes network overhead.”

Genova noted another optimization provided by the query language itself, which is a subset of SQL, offering flexibility and familiarity to the user.

To conclude, Genova stressed the maturity, the reliability and the ease of use of the underlying, highly-scalable architecture, and also referenced the multi-datacenter replication capabilities that are inherent in that architecture. He said that this quality will ultimately be inherited by the TS database as well.

“The way we’ve designed this was to optimize it specifically time series, rather than simply create a more generic, big table implementation,” he said, “and I think that provides another measure of ease of use.”

Basho is a sponsor of InApps Technology.

Source: InApps.net

Rate this post

Phu Nguyen

As a Senior Tech Enthusiast, I bring a decade of experience to the realm of tech writing, blending deep industry knowledge with a passion for storytelling. With expertise in software development to emerging tech trends like AI and IoT—my articles not only inform but also inspire. My journey in tech writing has been marked by a commitment to accuracy, clarity, and engaging storytelling, making me a trusted voice in the tech community.