Home
>
Data Science
>
Update Optimizing Data Queries for Time Series Applications

March 29, 2022 by Phu Nguyen

Update Optimizing Data Queries for Time Series Applications

Main Contents:

Optimizing Data Queries for Time Series Applications is an article under the topic Data Science Many of you are most interested in today !! Today, let’s InApps.net learn Optimizing Data Queries for Time Series Applications in today’s post !

Indexing

Katy Farmer

Katy lives in Oakland, CA with her husband and two dogs (at least one of whom talks to her about fun, technical stuff). She loves to experiment with code, break stuff, and try to fix it. She learned to code at Turing School of Software and Design in Denver, CO, and it gave her the perfect chance to break stuff before she knew how to fix it.

Indexing, the oft-recommended and rarely understood solution to all attempts at optimization, is applicable to most databases. Whether the time series database you’re using is built on Cassandra or MySQL or its own unique architecture, indexing affects your queries. Essentially, an index is a data structure that stores the values from a specific column, meaning that when we search by an indexed field, we have a handy shortcut to the values. When we search by unindexed fields, we have to discover the full path to the value, no shortcuts or magic tricks. Searching unindexed fields is like having to watch Frodo walk through Middle Earth unedited — it takes a long time.

While indexing is not unique to time series databases, we have to remember that the index is a data structure that becomes oversized if we have too many indexed columns or fields. An index structure that is too large ends up eating memory and slowing down processes, negating its advantages. The time series problem here is that there’s no convention around which pieces should be indexed, so we need to be aware of our schema at all times.

Query Scope

When a query gets me down, I usually hop into the command line. I’m happy there. When I was first discovering time series databases, I did just that. I skipped into my InfluxDB command line tool and typed:

SELECT * FROM ‘cpu’

And my life flashed before my eyes. Memories of small batches of user data brought tears to my eyes. My terminal turned into the kind of screen shown by a “hacker” in a crime TV show.

One of the distinctive qualities of time series data is that it is more valuable in higher volume—we store millions of points. Running a query using * (all) can potentially lock up your database while it retrieves points.

There are a few options to limit your query while also improving it.

Use a time range. Many time series application queries aggregate data from a window, so use that to your advantage.
Add a sub-query. This will limit the scope of your query by adding parameters, and ensure you only get relevant results.

The key to scoping your queries is to filter them — be as specific as possible to avoid data overload in your application, your terminal, and your mind.

Retention Policies

In the world of time series data, data points age like the bagged salad in my crisper drawer: I might keep it longer than I should, but eventually I’m going to need to throw it away. The high number of points makes it difficult to store time series data indefinitely, and even if disk space allows for an immense amount of data, the queries then have to run through a huge dataset.

Let’s say you’re ignoring some of my previous advice, and you need to run a query without a time window or a sub-query. You can control the amount of data just by setting up processes to delete expired data. This is another piece whose logistics depend on which database you’re using, but it’s a common time series problem, so solutions abound on the internet for your database of choice. Delete expired data and save yourself some… time.

Cardinality

Even if our query is perfect, high cardinality will slow us down. The number of unique values in a column or series determines cardinality — high cardinality means a high number of unique values. Cardinality tends to increase when we want to query across more and more combinations of attributes, which then leads to time the database spends: finding the appropriate values in a series, performing any necessary functions (i.e., sum the values) on those values, repeating for every relevant, unique series, and then combining them according to the query requirements. As the index and cardinality grow, so does the overhead in running a query.

In a columnar database, we can improve performance by ensuring we have fewer series with more points rather than more series with fewer number of points. Compression techniques in time series run more efficiently on long runs of values, so if we want to get the most out of our database, we need to follow its rules.

In time series databases built on relational databases, cardinality affects the index more than anything else, so we need to keep an eye on the size of the index so it doesn’t suck up our resources.

Conclusion

You got through some heavy stuff here. Remember to take deep breaths and go to a happy place to process all of the information.

Become the sheep.

Your time series application deserves excellence in its level of efficiency and performance—and you can make it happen. Paying attention to indexing, query scope, retention policies and cardinality may not solve all of your problems, but the more you know about your data, the better you’ll be able to craft queries. We’re one step closer to being time series masters.

InfluxData is a sponsor of InApps.

Illustrations by Katy Farmer.

Source: InApps.net

Rate this post

Phu Nguyen

As a Senior Tech Enthusiast, I bring a decade of experience to the realm of tech writing, blending deep industry knowledge with a passion for storytelling. With expertise in software development to emerging tech trends like AI and IoT—my articles not only inform but also inspire. My journey in tech writing has been marked by a commitment to accuracy, clarity, and engaging storytelling, making me a trusted voice in the tech community.

Let’s create the next big thing together!

Coming together is a beginning. Keeping together is progress. Working together is success.

Let’s talk

Recommended

Tech News

May 29, 2025 by Anh Hoang

Update Optimizing Data Queries for Time Series Applications

Read more about Optimizing Data Queries for Time Series Applications at Wikipedia

Indexing

Query Scope

Retention Policies

Cardinality

Conclusion

AI Automation for Business in 2025: A Step-by-Step Guide

FITNESS APP DEVELOPMENT

ONLINE COURSE APP

EVE HR – WEB DESIGN

AIRGOGO WEBSITE

WALLET APP DEVELOPMENT

Ho Chi Minh City Launches Digital Traffic App 2017

Why Your Business Needs a Mobile App Rather Than a Website

7 Questions To Ask Yourself Before You ‘App’ | Entrepreneur

Homestays Marketplace Application Development

Blog post

9 Practical Tips to Choose a Mobile App Development Company for 2023

AI Automation for Business in 2025: A Step-by-Step Guide

Top 10 Offshore Development Companies (ODCs) in 2025

How can businesses effectively integrate AI into their operations?

Locations

Read more about Optimizing Data Queries for Time Series Applications at Wikipedia

Indexing

Query Scope

Retention Policies

Cardinality

Conclusion

Get a custom Proposal

You need to enter your email to download

Blog post

Locations