Latency measurements have become an important part of IT infrastructure and application monitoring. The latencies of a wide variety of events — requests, function calls, garbage collection, disk I/O, system calls, CPU scheduling and so on — are of great interest to engineers operating and developing IT systems. There are, however, a number of technical challenges associated with managing and analyzing latency data. The volume emitted by a single data source can easily become very large. Data also has to be collected and aggregated from a large number of different sources and stored over long time periods to allow historic comparisons and long-term estimation of service quality against service-level objectives (SLOs).

To address these challenges, a compression scheme can be applied to drastically reduce the size of the data before storage and transmission. Histograms are the most accurate and cost-effective data structure for this kind of compression.

Histogram Data Structures

Theo Schlossnagle
Theo founded Circonus in 2010, and continues to be its principal architect. He has been architecting, coding, building and operating scalable systems for 20 years. As a serial entrepreneur, he has founded four companies and helped grow countless engineering organizations. Theo is the author of Scalable Internet Architectures (Sams), a contributor to Web Operations (O’Reilly) and Seeking SRE (O’Reilly), and a frequent speaker at worldwide IT conferences. He is a member of the IEEE and a Distinguished Member of the ACM.

A histogram is a data structure that models the distribution of a set of samples, such as the age of every human on Earth. Instead of storing each sample as its own record, samples are grouped together into buckets, or bins, and only a count is kept per bucket. This grouping allows for significant data compression at far lower cost. It enables extraordinary metric transmission and ingestion rates, high-frequency real-time analytics and economical long-term storage. Histograms are also particularly useful in handling the breadth and depth of metric data produced by container technologies such as Kubernetes.
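As a minimal sketch of the grouping idea, the following Python snippet bins a handful of samples into fixed-width buckets. The fixed-width `bucket_of` function is purely illustrative; production histogram implementations typically use log-linear bucket boundaries instead.

```python
from collections import Counter

def bucket_of(value, width=10):
    # Illustrative fixed-width buckets of `width` units each.
    lower = (value // width) * width
    return (lower, lower + width)

samples = [3, 7, 12, 18, 18, 25, 31, 33, 34, 97]
hist = Counter(bucket_of(s) for s in samples)

# The 10 raw samples collapse into a few (bucket, count) pairs,
# which is the source of the compression.
print(sorted(hist.items()))
```

With millions of samples per second, the number of occupied buckets stays small while the counts grow, so the storage cost is bounded by the bucket layout rather than by the sample volume.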


At Circonus, we’re passionate about histograms and how valuable they are for engineers and software developers, which is why we donated our histogram technology, OpenHistograms, to the open source community.

The problem is that the monitoring industry has no single standard for histograms, and therefore all too frequently, users are leveraging them incorrectly, which has costly consequences. In this article, I’ll share why histograms are needed now more than ever and why the monitoring industry needs to embrace an open source, single-standard histogram technology.

Histograms: Needed Now More Than Ever

When the internet was small and users were not accessing services at high rates, you could more easily store and analyze each individual request and set standards around serving all requests accurately and quickly enough. Today there are many, many more user interactions being generated, collected and analyzed. But even more game-changing is that organizations now have multiple layers of systems, services and applications communicating with each other, generating an overwhelming volume of data — far more than users alone could produce. For example, if you're running a database on a system and expect your disks to perform operations at a certain speed, this activity alone could generate a million data points per second, which ends up being almost a hundred billion per day.
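The arithmetic behind that daily volume is simple to check:

```python
# Back-of-the-envelope check: one million data points per second,
# sustained for a full day.
points_per_second = 1_000_000
seconds_per_day = 24 * 60 * 60            # 86,400
points_per_day = points_per_second * seconds_per_day
print(points_per_day)                      # 86,400,000,000 — close to 10**11
```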

Now, ensuring that all requests are served fast enough becomes an impractical objective, both from a capability and an economic standpoint. It's just not worth being perfect. So engineers must analyze the behavior of their systems and determine quantitatively what is good enough. If you're serving web pages or an API endpoint, how many errors are you allowed to have? How fast do you need to serve requests? The problem with the question "how fast do most of them need to be?" is that it contains two variables: how fast (measured in milliseconds) and how many (expressed as a percentile).


This is a really hard statistics problem to solve. On top of this, organizations have significantly more data to store. If recording every single transaction is exorbitantly expensive and doing the math of analyzing latencies on every single transaction is also expensive, then engineers need some sort of model that allows them to inexpensively store all of those measurements and answer that question of how many, how fast. The histogram is a perfect model for all of that.

Histograms can collect, compress and store all data points (billions!) and allow engineers to accurately analyze what percentage of their traffic is slower or faster than a certain speed — at low cost and with minimal overhead. Critically, they allow engineers to change both of those variables on the fly, after data ingestion. So instead of saying, "I need 99% of requests to be served faster than one second," you can start to ask, "What does it look like when 98% of requests are served faster than 5,500 milliseconds?"
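To illustrate how both variables stay adjustable after ingestion, here is a sketch over a hypothetical latency histogram; the bucket boundaries and counts are invented for illustration. From the per-bucket counts alone you can answer either direction of the question: what fraction is faster than a given latency, or which bucket a given percentile lands in.

```python
import bisect

# Hypothetical latency histogram: ascending bucket upper bounds (ms)
# and the count of requests that fell into each bucket.
bounds = [10, 50, 100, 500, 1000, 5000]
counts = [120, 340, 280, 180, 60, 20]
total = sum(counts)  # 1,000 requests

def fraction_faster_than(threshold_ms):
    """Fraction of requests whose bucket lies at or below threshold_ms."""
    idx = bisect.bisect_right(bounds, threshold_ms)
    return sum(counts[:idx]) / total

def quantile_upper_bound(q):
    """Bucket upper bound (ms) that the q-quantile falls into."""
    target = q * total
    cumulative = 0
    for bound, count in zip(bounds, counts):
        cumulative += count
        if cumulative >= target:
            return bound
    return bounds[-1]

print(fraction_faster_than(500))   # 0.92 -> 92% served within 500 ms
print(quantile_upper_bound(0.99))  # 5000 -> p99 lands in the 5,000 ms bucket
```

Because the counts are kept per bucket, neither query had to be chosen before the data was collected; both are evaluated against the stored distribution.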

Without histograms, you have to phrase your questions precisely before you start collecting data, and engineers cannot do that with specificity and accuracy beforehand. Histograms allow you to store effectively unlimited data and answer more complex statistical questions after the fact, which is what's needed in today's service-centric, rapid-release-cycle environment.

Histograms Must be Open Source

At Circonus, we’re open source advocates and believe most technology should be open source because it provides the assurance that users can be stakeholders in it. The most important reason we’re passionate about our histogram technology being open source, however, is that users absolutely must have an industry standard for histograms, meaning organizations can use a single histogram technology across their monitoring stacks.

If you’re collecting telemetry using different histograms from different vendors within your monitoring and observability stack — say, telemetry from your cloud provider and telemetry from your APM provider — you cannot merge the data between histograms because they use different binning or different techniques. Unfortunately, users often do merge this data, introducing significant error that carries into the subsequent analysis. This ends up hurting both the operator and the end user.
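A short sketch shows why identical binning makes merging trivial and lossless, and why differing bins do not; the bucket boundaries and counts here are hypothetical.

```python
from collections import Counter

# Two histograms that share the same binning scheme merge exactly:
# per-bucket counts simply add, with no loss of information.
cloud = Counter({(0, 10): 50, (10, 100): 30, (100, 1000): 5})
apm   = Counter({(0, 10): 20, (10, 100): 40, (100, 1000): 15})

merged = cloud + apm   # Counter addition sums counts bucket by bucket
print(merged[(10, 100)])   # 70

# With *different* binning there is no exact merge: a bucket such as
# (0, 25) from one vendor overlaps two buckets of the other, so any
# redistribution of its count is a guess — that guess is the error.
```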


The industry must focus on a single histogram model implementation because it increases compatibility between services and directly benefits the end user. Circonus’ implementation of histograms, circllhist, has been in the industry since 2011. It has been independently tested and evaluated multiple times over the years and consistently deemed superior to other approaches in terms of balancing performance, accuracy, correctness and usability. With the goal of fostering and facilitating the interchangeability and mergeability of data between vendor platforms for all users, we recently released our histogram technology under the Apache 2.0 license to the open source community as OpenHistograms.

Circonus’ OpenHistograms are vendor-neutral log-linear histograms for the compression, mergeability and analysis of telemetry data. Two key differentiating factors for OpenHistogram are that it uses base-10 bucketing, which eases usability, and that it does not require floating-point arithmetic, so you can run it on embedded systems that lack floating-point units.
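The following sketch illustrates the base-10, integer-only log-linear idea: each positive value maps to a two-significant-digit mantissa and a decimal exponent, so bucket width is at most 10% of the value. This is an illustration of the technique, not the exact OpenHistogram bucket layout.

```python
def log_linear_bucket(value):
    """Map a positive integer (e.g. a latency in microseconds) to a
    (mantissa, exponent) bucket covering
    [mantissa * 10**exponent, (mantissa + 1) * 10**exponent).
    The mantissa is normalized into 10..99 (two significant decimal
    digits) using only integer arithmetic — no floating point needed.
    Illustrative only; not the exact OpenHistogram layout."""
    v, exp = value, 0
    while v >= 100:        # shift down until two digits remain
        v //= 10
        exp += 1
    while v < 10:          # shift up for single-digit inputs
        v *= 10
        exp -= 1
    return (v, exp)

print(log_linear_bucket(4321))  # (43, 2): bucket [4300, 4400)
print(log_linear_bucket(7))     # (70, -1): bucket [7.0, 7.1)
```

Because the bucket layout is fixed by the base-10 rule rather than by the data, any two histograms built this way share identical bins and can be merged exactly.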

OpenHistograms allow users to seamlessly exchange telemetry between vendor platforms without introducing error. Organizations faced with the challenge of digesting and analyzing massive amounts of distribution data can now rely on a consistent, interchangeable and stable representation of that data — a significant capability for monitoring now and in the future.

Time for a Single Standard

The volume of data IT organizations are responsible for collecting and analyzing is growing substantially year over year. As a result, users are increasingly employing histogram technology as a way to measure service quality. A vast majority are merging telemetry data from different vendor histograms, and the output, while not obviously so, is wrong. Organizations are inaccurately concluding that they are or are not hitting SLOs and basing key operational decisions on data that can cost them thousands of dollars a year.

Every engineer and app developer should feel confident that they can create a histogram, give it to someone, and know that they can accurately use it. By embracing vendor-neutral, industry-standard histogram technology, users have one source of truth and can rest assured their analysis is accurate.
