Update PostgreSQL 9.5 Geared to Liberate Enterprises from the Data Warehouse

Main Contents:

PostgreSQL 9.5 Geared to Liberate Enterprises from the Data Warehouse is an article under the topic Data Science Many of you are most interested in today !! Today, let’s InApps.net learn PostgreSQL 9.5 Geared to Liberate Enterprises from the Data Warehouse in today’s post !

New Analytic Features

The database system comes with a number of new analytic features typically found in data warehouses, including grouping sets, cubes and roll-up. They all offer functionality that can be executed through a series of standard SQL operators such as UNION ALL, though they make it much easier to carry out this work, speeding the execution times of complex queries and offer the way to craft more nuanced commands. Think of the need to summarize information like employee headcount across different departments, locations and job roles.

“This [approach] has the efficiency of going through the data only once,” Momjian said. “Telling people to use UNION ALL gets awkward after a while.”

Another new feature that should help in analytics is a new indexing type called BRIN (Block Range Index). BRIN can generate very small indexes to describe a range of information, such as minimum and maximum values, that allow queries to skip over vast numbers of rows when looking for data within a certain range. With BRIN, 100GB of data can be summarized within 100KB or so.

“The BRIN creates a filter index,” Momjian said. “You’re looking for a purple shirt in a multi-terabyte table, and I can basically whittle down the table to know that the purple shirt will be within one percent of the table.”

UPSERT Done Right

One of the feature users have been most excited about has been the introduction of an UPSERT command, which combines INSERT and UPDATE commands into a single call such that it can automatically turn an INSERT into an UPDATE if the data already exists. PostgreSQL doesn’t have a specific UPSERT command, but rather offers a special clause that can be used with INSERT, that achieves the same outcome.

This is a feature that other database systems have had for awhile and Momjian admitted he was slightly embarrassed that PostGreSQL did not possess this feature until now.

It turns out that many of the implementations of UPSERT (also called MERGE on some systems) on other database systems were “handled very badly,” Momjian said. Implementing this feature is a difficult task, especially to handle use cases where the database is being updated by multiple parties at once. In numerous other systems, two people doing an UPSERT of the same data at the same time would just result in one user, or both, getting error messages, which is not an optimal way to handle the situation.

The PostgreSQL team didn’t want to just hack something together that would result in technical debt that would have be addressed later by the developers, or by the user. They are pleased by the results.

“What is nice about our implementation is that it never generates an unexpected error. You can have multiple people doing this, and there is very little performance impact,” Momjian said. Because it can work on multiple tables at once, it can even be used to merge one table into another.

Feature Image: Simon Bolivar, NYC street art by Dasic Fernandez.

Source: InApps.net

Rate this post

Phu Nguyen

As a Senior Tech Enthusiast, I bring a decade of experience to the realm of tech writing, blending deep industry knowledge with a passion for storytelling. With expertise in software development to emerging tech trends like AI and IoT—my articles not only inform but also inspire. My journey in tech writing has been marked by a commitment to accuracy, clarity, and engaging storytelling, making me a trusted voice in the tech community.