Neo: Graph Databases Are for More than Social Media


Mapping the Constellations

Launched in 2007, Neo4j has already been adopted by some of the largest companies, especially in retail, telecommunications and health care. About 100 of the Global 2000, the Forbes list of the largest public companies, use the database. Walmart, for instance, uses it to generate product recommendations for its online retail operations.

IT giants are also increasingly backing the technology. For instance, IBM has stepped in to optimize the database to work with its Power8 line of servers, opening the possibility of running terabyte-sized Neo4j instances completely in memory.

Last month, when the International Consortium of Investigative Journalists (ICIJ) exposed the offshore tax havens of some of the world’s richest people, it used Neo4j to map the linkages, after indexing the material with Apache Solr and Tika.

“The domain model used by the ICIJ is really basic, just containing four types of entities (Officer, Client, Company, Address) and four relationships between them,” explained Neo’s Michael Hunger and William Lyon, in a blog post detailing how the documents can be further analyzed with Neo4j.

The Panama Papers modeled with Neo4j

Analyzing the linkages, the journalists initially identified five government leaders who held money in offshore or shell accounts. The findings led to at least one resignation, that of Iceland’s Prime Minister.

Neo4j 3.0’s cost-based optimizer can now optimize write queries, in addition to reads.

A graph database differs from a standard relational database in that, instead of storing data in tables and linking it through foreign keys, it stores data in individual nodes, which are connected by specified relationships. One node may hold a product name while another may hold a vendor name, with the relationship between them specifying that the vendor supplies that product.
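The vendor-and-product example can be sketched in a few lines of Python. This is only an in-memory illustration of the node-and-relationship idea, not how Neo4j stores data; the class names, the `SUPPLIES` relationship type and the sample names are all assumptions made up for the sketch.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A graph node: a label plus free-form properties."""
    label: str
    properties: dict = field(default_factory=dict)


@dataclass
class Relationship:
    """A typed, directed connection between two nodes."""
    start: Node
    rel_type: str
    end: Node


# One node holds a vendor name, another holds a product name (sample data).
vendor = Node("Vendor", {"name": "Acme Supplies"})
product = Node("Product", {"name": "Widget"})

# The relationship itself records that the vendor supplies the product.
relationships = [Relationship(vendor, "SUPPLIES", product)]


def supplied_by(vendor_node, rels):
    """Follow SUPPLIES relationships outward from a vendor node."""
    return [
        rel.end.properties["name"]
        for rel in rels
        if rel.start is vendor_node and rel.rel_type == "SUPPLIES"
    ]


print(supplied_by(vendor, relationships))  # ['Widget']
```

The point of the sketch is that the connection is a first-class record of its own, rather than a foreign-key column that has to be matched up at query time.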

Graph computing, if not graph databases specifically, is most widely known through social media sites such as Facebook and LinkedIn, both of which generate a lot of potentially useful information by making connections across different people. (Think of Facebook’s “People You May Know” feature, which suggests possible friends based on mutual friends.)
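A suggestion feature of this kind can be approximated by counting mutual friends. The sketch below is a toy version under stated assumptions: the friendship data is invented, and ranking candidates by mutual-friend count is a common simplification, not a description of how Facebook or LinkedIn actually work.

```python
from collections import Counter

# Hypothetical undirected friendship graph, stored as adjacency sets.
friends = {
    "alice": {"bob", "carol"},
    "bob": {"alice", "dave"},
    "carol": {"alice", "dave", "erin"},
    "dave": {"bob", "carol"},
    "erin": {"carol"},
}


def suggest_friends(person, graph):
    """Rank people two hops away by how many friends they share with person."""
    mutual_counts = Counter()
    for friend in graph[person]:
        for candidate in graph[friend]:
            # Skip the person themselves and anyone they already know.
            if candidate != person and candidate not in graph[person]:
                mutual_counts[candidate] += 1
    # Most mutual friends first; ties broken alphabetically.
    return sorted(mutual_counts.items(), key=lambda item: (-item[1], item[0]))


print(suggest_friends("alice", friends))  # [('dave', 2), ('erin', 1)]
```

Here dave ranks first for alice because both bob and carol know him, while erin is reachable only through carol.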

Traditional relational databases handle queries about the relationships between entities poorly. Every relationship query requires at least one join operation, and performance degrades quickly as the joins accumulate.
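To see where the joins come from, consider a friends-of-friends lookup in a relational schema. The sketch below uses Python's built-in `sqlite3` module with a hypothetical two-table schema (`person`, `friendship`); it only illustrates the query shape and makes no claim about actual performance.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE friendship (person_id INTEGER, friend_id INTEGER)")
conn.executemany(
    "INSERT INTO person VALUES (?, ?)",
    [(1, "alice"), (2, "bob"), (3, "carol")],
)
# alice -> bob -> carol
conn.executemany("INSERT INTO friendship VALUES (?, ?)", [(1, 2), (2, 3)])

# Two hops need the friendship table joined to itself, plus a join to person
# to recover the name. A third hop would add yet another self-join.
query = """
SELECT p.name
FROM friendship AS f1
JOIN friendship AS f2 ON f2.person_id = f1.friend_id
JOIN person AS p ON p.id = f2.friend_id
WHERE f1.person_id = ?
"""
friends_of_friends = [row[0] for row in conn.execute(query, (1,))]
print(friends_of_friends)  # ['carol']
```

Each extra hop in the relationship chain adds another join to the statement, which is the pattern a graph database replaces with direct traversal from node to node.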

Any computing task that depends on understanding the relationships among different entities falls within the purview of graph databases.

Take detecting the fraudulent use of a credit card, for instance. Fraud detection relies on a technique called link analysis, which examines who is charging items on a credit card and where these transactions are taking place, and compares them to the historical record of the credit card holder. Ideally, fraud detection should happen in real time, so the cheaters don’t make off with too much money.
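A heavily simplified version of that comparison might look like the following. It is a sketch under stated assumptions: the `Transaction` fields, the sample data and the rule itself (flag a charge that shares neither a city nor a merchant with the card's history) are invented for illustration, and real link analysis weighs far more signals.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transaction:
    card_id: str
    merchant: str
    city: str


def flag_suspicious(history, incoming):
    """Return incoming transactions with no link to the card's history."""
    # Link each card to the cities and merchants seen in its past charges.
    known = {}
    for past in history:
        links = known.setdefault(past.card_id, {"cities": set(), "merchants": set()})
        links["cities"].add(past.city)
        links["merchants"].add(past.merchant)

    flagged = []
    for tx in incoming:
        links = known.get(tx.card_id, {"cities": set(), "merchants": set()})
        # No shared city and no shared merchant: nothing connects this
        # charge to the card holder's recorded behaviour.
        if tx.city not in links["cities"] and tx.merchant not in links["merchants"]:
            flagged.append(tx)
    return flagged


history = [
    Transaction("card-1", "Corner Grocery", "Austin"),
    Transaction("card-1", "City Fuel", "Austin"),
]
incoming = [
    Transaction("card-1", "Corner Grocery", "Austin"),
    Transaction("card-1", "Luxury Watches", "Lagos"),
]

for tx in flag_suspicious(history, incoming):
    print(tx.merchant, tx.city)  # Luxury Watches Lagos
```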

What’s in the Box?

For this 3.0 release, the Neo architects created a new binary wire protocol, called Bolt, designed to speed communications between the application and the database system, a job heretofore handled by REST. The company also released drivers that work with the protocol, for Java, .NET, JavaScript and Python.


These drivers will make it easier for developers to build applications for Neo4j, noted James Governor, RedMonk co-founder and analyst, in a statement.
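As a rough idea of what working with one of these drivers looks like, here is a minimal sketch using the official Python driver (the `neo4j` package) over Bolt. Several things are assumptions rather than details from the release: the local `bolt://localhost:7687` address and credentials, the `Vendor`/`Product`/`SUPPLIES` model, and the `$name` parameter syntax, which is the form used by current Neo4j releases. Running the function requires the driver to be installed and a database to be reachable.

```python
# Assumption: a local Neo4j instance listening on the default Bolt port.
BOLT_URI = "bolt://localhost:7687"

# Hypothetical data model: (:Vendor)-[:SUPPLIES]->(:Product).
PRODUCTS_BY_VENDOR = (
    "MATCH (v:Vendor {name: $name})-[:SUPPLIES]->(p:Product) "
    "RETURN p.name AS product"
)


def products_supplied_by(vendor_name, uri=BOLT_URI, user="neo4j", password="password"):
    """Return the names of products a vendor supplies, queried over Bolt."""
    # Imported here so this module still loads when the driver is not installed.
    from neo4j import GraphDatabase

    driver = GraphDatabase.driver(uri, auth=(user, password))
    try:
        with driver.session() as session:
            result = session.run(PRODUCTS_BY_VENDOR, name=vendor_name)
            return [record["product"] for record in result]
    finally:
        driver.close()
```

A call such as `products_supplied_by("Acme Supplies")` would open a Bolt connection, run the parameterized Cypher query and return the matching product names as a list.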

Also potentially making Neo4j easier to work with for developers is support for Java Stored Procedures, which sets the stage for schema introspection.

Neo4j 3.0 also revamps its core technology to work with larger data sets. The company had found some of its customers were pushing into hundreds-of-billions scale graphs, though the software was more suited to the range of tens of billions of records. Thanks to a new storage engine, that limit has been removed.

Feature Image: Johannes Hevelius, “Prodromus Astronomia, volume III: Firmamentum Sobiescianum, sive Uranographia,” table QQ: Orion, 1690.

Source: InApps.net


Phu Nguyen
