Introducing GraphAware Databridge: Graph Data Import Made Simple

Introducing Graphaware Databridge: Graph Data Import Made Simple

Introducing Graphaware Databridge: Graph Data Import Made Simple

Databridge is a fully-featured ETL tool specifically built for Neo4j, and designed for usability, expressive power and impressive performance.

Until now, Neo4j users wanting to import data into Neo4j have been faced with two choices: Create Cypher statements in conjunction with Cypher’s LOAD CSV or use Neo4j’s batch import tool.
Each of these approaches has its strengths and weaknesses. LOAD CSV is very flexible, but you need to learn Cypher, it struggles with large volumes of data and is relatively slow.

On the other hand, Neo4j’s batch import tool is extremely efficient at processing large data volumes. You don’t need to know any Cypher, but the input files usually need to be manually generated beforehand. Being a simple CSV loader, it also lacks the expressive power of Cypher.

Furthermore, many of the issues faced by any reasonably complex data import process can’t easily be solved using the existing tooling. Consequently, people often resort to creating bespoke solutions in code. We know because we’ve done it enough times ourselves.

Read Also:
Duke research institute, SAS share 45 years of cardiovascular data

Databridge
At GraphAware , we didn’t want to keep re-inventing the wheel at every new client we went to. So we took a different approach and built Databridge. Databridge is a fully-featured ETL tool specifically built for Neo4j, and designed for usability, expressive power, and impressive performance. It’s already in use at a number of GraphAware clients, and we think it’s now mature enough to bring it to the attention of the wider world.

So, in this blog post, we’re going to take a quick tour of the main features of Databridge, to give you an idea of what it can do, and to help you get a feel for whether it would be useful for you.
We’ll create a really simple example that you can follow along with as we go.

Declarative Approach
One of the difficulties with the current ETL tools is that they are quite developer-oriented. You either have to learn a lot of Cypher, or you have to be able to manipulate your raw data sources and generate node and relationship files that the batch import tool can use. As noted earlier, when these two options become infeasible, you need to write code.
But in fact, every Neo4j import needs to do exactly the same sorts of things: locate the data sources, know how to transform them into graph objects, link nodes together with relationships, assign labels, index properties and so on. All this pretty much boils down to two questions:

Read Also:
Two Pathways for Establishing Data Confidence

What data do I want?
What do I want it to look like when it’s loaded in the graph?
Databridge tackles these questions by being primarily declarative, instead of programmatic in nature.
It does this by using simple JSON files called schema descriptors in which you define the graph schema you want to build, along with resource descriptors in which you identify the data you want to import, and how to get it. This means you’re able to work directly with your source data exactly as is.

If you can create a JSON document, you can use Databridge.



Data Innovation Summit 2017

30
Mar
2017
Data Innovation Summit 2017

30% off with code 7wData

Read Also:
12 Barriers To Real-Time Analytics

Big Data Innovation Summit London

30
Mar
2017
Big Data Innovation Summit London

$200 off with code DATA200

Read Also:
Across Industries, Big Data is the Engine of Digital Innovation

Enterprise Data World 2017

2
Apr
2017
Enterprise Data World 2017

$200 off with code 7WDATA

Read Also:
Big Data and the Internet of Things don’t make business smarter, Analytics and Data Science do
Read Also:
Data: The Missing Piece in Your Content Marketing Strategy

Data Visualisation Summit San Francisco

19
Apr
2017
Data Visualisation Summit San Francisco

$200 off with code DATA200

Read Also:
How businesses can learn from the first wave of AI

Chief Analytics Officer Europe

25
Apr
2017
Chief Analytics Officer Europe

15% off with code 7WDCAO17

Read Also:
Data: The Missing Piece in Your Content Marketing Strategy

Leave a Reply

Your email address will not be published. Required fields are marked *