Curate LabsCurate Labs

Curate Labs Article

GraphForge v0.2.1: Dataset Loading

GraphForge v0.2.1 made it easier to start analysis quickly by adding cached loading for real-world SNAP graph datasets.

GraphForge v0.3.0: Major Cypher Feature Release visual summary

Open Source release

GraphForge is Curate Labs work, released through the DecisionNerd open-source organization. This post is a release note for embedded graph tooling and analytical Python workflows.

GraphForge v0.2.1 added dataset loading infrastructure so users could start with real graphs instead of hand-building toy examples.

The release introduced cached loading for Stanford Network Analysis Project datasets, plus `MERGE ON CREATE SET` and `MERGE ON MATCH SET`.

What Shipped

  • `GraphForge.from_dataset()`

  • `load_dataset()`, `list_datasets()`, `get_dataset_info()`, and `clear_cache()`

  • Cached auto-downloads under `~/.graphforge/datasets/`

  • SNAP datasets including Facebook, Enron, AstroPh, Google Web, and Twitter graphs

  • MERGE create/match property semantics

  • WITH variable passing fix in aggregation queries

Why it matters

Good graph tooling should make the first useful query easy. Dataset loading turned GraphForge into a faster workbench for experiments, tutorials, and benchmark-style analysis.

Source