Curate Labs Article
GraphForge v0.2.1: Dataset Loading
GraphForge v0.2.1 made it easier to start analysis quickly by adding cached loading for real-world SNAP graph datasets.
Open Source release
GraphForge is Curate Labs work, released through the DecisionNerd open-source organization. This post is a release note for embedded graph tooling and analytical Python workflows.
GraphForge v0.2.1 added dataset loading infrastructure so users could start with real graphs instead of hand-building toy examples.
The release introduced cached loading for Stanford Network Analysis Project datasets, plus `MERGE ON CREATE SET` and `MERGE ON MATCH SET`.
What Shipped
`GraphForge.from_dataset()`
`load_dataset()`, `list_datasets()`, `get_dataset_info()`, and `clear_cache()`
Cached auto-downloads under `~/.graphforge/datasets/`
SNAP datasets including Facebook, Enron, AstroPh, Google Web, and Twitter graphs
MERGE create/match property semantics
WITH variable passing fix in aggregation queries
Why it matters
Good graph tooling should make the first useful query easy. Dataset loading turned GraphForge into a faster workbench for experiments, tutorials, and benchmark-style analysis.
Source
GitHub release: GraphForge v0.2.1