Package it.unimi.dsi.big.webgraph
Class EFGraph
java.lang.Object
it.unimi.dsi.big.webgraph.ImmutableGraph
it.unimi.dsi.big.webgraph.EFGraph
- All Implemented Interfaces:
FlyweightPrototype<ImmutableGraph>
An immutable graph based on the Elias–Fano representation of monotone sequences.
- Author:
- Sebastiano Vigna
-
Nested Class Summary
Modifier and TypeClassDescriptionprotected static final class
protected static final class
protected static final class
protected static final class
static final class
Nested classes/interfaces inherited from class it.unimi.dsi.big.webgraph.ImmutableGraph
ImmutableGraph.LoadMethod
-
Field Summary
Modifier and TypeFieldDescriptionprotected final CharSequence
The basename of this graph (or possiblynull
).protected long
If notLong.MIN_VALUE
, the node whose degree is cached incachedOutdegree
.protected long
IfcachedNode
is notLong.MIN_VALUE
, its cached outdegree.protected long
IfcachedNode
is notLong.MIN_VALUE
, the position immediately after the coding of the outdegree ofcachedNode
.static final int
The default size of the bit cache.static final int
The default base-two logarithm of the quantum.static final int
This number classifies the present graph format.protected final LongBigList
The list containing the graph.static final String
The standard extension for the graph longword bit stream.protected final int
The base-two logarithm of the indexing quantum.protected final long
The number of arcs of the graph.protected final long
The number of nodes of the graph.protected final LongBigList
An Elias–Fano monotone list containing the pointers of the bit streams stored ingraph
.static final String
The standard extension for the cachedLongBigList
containing the graph offsets.static final String
The standard extension for the graph-offsets bit stream.protected final EFGraph.LongWordBitReader
A longword bit reader used to read outdegrees.protected final long
The upper bound used during the graph construction (greater than or equal ton
.Fields inherited from class it.unimi.dsi.big.webgraph.ImmutableGraph
GRAPHCLASS_PROPERTY_KEY, NUMBER_OF_THREADS_PROPERTY, PROPERTIES_EXTENSION
-
Constructor Summary
ModifierConstructorDescriptionprotected
EFGraph
(CharSequence basename, long n, long m, long upperBound, int log2Quantum, LongBigList graph, LongBigList offsets) -
Method Summary
Modifier and TypeMethodDescriptionbasename()
Returns a symbolic basename for this graph (optional operation).copy()
Returns a flyweight copy of this immutable graph.static EFGraph
load
(CharSequence basename) Creates a newEFGraph
by loading a compressed graph file from disk to memory, with no progress logger and all offsets.static EFGraph
load
(CharSequence basename, ProgressLogger pl) Creates a newEFGraph
by loading a compressed graph file from disk to memory, with all offsets.protected static EFGraph
loadInternal
(CharSequence basename, boolean mapped, ProgressLogger pl) Loads a compressed graph file from disk into this graph.static LongBigArrayBigList
loadLongBigList
(CharSequence filename, ByteOrder byteOrder) Commodity method for loading a big list of binary longs with specified endianness into a long big array which just delegates toEFGraph.loadLongBigList(CharSequence, ByteOrder)
.static EFGraph
loadMapped
(CharSequence basename) Creates a newEFGraph
by memory-mapping a graph file.static EFGraph
loadMapped
(CharSequence basename, ProgressLogger pl) Creates a newEFGraph
by memory-mapping a graph file.static EFGraph
loadOffline
(CharSequence basename) Creates a newEFGraph
by loading just the metadata of a compressed graph file.static EFGraph
loadOffline
(CharSequence basename, ProgressLogger pl) Creates a newEFGraph
by loading just the metadata of a compressed graph file.static EFGraph
loadSequential
(CharSequence basename) Deprecated.static EFGraph
loadSequential
(CharSequence basename, ProgressLogger pl) Deprecated.static int
lowerBits
(long length, long upperBound) Returns the number of lower bits for the Elias–Fano encoding of a list of given length, upper bound and strictness.static void
long
numArcs()
Returns the number of arcs of this graph (optional operation).static long
numberOfPointers
(long length, long upperBound, int log2Quantum) Returns the number of forward or skip pointers to the Elias–Fano encoding of a list of given length, upper bound and strictness.long
numNodes()
Returns the number of nodes of this graph.long
outdegree
(long x) Returns the outdegree of a node.static int
pointerSize
(long length, long upperBound) Returns the size in bits of forward or skip pointers to the Elias–Fano encoding of a list of given length, upper bound and strictness.boolean
Checks whether this graph provides random access to successor lists.static void
store
(ImmutableGraph graph, long upperBound, CharSequence basename, int log2Quantum, int cacheSize, ByteOrder byteOrder, ProgressLogger pl) static void
store
(ImmutableGraph graph, long upperBound, CharSequence basename, ProgressLogger pl) static void
store
(ImmutableGraph graph, CharSequence basename) static void
store
(ImmutableGraph graph, CharSequence basename, int log2Quantum, int cacheSize, ByteOrder byteOrder, ProgressLogger pl) static void
store
(ImmutableGraph graph, CharSequence basename, ProgressLogger pl) successors
(long x) Returns a lazy iterator over the successors of a given node.Methods inherited from class it.unimi.dsi.big.webgraph.ImmutableGraph
equals, hasCopiableIterators, hashCode, intNumNodes, load, loadOnce, nodeIterator, nodeIterator, outdegrees, splitNodeIterators, store, store, successorBigArray, toString, wrap, wrap
-
Field Details
-
GRAPH_EXTENSION
The standard extension for the graph longword bit stream.- See Also:
-
OFFSETS_EXTENSION
The standard extension for the graph-offsets bit stream.- See Also:
-
OFFSETS_BIG_LIST_EXTENSION
The standard extension for the cachedLongBigList
containing the graph offsets.- See Also:
-
DEFAULT_CACHE_SIZE
public static final int DEFAULT_CACHE_SIZEThe default size of the bit cache.- See Also:
-
EFGRAPH_VERSION
public static final int EFGRAPH_VERSIONThis number classifies the present graph format. When new features require introducing binary incompatibilities, this number is bumped so to ensure that old classes do not try to read graphs they cannot understand.- See Also:
-
DEFAULT_LOG_2_QUANTUM
public static final int DEFAULT_LOG_2_QUANTUMThe default base-two logarithm of the quantum.- See Also:
-
n
protected final long nThe number of nodes of the graph. -
upperBound
protected final long upperBoundThe upper bound used during the graph construction (greater than or equal ton
. -
m
protected final long mThe number of arcs of the graph. -
graph
The list containing the graph. -
offsets
An Elias–Fano monotone list containing the pointers of the bit streams stored ingraph
. -
basename
The basename of this graph (or possiblynull
). -
outdegreeLongWordBitReader
A longword bit reader used to read outdegrees. -
log2Quantum
protected final int log2QuantumThe base-two logarithm of the indexing quantum. -
cachedNode
protected long cachedNodeIf notLong.MIN_VALUE
, the node whose degree is cached incachedOutdegree
. -
cachedOutdegree
protected long cachedOutdegreeIfcachedNode
is notLong.MIN_VALUE
, its cached outdegree. -
cachedPointer
protected long cachedPointerIfcachedNode
is notLong.MIN_VALUE
, the position immediately after the coding of the outdegree ofcachedNode
.
-
-
Constructor Details
-
EFGraph
protected EFGraph(CharSequence basename, long n, long m, long upperBound, int log2Quantum, LongBigList graph, LongBigList offsets)
-
-
Method Details
-
basename
Description copied from class:ImmutableGraph
Returns a symbolic basename for this graph (optional operation).Implementors of this class may provide a basename (usually a pathname from which various files storing the graph are stemmed). This method is optional because it is sometimes unmeaningful (e.g., for one-off anonymous classes).
- Overrides:
basename
in classImmutableGraph
- Returns:
- the basename.
-
lowerBits
public static int lowerBits(long length, long upperBound) Returns the number of lower bits for the Elias–Fano encoding of a list of given length, upper bound and strictness.- Parameters:
length
- the number of elements of the list.upperBound
- an upper bound for the elements of the list.- Returns:
- the number of bits for the Elias–Fano encoding of a list with the specified parameters.
-
pointerSize
public static int pointerSize(long length, long upperBound) Returns the size in bits of forward or skip pointers to the Elias–Fano encoding of a list of given length, upper bound and strictness.- Parameters:
length
- the number of elements of the list.upperBound
- an upper bound for the elements of the list.- Returns:
- the size of bits of forward or skip pointers the Elias–Fano encoding of a list with the specified parameters.
-
numberOfPointers
public static long numberOfPointers(long length, long upperBound, int log2Quantum) Returns the number of forward or skip pointers to the Elias–Fano encoding of a list of given length, upper bound and strictness.- Parameters:
length
- the number of elements of the list.upperBound
- an upper bound for the elements of the list.log2Quantum
- the logarithm of the quantum size.- Returns:
- an upper bound on the number of skip pointers or the (exact) number of forward pointers.
-
load
Creates a newEFGraph
by loading a compressed graph file from disk to memory, with no progress logger and all offsets.- Parameters:
basename
- the basename of the graph.- Returns:
- a
EFGraph
containing the specified graph. - Throws:
IOException
- if an I/O exception occurs while reading the graph.
-
load
Creates a newEFGraph
by loading a compressed graph file from disk to memory, with all offsets.- Parameters:
basename
- the basename of the graph.pl
- a progress logger used while loading the graph, ornull
.- Returns:
- a
EFGraph
containing the specified graph. - Throws:
IOException
- if an I/O exception occurs while reading the graph.
-
loadMapped
Creates a newEFGraph
by memory-mapping a graph file.- Parameters:
basename
- the basename of the graph.- Returns:
- an
EFGraph
containing the specified graph. - Throws:
IOException
- if an I/O exception occurs while memory-mapping the graph or reading the offsets.
-
loadMapped
Creates a newEFGraph
by memory-mapping a graph file.- Parameters:
basename
- the basename of the graph.pl
- a progress logger used while loading the offsets, ornull
.- Returns:
- an
EFGraph
containing the specified graph. - Throws:
IOException
- if an I/O exception occurs while memory-mapping the graph or reading the offsets.
-
loadSequential
@Deprecated public static EFGraph loadSequential(CharSequence basename, ProgressLogger pl) throws IOException Deprecated.Creates a newEFGraph
by loading a compressed graph file from disk to memory, without offsets.- Parameters:
basename
- the basename of the graph.pl
- a progress logger used while loading the graph, ornull
.- Returns:
- a
EFGraph
containing the specified graph. - Throws:
IOException
- if an I/O exception occurs while reading the graph.
-
loadSequential
Deprecated.UseloadOffline(CharSequence)
orloadMapped(CharSequence)
instead.Creates a newEFGraph
by loading a compressed graph file from disk to memory, with no progress logger and without offsets.- Parameters:
basename
- the basename of the graph.- Returns:
- a
EFGraph
containing the specified graph. - Throws:
IOException
- if an I/O exception occurs while reading the graph.
-
loadOffline
Creates a newEFGraph
by loading just the metadata of a compressed graph file.- Parameters:
basename
- the basename of the graph.pl
- a progress logger, ornull
.- Returns:
- a
EFGraph
containing the specified graph. - Throws:
IOException
- if an I/O exception occurs while reading the metadata.
-
loadOffline
Creates a newEFGraph
by loading just the metadata of a compressed graph file.- Parameters:
basename
- the basename of the graph.- Returns:
- a
EFGraph
containing the specified graph. - Throws:
IOException
- if an I/O exception occurs while reading the metadata.
-
loadLongBigList
public static LongBigArrayBigList loadLongBigList(CharSequence filename, ByteOrder byteOrder) throws IOException Commodity method for loading a big list of binary longs with specified endianness into a long big array which just delegates toEFGraph.loadLongBigList(CharSequence, ByteOrder)
.- Parameters:
filename
- the file containing the longs.byteOrder
- the endianness of the longs.- Returns:
- a big list of longs containing the longs in
filename
. - Throws:
IOException
-
loadInternal
protected static EFGraph loadInternal(CharSequence basename, boolean mapped, ProgressLogger pl) throws IOException Loads a compressed graph file from disk into this graph. Note that this method should be called only on a newly created graph.- Parameters:
basename
- the basename of the graph.mapped
- whether we want to memory-map the file.pl
- a progress logger used while loading the graph, ornull
.- Returns:
- this graph.
- Throws:
IOException
-
store
public static void store(ImmutableGraph graph, long upperBound, CharSequence basename, ProgressLogger pl) throws IOException - Throws:
IOException
-
store
public static void store(ImmutableGraph graph, CharSequence basename, ProgressLogger pl) throws IOException - Throws:
IOException
-
store
- Throws:
IOException
-
store
public static void store(ImmutableGraph graph, CharSequence basename, int log2Quantum, int cacheSize, ByteOrder byteOrder, ProgressLogger pl) throws IOException - Throws:
IOException
-
store
public static void store(ImmutableGraph graph, long upperBound, CharSequence basename, int log2Quantum, int cacheSize, ByteOrder byteOrder, ProgressLogger pl) throws IOException - Throws:
IOException
-
numNodes
public long numNodes()Description copied from class:ImmutableGraph
Returns the number of nodes of this graph.Albeit this method is not optional, it is allowed that this method throws an
UnsupportedOperationException
if this graph has never been entirely traversed using anode iterator
. This apparently bizarre behaviour is necessary to support implementations asArcListASCIIGraph
, which do not know the actual number of nodes until a traversal has been completed.- Specified by:
numNodes
in classImmutableGraph
- Returns:
- the number of nodes.
-
numArcs
public long numArcs()Description copied from class:ImmutableGraph
Returns the number of arcs of this graph (optional operation).- Overrides:
numArcs
in classImmutableGraph
- Returns:
- the number of arcs.
-
randomAccess
public boolean randomAccess()Description copied from class:ImmutableGraph
Checks whether this graph provides random access to successor lists.- Specified by:
randomAccess
in classImmutableGraph
- Returns:
- true if this graph provides random access to successor lists.
-
outdegree
public long outdegree(long x) Description copied from class:ImmutableGraph
Returns the outdegree of a node.- Specified by:
outdegree
in classImmutableGraph
- Parameters:
x
- a node.- Returns:
- the outdegree of the given node.
-
successors
Description copied from class:ImmutableGraph
Returns a lazy iterator over the successors of a given node. The iteration terminates when -1 is returned.- Overrides:
successors
in classImmutableGraph
- Parameters:
x
- a node.- Returns:
- a lazy iterator over the successors of the node.
-
copy
Description copied from class:ImmutableGraph
Returns a flyweight copy of this immutable graph.- Specified by:
copy
in interfaceFlyweightPrototype<ImmutableGraph>
- Specified by:
copy
in classImmutableGraph
- Returns:
- a flyweight copy of this immutable graph.
- See Also:
-
main
public static void main(String[] args) throws SecurityException, IllegalAccessException, InvocationTargetException, NoSuchMethodException, IOException, JSAPException, ClassNotFoundException, InstantiationException
-
loadOffline(CharSequence)
orloadMapped(CharSequence)
instead.