read.graph {igraph} | R Documentation |
The read.graph
function is able to read graphs in
various representations from a file, or from a http
connection. Currently some simple formats are supported.
read.graph(file, format = c("edgelist", "pajek", "ncol", "lgl", "graphml", "dimacs", "graphdb", "gml"), ...)
file |
The connection to read from. This can be a local file, or
a http or ftp connection. It can also be a character
string with the file name or URI. |
format |
Character constant giving the file format. Right now
edgelist , pajek , graphml , gml ,
ncol , lgl , dimacs and graphdb
are supported, the default is edgelist . As of igraph 0.4 this
argument is case insensitive.
|
... |
Additional arguments, see below. |
The read.graph
function may have additional arguments depending
on the file format (the format
argument).
This format is a simple text file with numeric vertex ids defining the edges. There is no need to have newline characters between the edges, a simple space will also do.
Additional arguments:
TRUE
.
igraph
implements only a subset of the Pajek format:
Vertex and edge attributes defined in the Pajek file will be also read and assigned to the graph object to be created. These are mainly parameters for graph visualization, but not exclusively, eg. the file might contain edge weights as well.
The following vertex attributes might be added:
igraph name | description, Pajek attribute |
id | Vertex id |
x, y, z | The ‘x’, ‘y’ and ‘z’ coordinate of the vertex |
vertexsize | The size of the vertex when plotted (size in Pajek). |
shape | The shape of the vertex when plotted. |
color | Vertex color (ic in Pajek) if given with symbolic name |
color-red, | |
color-green, | |
color-blue | Vertex color (ic in Pajek) if given in RGB notation |
framecolor | Border color (bc in Pajek) if given with symbolic name |
framecolor-red, | |
framecolor-green, | |
framecolor-blue | Border color (bc in Pajek) if given in RGB notation |
labelcolor | Label color (lc in Pajek) if given with symbolic name |
labelcolor-red, | |
labelcolor-green, | |
labelcolor-blue | Label color (lc in Pajek) if given in RGB notation |
xfact, yfact | The x_fact and y_fact Pajek attributes. |
labeldist | The distance of the label from the vertex. (lr in Pajek.) |
labeldegree, | |
labeldegree2 | The la and lphi Pajek attributes |
framewidth | The width of the border (bw in Pajek). |
fontsize | Size of the label font (fos in Pajek.) |
rotation | The rotation of the vertex (phi in Pajek). |
radius | Radius, for some vertex shapes (r in Pajek). |
diamondratio | For the diamond shape (q in Pajek). |
These igraph attributes are only created if there is at least one vertex
in the Pajek file which has the corresponding associated
information. Eg. if there are vertex coordinates for at least one vertex
then the ‘x’, ‘y’ and possibly also ‘z’
vertex attributes will be created. For those vertices for which the
attribute is not defined, NaN
is assigned.
The following edge attributes might be added:
igraph name | description, Pajek attribute |
weight | Edge weights. |
label | l in Pajek. |
color | Edge color, if the color is given with a symbolic name, c in Pajek. |
color-red, | |
color-green, | |
color-blue | Edge color if it was given in RGB notation, c in Pajek. |
edgewidth | w in Pajek. |
arrowsize | s in Pajek. |
hook1, hook2 | h1 and h2 in Pajek. |
angle1, angle2 | a1 and a2 in Pajek, Bezier curve parameters. |
velocity1, | |
velocity2 | k1 and k2 in Pajek, Bezier curve parameter. |
arrowpos | ap in Pajek. |
labelpos | lp in Pajek. |
labelangle, | |
labelangle2 | lr and lphi in Pajek. |
labeldegree | la in Pajek. |
fontsize | fos in Pajek. |
arrowtype | a in Pajek. |
linepattern | p in Pajek. |
labelcolor | lc in Pajek. |
There are no additional arguments for this format.
See the GraphML homepage at http://graphml.graphdrawing.org for more information about the GraphML format.
Additional arguments:
Although all syntactically correct GML can be parsed, we implement only a subset of this format, some attributes might be ignored. Here is a list of all the differences:
node
and edge
attributes are
used, and only if they have a simple type: integer, real or
string. So if an attribute is an array or a record, then it is
ignored. This is also true if only some values of the
attribute are complex.Version
and the
first graph
attribute are completely ignored.node
and
edge
are completely ignored.inf
(infinity) and nan
(not a number) as a real number. This is case insensitive, so
nan
, NaN
and NAN
are equal.Please contact us if you cannot live with these limitations of the GML parser.
There are not additional argument for this format.
The resulting graph is always undirected. LGL cannot deal with files which contain multiple or loop edges, this is however not checked here, as a igraph is happy with these.
Additional arguments:
lgl
format is used by the Large Graph Layout
visualization software
(http://bioinformatics.icmb.utexas.edu/lgl), it can describe
undirected optionally weighted graphs. From the LGL manual:
“The second format is the LGL file format (.lgl file
suffix). This is yet another graph file format that tries to be
as stingy as possible with space, yet keeping the edge file in a
human readable (not binary) format. The format itself is like
the following:
# vertex1name vertex2name [optionalWeight] vertex3name [optionalWeight]Here, the first vertex of an edge is preceded with a pound sign '#'. Then each vertex that shares an edge with that vertex is listed one per line on subsequent lines.”
LGL cannot handle loop and multiple edges or directed graphs, but in a igraph it is not an error to have multiple and loop edges.
Additional arguments:
This is a line-oriented text file (ASCII) format. The first
character of each line defines the type of the line. If the first
character is c
the line is a comment line and it is
ignored. There is one problem line (p
) in the file, it
must appear before any node and arc descriptor lines. The problem
line has three fields separated by spaces: the problem type
(min
, max
or asn
), the
number of vertices and number of edges in the graph.
Exactly two node identification lines are expected
(n
), one for the source, one for the target vertex.
These have two fields: the id of the vertex and the type of the
vertex, either s
(=source) or t
(=target). Arc lines start with a
and have three
fields: the source vertex, the target vertex and the edge capacity.
Vertex ids are numbered from 1.
The source vertex is assigned to the source
, the target
vertex to the target
graph attribute. The edge capacities
are assigned to the capacity
edge attribute.
Additional arguments:
The graphs are stored in a compact binary format, one graph per file. The file is composed of 16 bit words, which are represented using the so-called little-endian convention, i.e. the least significant byte of the word is stored first.
Then, for each node, the file contains the list of edges coming out of the node itself. The list is represented by a word encoding its length, followed by a word for each edge, representing the destination node of the edge. Node numeration is 0-based, so the first node of the graph has index 0.
See also graph.graphdb
.
Only unlabelled graphs are implemented.
Additional attributes:
A graph object.
Gabor Csardi csardi@rmki.kfki.hu