This program is free software: you can redistribute it and/or modify
it under the terms of the GNU Affero General Public License as
published by the Free Software Foundation, either version 3 of the
License, or (at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU Affero General Public License for more details.
You should have received a copy of the GNU Affero General Public License
along with this program. If not, see .
*/
include 'tcng.php';
include 'header.php';
put_search_box('');
?>
Help
Introduction
The Cancer Network Galaxy (TCNG) is a database of cancer gene networks
estimated from the publicly available cancer gene expression data.
You can search for or browse networks via a WWW browser.
Search
Network and gene search
You can search for networks and genes by key words in the search box.
Currently, array set names and array set descriptions associated to networks,
and gene names and gene descriptions of genes are target field of the search.
If you enter more than one words, it is regarded as "AND" search; that is,
it searches for network and genes including all of the key words you entered.
To perform "OR" search, use keyword "OR" between words.
Edge search
Use keyword "target:edge" to perform edge search.
However, unfortunately it is very very slow.
For edge search, you can also use "pa:x" and "ch:x"
keywords where x is any keywords in order to search for edges by specifying
keywords for parents and children, particularly.
XML and JSON interface
Some pages support XML and JSON output. Add "&fmt=xml" or
"&fmt=json" to the HTTP URI.
Please do not submit too much requests from your scripts or software.
Please limit to 1 request per 3 seconds. That is, wait for at least 3 seconds between
your accesses.
Terminologies
A
- array set
- An array set is defined as a set of DNA microarrays.
The network estimation methods require many microarrays as input data for estimating
gene-to-gene regulatory relationships.
B
- Bayesian network
- A model of gene network. It can model causal relationships between random variables.
In gene network estimation, the gene expression of a gene is regarded as a random variable
in a network.
The networks in TCNG are estimated with the Bayesian network model.
- betweenness
- An index of centrality of nodes. It represents that how close
a particular node is located at the center of a network.
C
- child
- If two nodes are connected with an directed edge (arrow), the sink node
is called as child, child node or child gene.
- CSML
- Cell System Markup language. An XML format used in Cell Illustrator Online.
You can download networks in CSML from each network page.
D
- dataset
- A dataset is defined as a pair of a gene set and an array set.
The network estimation is performed for a dataset to estimate a network.
- degree
- A degree of a node is the number of edges connected to it.
It is the sum of the number of parents and children of it.
E
- edge
- An edge in a network is a directed arrow connecting two nodes.
It represents that two nodes (genes) are estimated as being related
in terms of gene expression, or more simply, if an edge is connected
from node A to B, then A regulates B.
G
- gene set
- A gene set is defined as a set of genes (nodes) used for the single gene network estimation.
H
- hub
- A hub, a hub gene, or a hub node is a node which has many child nodes in a network.
Hub genes are very important because they are considered as master regulators in a network.
N
- network
- A network or a gene network is a set of nodes and directed edges.
A network is estimated from a dataset by a network estimation
method. The network name is made from a combination of
a gene set name, an array set name and the estimation method.
- node
- A node is a point connected with edges in a network.
It is generally defined as a single probe in DNA arrays.
Thus, if it matches for more than one genes, it consists of
these matched genes. If so, in TCNG, the node name is made
from the matched gene symbols concatenated with triple slashes
(///).
Two nodes are connected with an directed edge.
P
- parent
- If two nodes are connected with an directed edge (arrow), the source node
is called as parent, parent node, or parent gene.
S
- SiGN
- SiGN is a collection of gene network estimation software develped at
Laboratory of DNA Information Analysis & Laboratory of Sequence Analysis,
Human Genome Center, Institute of Medical Science, The University of Tokyo; and
Department of Computer Science, Graduate School of Information Science and Technology,
The University of Tokyo. Visit SiGN web site
for more details.
- SiGN-BN
- SiGN-BN is gene network estimation software using Bayesian networks included in
SiGN. Visit SiGN-BN web site for more details.