The ANISEED database: digital representation, formalization, and elucidation of a chordate developmental program

authors

  • Tassy O.
  • Dauga D.
  • Daian F.
  • Sobral D.
  • Robin F.
  • Khoueiry P.
  • Salgado David
  • Fox V.
  • Caillol D.
  • Schiappa R.
  • Laporte B.
  • Rios A.
  • Luxardi G.
  • Kusakabe T.
  • J. S. Joly
  • Darras S.
  • Christiaen L.
  • Contensin M.
  • Auger H.
  • Lamy C.
  • Hudson C.
  • Rothbacher U.
  • M. J. Gilchrist
  • K. W. Makabe
  • Hotta K.
  • Fujiwara S.
  • Satoh N.
  • Satou Y.
  • Lemaire Patrick

keywords

  • Gene-expression profiles
  • Factor-binding sites
  • Embryonic-cell lineage
  • Regulatory networks
  • Ascidian embryo
  • Signaling pathway
  • Genome

document type

ART

abstract

Developmental biology aims to understand how the dynamics of embryonic shapes and organ functions are encoded in linear DNA molecules. Thanks to recent progress in genomics and imaging technologies, systemic approaches are now used in parallel with small-scale studies to establish links between genomic information and phenotypes, often described at the subcellular level. Current model organism databases, however, do not integrate heterogeneous data sets at different scales into a global view of the developmental program. Here, we present a novel, generic digital system, NISEED, and its implementation, ANISEED, to ascidians, which are invertebrate chordates suitable for developmental systems biology approaches. ANISEED hosts an unprecedented combination of anatomical and molecular data on ascidian development. This includes the first detailed anatomical ontologies for these embryos, and quantitative geometrical descriptions of developing cells obtained from reconstructed three-dimensional (3D) embryos up to the gastrula stages. Fully annotated gene model sets are linked to 30,000 high-resolution spatial gene expression patterns in wild-type and experimentally manipulated conditions and to 528 experimentally validated cis-regulatory regions imported from specialized databases or extracted from 160 literature articles. This highly structured data set can be explored via a Developmental Browser, a Genome Browser, and a 3D Virtual Embryo module. We show how integration of heterogeneous data in ANISEED can provide a system-level understanding of the developmental program through the automatic inference of gene regulatory interactions, the identification of inducing signals, and the discovery and explanation of novel asymmetric divisions.

more information