DAM-SIG | 08:45 | June 27
Realtime coverage and discussions
Schedule
(Saturday June 27, 2009 | 08:45 - 17:30)
- 08:45 - 09:00 James Taylor & Anton Nekrutenko | galaxyproject.org | Introduction to the DAM-SIG
09:00 - 13:30 Session 1
- 09:00 - 09:35 Carole Goble | Manchester | Workflow development and reuse in Taverna
- 09:35 - 10:10 Gos Micklem | Cambridge | InterMine - Open source data warehouse
- 10:15 - 10:45 Caffeine Break #1
- 10:45 - 11:20 Christopher Lee | UCLA | Exploring the future of bioinformatics data sharing and mining with Pygr and Worldbase
- 11:20 - 11:55 Jill Mesirov | Broad | GenePattern to GenomeSpace - Integrative computational genomics and reproducible research
- 11:55 - 12:30 Eileen Kramer | U. of Georgia | Adapting the Galaxy Bioinformatics Tool to Support Semantic Web Service Composition
- 12:30 - 13:30 Carbohydrate, Lipid & Protein Break
13:30 - 15:30 Session 2
- 13:30 - 14:00 Carol Lushbough | U. of South Dakota | BioExtract Server
- 14:00 - 14:30 Philippe Rocca-Serra | EBI | Standards and infrastructure for managing experimental metadata
- 14:30 - 15:00 Christoph Best | EBI | Grid/cloud data storage and computing for 3D image databases
- 15:00 - 15:30 Kathy Wolstencroft | Manchester | SysMO-DB: Sharing and Exchanging Systems Biology Data and Models
- 15:30 - 16:00 Caffeine Break #2
16:00 - 17:30 Session 3 (joint with BOSC)
- 16:00 - 16:20 Morris Swertz | Groningen | MOLGENIS by example: generating an extensible platform for genotype and phenotype experiments
- 16:20 - 16:40 Robert Murphy | Carnegie Mellon U. | PSLID, the Protein Subcellular Location Image Database: Subcellular location assignments, annotated image collections, image analysis tools, and generative models of protein distributions
- 16:40 - 17:00 Mark Welsh | geospiza.com | BioHDF: Open binary file formats for large-scale data management
- 17:00 - 17:15 Brad Chapman | biopython.org | Lowering barriers to publishing biological data on the web
- 17:15 - 17:30 Kam Dahlquist | Loyola Marymount U. | XMLPipeDB: A Reusable, Open Source Tool Chain for Building Relational Databases from XML Sources
Background
Over the last few years we have seen another explosion in the data generation capacity of experimental biology. New DNA sequencing and high-density array technologies have enabled a variety of novel genome scale experimental techniques. What is perhaps most unique about this recent data explosion is that it is distributed – relatively inexpensive instruments allow any lab or institution to produce enormous amounts of data. Because of this change, the traditional model of centralized data warehouses for biological data is no longer sufficient. This has led to a new generation of systems for data management and analysis which are designed to put make it possible for experimentalists to effectively manage and analyze the wealth of data they can now produce, and integrate that data with existing date warehouses and literature, with little or no expert informatics support. These new approaches are critical if the promise of next generation experimental techniques is to be realized. This session will serve as an open forum for introduction of cutting edge applications that mold data sources and analysis tools into a singular medium benefitting both extremes on the user spectrum: experimentalists and computational researchers.