Over the last decade the modeling and the storage of biological data has been a topic of wide interest for scientistis dealing with biological research.Currently most data is still stored in text files which leads to data redundancies and file caos. In this paper we show how to sue relational modeling techniques and relational database technology for modeling and storing biological sequence data,i.e.for data maintained in collections like EMBL or SWISS-PROT to better serve the needs for these application domains. For this reason we propose a two step approach.First,we model the structure (and therefore the meaning of the ) data using an Entity-Relationship approach.The ER modelleads to a clean design of a relational database schema for storing and retrieving the DNA and protein data extracted from various sources.Our approach provides the clean basis for building complex biological aplications that are more amenable to changes and software ports than their file-base counterparts.
展开▼