We have assisted clients in designing solutions that use the caGrid infrastructure to exchange data on a solid semantic foundation. caGrid is middleware designed to facilitate secure, federated access to information and analytical resources in a multi-institutional environment. For example, a research project may require integrative analysis of microarray, imaging, and clinical data. These datasets may be collected by different entities (such as shared resources and medical information warehouses) and may not be stored in a centralized system. caGrid can be used to create a "virtually centralized" data warehouse from such datasets: each dataset remains under the management of its respective owner but is exposed through caGrid service interfaces and tools, so that a researcher can access data from any of them through a common interface.
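The "virtually centralized" idea can be illustrated with a minimal sketch. It does not use the caGrid API. The class names (`DataService`, `InMemoryService`, `VirtualWarehouse`), the `query` method, and the sample records are all hypothetical, chosen only to show independently managed datasets being reached through one common interface.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Protocol


class DataService(Protocol):
    """Hypothetical common interface that every data owner exposes."""

    name: str

    def query(self, patient_id: str) -> list[dict]: ...


@dataclass
class InMemoryService:
    """Stand-in for an owner-managed dataset (in practice, a remote service)."""

    name: str
    records: list[dict] = field(default_factory=list)

    def query(self, patient_id: str) -> list[dict]:
        # Each owner answers the query against its own storage.
        return [r for r in self.records if r["patient_id"] == patient_id]


@dataclass
class VirtualWarehouse:
    """Fans one query out to every service; no data is copied centrally."""

    services: list[DataService]

    def query(self, patient_id: str) -> dict[str, list[dict]]:
        # Results are keyed by source so the researcher can see provenance.
        return {s.name: s.query(patient_id) for s in self.services}


# Made-up sample records, for illustration only.
microarray = InMemoryService(
    "microarray", [{"patient_id": "P1", "gene": "TP53", "expression": 2.4}]
)
imaging = InMemoryService(
    "imaging",
    [{"patient_id": "P1", "modality": "MRI"}, {"patient_id": "P2", "modality": "CT"}],
)
clinical = InMemoryService("clinical", [{"patient_id": "P2", "diagnosis": "glioma"}])

warehouse = VirtualWarehouse([microarray, imaging, clinical])
result = warehouse.query("P1")
```

Here `result` maps each source name to the matching records from that source, so a dataset with no matches still appears with an empty list. A real deployment would replace `InMemoryService` with clients for remote, secured services.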