Abstract

The Web has many forums for sharing personal data, but not for scientific data, and not in a way that lets machines access the data as users. A Web of data could add tremendous value by integrating disparate disciplines or enabling data-driven queries. Doing this is very complex and requires more robust standards than currently exist. The intended user of most data is not a person but a software application that can turn the data into something useful for humans. Such software could include search engines, analytic software, visualization tools, database back ends, and more. This need creates requirements for standards that differ greatly from those developed for displaying web data to people. Data software needs a much greater understanding of context, and that context has to be supplied alongside the data, either through direct integration with the data or by linking to a description of it in a persistent and accessible location. Data interoperability must be addressed at the start of system development because it is significantly harder and costlier to make these connections after two systems have each implemented non-standardized data collections. Data interoperability must address three levels: legal (intellectual property rights), technical (computer languages and formats), and semantic (meaning of the data). The technical level is the furthest along, thanks to Semantic Web technologies. Getting scientists to agree on the semantic level could be nearly impossible. The legal level offers the greatest opportunity: putting the data into the public domain. There are already precedents for this in genome data and the International Virtual Observatory. Putting data into the public domain simplifies the implementation of the technical level. Libraries and publishers in the scholarly publishing community should lead the web of data initiative, as they can ensure the connection, curation, and preservation needed.
The NSF-mandated data sharing could result in funding opportunities to build the web of data, but all involved must agree not to replicate the copyright-controlled model that currently exists for books and journals.
