Development of Large Language Models: Copyright Law Perspectives for Research Institutions and Research Libraries

Inger Berg Ørstavik

doi:10.1017/jli.2024.46

Inger Berg Ørstavik

https://doi.org/10.1017/jli.2024.46

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Abstract This article discusses European copyright law as applied to the development and training of generative AI and natural language processing in public interest research institutions and libraries. The article focuses on the scope of the new exceptions from copyright law for text and data mining (TDM) for research purposes and discusses them from the perspective of research ethics and principles of open science in publicly financed research. The public interest mission of research institutions and libraries includes the open dissemination of research results but the exceptions from copyright are focused only on the training phase in AI development. Regulation on data transparency is fragmented. The article finds that while new exceptions open for developing language models under research institutions and libraries’ public interest mission to preserve national languages, the regulation is not adapted to principles of research ethics and open science, and legal uncertainty remains.

Full Text