Archival Files Research Articles

We present a streamlined technical solution ("Publish First") designed to assist smaller, resource-constrained herbaria in rapidly publishing their specimens to the Global Biodiversity Information Facility (GBIF). Specimen data from smaller herbaria, particularly those in biodiversity-rich regions of the world, provide a valuable and often unique contribution to the global pool of biodiversity knowledge (Marsico et al. 2020). However, these institutions often face challenges not applicable to larger herbaria, including a lack of staff with technical skills, limited staff hours for digitization work, inadequate financial resources for specialized scanning equipment, cameras, lights, and imaging stands, limited (or no) access to computers and collection management software, and unreliable internet connections. Data-scarce and biodiversity rich countries are also often linguistically diverse (Gorenflo et al. 2012), and staff may not have English skills, which means pre-existing online data publication resources and guides are of limited use. The "Publish First" method we are trialing, addresses several of these issues: it drastically simplifies the publication process so technical skills are not necessary; it minimizes administrative tasks saving time; it uses simple, cheap and easily available hardware; it does not require any specialized software; and the process is so simple that there is little to no need for any written instructions. "Publish first" requires staff to attach QR code labels containing identifiers to herbarium specimen sheets, scan these sheets using a document scanner costing around €300, then drag and drop these files to an S3 bucket (a cloud container that specialises in storing files). Subsequently, these images are automatically processed through an Optical Character Recognition (OCR) service to extract text, which is then passed on to OpenAI's Generative Pre-Transformer 4 (GPT-4) Application Programming Interface (API), for standardization. The standardized data is integrated into a Darwin Core Archive file that is automatically published through GBIF's Integrated Publishing Toolkit (IPT) (GBIF 2021). The most technically challenging aspect of this project has been the standardization of OCR data to Darwin Core using the GPT-4 API, particularly in crafting precise prompts to address the inherent inconsistency and lack of reliability in these Large Language Models (LLMs). Despite this, GPT-4 outperformed our manual scraping efforts. Our choice of GPT-4 as a model was a naive one: we implemented the workflow on some pre-digitized specimens from previously published Norwegian collections, compared the published data on GBIF with GPT-4's Darwin Core standardized output, and found the results satisfactory. Moving forward, we plan to undertake more rigorous additional research to compare the effectiveness and cost-efficiency of different LLMs as Darwin Core standardization engines. We are also particularly interested in exploring the new "function calling" feature added to the GPT-4 API, as it promises to allow us to retrieve standardized data in a more consistent and structured format. This workflow is currently under trial in Tajikistan, and may possibly be used in Uzbekistan, Armenia and Italy in the near future.

Read full abstract

Drawing borders in post-conflict situations is a challenging undertaking between two or more actors that often ends up in arbitration. In some cases, it produces a political confrontation that may turn into a cycle of violence. This article sheds light on the dynamics of political and security challenges, the interaction of the foreign actors and the role of the local government and civic activism in resolving disputes related to the Kosovo-Macedonia border. This article focuses on the obstacles that came from the non-definition of the status of Kosovo and the popular and institutional dissatisfaction regarding the agreement on the border between the Federal Republic of Yugoslavia (FRY) and Former Yugoslav Republic of Macedonia (FYROM), bypassing Kosovo and UNMIK from decisionmaking. Secondly, it asks whether these two sovereign countries have had the right to decide on the part of the border that separates Kosovo and Macedonia and was it an appropriate moment to reach an agreement on the border in tense situation between Kosovo, Serbia, and Macedonia? If so, why was Kosovo not included in the final stage of implementation of the agreement? Third, in unclear situation with Kosovo political status, which of the parties to the agreement would be able undertake practical ground activity, that of placing the border stones and which kind of writings will take place on them: „Serbia” and „Macedonia”, or „Kosovo” and „Macedonia”? Could the implementation of the agreement be postponed, at least for the part that divided Kosovo and Macedonia, and completed instead after the final status of Kosovo was determined? We argue that political momentum between Kosovo-Macedonia-Serbia triangle did not favor achieve such sensitive agreement between newly created states of Federal Republic of Yugoslavia and FYROM. Excluding Kosovo provisional institutions and UN civil administration from the border agreement was a mistake that produced instability, hostility and additional bitterness in interethnic relations at the early stages, followed by the status quo. And, finally, including Kosovo as a partner in implementing the border issue paved the way for interstate cooperation that led to Macedonia’s recognition of Kosovo, which erupt a short wave of anti-Macedonian rhetoric by both, Serbian political leadership and people protests. The evidence used for the arguments presented were positivists qualitative methods such as social survey and official statistics. The principle of uti possidetis was applied on the border disputes in the period after the breakup of Yugoslavia, and also in the case of the demarcation of the border between Kosovo and the states of Macedonia, Montenegro and Albania, as the best solution because it lies in „its primary aim of securing respect for the territorial boundaries at the moment when independence is achieved”. In drawing conclusions related to the article topic, I used a combined methodology of literature research, comparative analyses and positivist qualitative methods such as social surveys through structured questionnaires, official statistics, interviewing the bearers of the institutions of the time and members of the technical commission for border demarcation. Archive of Kosovo Parliament and personal files also became important sources.

Read full abstract

Archival Files Research Articles

Related Topics

Articles published on Archival Files

Private Storage Cloud for Facilitate the Functions of Organizations

PRAME Expression Is a Useful Tool in the Diagnosis of Primary and Metastatic Dedifferentiated and Undifferentiated Melanoma.

Калмыцко-казахские отношения в период откочевки калмыков из России в Китай в 1771 г.

SISTEM INFORMASI CUTI KARYAWAN BERBASIS WEB PADA PT GIKEN PRECISION INDONESIA

ANALISA POLA PEMBELIAN KONSUMEN MENGGUNAKAN DATA MINING DENGAN ALGORITMA APRIORI (STUDI KASUS: EDUKITS BATAM CENTRE)

"Publish First": A Rapid, GPT-4 Based Digitisation System for Small Institutes with Minimal Resources

Minimally invasive approach with small diameter pleural drainage catheter (Easydren®) in malignant pleural effusions

Papiller Tiroid Karsinomada Tümör Büyüklüğü ve Metastazın Belirticisi Olarak Nötrofil Lenfosit Oranı

Sino-French Normalization and Its Impact on the United States and Taiwan

Pelatihan Manajemen Arsip Digital berbasis Google Drive Desktop bagi Pengurus Pondok Pesantren se-Kecamatan Sangkapura

Information Support of Oraganization of Communication Between Citizens and Archival Institutions

Političke i bezbjednosne dimenzije u rješavanju Kosovsko-Sjevernomakedonske demarkacije granice

“Doctors Aren’t Familiar with Your Tissues”: Self-Examination and Feminist Health Activism in 1970s Canada

DNA storage in thermoresponsive microcapsules for repeated random multiplexed data access.

SISTEM INFORMASI PENGARSIPAN SURAT MASUK DAN SURAT KELUAR PADA SD NEGERI 28 KOTA JAMBI

Analisis Sistem Administrasi Dan Keuangan Pada PERUM BULOG Kantor Cabang Batam

Capsular remnant in the rotator cuff footprint is a novel arthroscopic finding may indicate the etiology of the tear.

Prevalence of High-Risk Human Papillomavirus Types and Their Association with Cervical Squamous Cell Carcinoma, and High- and Low-Grade Squamous Intraepithelial Lesions in Turkish Women.

A “careful study” on public opinion. An exemplary investigation of media monitoring through press clippings collections in the League of Nations’ Information and Mandates Sections

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Archival Files Research Articles

Related Topics

Articles published on Archival Files

Private Storage Cloud for Facilitate the Functions of Organizations

PRAME Expression Is a Useful Tool in the Diagnosis of Primary and Metastatic Dedifferentiated and Undifferentiated Melanoma.

Калмыцко-казахские отношения в период откочевки калмыков из России в Китай в 1771 г.

SISTEM INFORMASI CUTI KARYAWAN BERBASIS WEB PADA PT GIKEN PRECISION INDONESIA

ANALISA POLA PEMBELIAN KONSUMEN MENGGUNAKAN DATA MINING DENGAN ALGORITMA APRIORI (STUDI KASUS: EDUKITS BATAM CENTRE)

"Publish First": A Rapid, GPT-4 Based Digitisation System for Small Institutes with Minimal Resources

Minimally invasive approach with small diameter pleural drainage catheter (Easydren®) in malignant pleural effusions

Papiller Tiroid Karsinomada Tümör Büyüklüğü ve Metastazın Belirticisi Olarak Nötrofil Lenfosit Oranı

Sino-French Normalization and Its Impact on the United States and Taiwan

Pelatihan Manajemen Arsip Digital berbasis Google Drive Desktop bagi Pengurus Pondok Pesantren se-Kecamatan Sangkapura

Information Support of Oraganization of Communication Between Citizens and Archival Institutions

Političke i bezbjednosne dimenzije u rješavanju Kosovsko-Sjevernomakedonske demarkacije granice

“Doctors Aren’t Familiar with Your Tissues”: Self-Examination and Feminist Health Activism in 1970s Canada

DNA storage in thermoresponsive microcapsules for repeated random multiplexed data access.

SISTEM INFORMASI PENGARSIPAN SURAT MASUK DAN SURAT KELUAR PADA SD NEGERI 28 KOTA JAMBI

Analisis Sistem Administrasi Dan Keuangan Pada PERUM BULOG Kantor Cabang Batam

Capsular remnant in the rotator cuff footprint is a novel arthroscopic finding may indicate the etiology of the tear.

Prevalence of High-Risk Human Papillomavirus Types and Their Association with Cervical Squamous Cell Carcinoma, and High- and Low-Grade Squamous Intraepithelial Lesions in Turkish Women.

A “careful study” on public opinion. An exemplary investigation of media monitoring through press clippings collections in the League of Nations’ Information and Mandates Sections