External Supervisor Research Articles

Reinforcement learning (RL) has garnered significant attention for developing decision-making agents that aim to maximize rewards, specified by an external supervisor, within fully observable environments. However, many real-world problems involve partial or noisy observations, where agents cannot access complete and accurate information about the environment. These problems are commonly formulated as partially observable Markov decision processes (POMDPs). Previous studies have tackled RL in POMDPs either by incorporating the memory of past actions and observations or by inferring the true state of the environment from observed data. Nevertheless, aggregating observations and actions over time becomes impractical in problems with long decision-making time horizons and high-dimensional spaces. Furthermore, inference-based RL approaches often require many environmental samples to perform well, as they focus solely on reward maximization and neglect uncertainty in the inferred state. Active inference (AIF) is a framework naturally formulated in POMDPs that directs agents to select actions by minimizing a function called expected free energy (EFE). This supplements reward-maximizing (or exploitative) behavior, as in RL, with information-seeking (or exploratory) behavior. Despite this exploratory behavior, the use of AIF is limited to problems with small time horizons and discrete spaces due to the computational challenges associated with EFE. In this article, we propose a unified principle that establishes a theoretical connection between AIF and RL, enabling seamless integration of these two approaches and overcoming their limitations in continuous-space POMDP settings. We substantiate our findings with rigorous theoretical analysis, providing novel perspectives for using AIF in designing and implementing artificial agents. Experimental results demonstrate the superior learning capabilities of our method compared to alternative RL approaches in solving partially observable tasks with continuous spaces. Notably, our approach harnesses information-seeking exploration, enabling it to effectively solve reward-free problems and rendering explicit task reward design by an external supervisor optional.

Background: Neonatal mortality comprises an increasing proportion of childhood deaths in the developing world. Essential newborn care practices as recommended by the WHO may improve neonatal outcomes in resource-limited settings. Our objective was to pilot a Helping Babies Breathe and Essential Care for Every Baby (HBB and ECEB) implementation package using HBB-ECEB training combined with supportive supervision in rural Nicaragua. Methods: We employed an HBB-ECEB implementation package in El Ayote and Santo Domingo, two rural municipalities in Nicaragua, and used a pre- and post-intervention data collection design for comparison. Following a period of pre-intervention data collection (June–August 2015), care providers were trained in HBB and ECEB using a train-the-trainer model. An external supportive supervisor monitored processes of care and collected data. Data on newborn care processes and short-term outcomes such as hypothermia were collected from facility medical records and analyzed using standard run charts. Home visits were used to determine breastfeeding rates at 7, 30 and 60 days. Results: There were 480 institutional births during the study period (June 2015–June 2016). Following the HBB-ECEB implementation package, cord care improved (pre-intervention median 66%; post-intervention shift to ≥85%) and early skin-to-skin care improved (pre-intervention median 0%; post-intervention shift to ≥56%, with a high of 92% in June 2016). Rates of administration of ophthalmic ointment and vitamin K were high pre-intervention (median 97%) and remained high. Early initiation of breastfeeding increased, with a pre-intervention median of 25% and a post-intervention shift to ≥28%, with a peak of 81% in June 2016. Exclusive breastfeeding rates increased in the short term but were not significantly different by 60 days of life (9% pre-intervention versus 21% post-intervention). Conclusions: The implementation of the HBB-ECEB programs combined with supportive supervision improved the quality of care for newborns in terms of cord care, early skin-to-skin care and early initiation of breastfeeding. The rates of administration of ophthalmic ointment and vitamin K were high pre-intervention and remained high afterwards. These findings show that HBB-ECEB programs implemented with supportive supervision can improve quality of care for newborns.

Articles published on External Supervisor

Active Inference and Reinforcement Learning: A Unified Inference on Continuous State and Action Spaces Under Partial Observability.

REFORMULATION OF THE DUTIES AND AUTHORITIES OF THE PROSECUTOR’S COMMISSION IN INDONESIAN LEGISLATION

A generalized homogeneity-based formation control for perturbed unicycle multi-agent systems

Symphony of Community Social Work in Outlying Islands: Reflections between an External Supervisor and Social Workers

Supporting Relational, Trauma-Informed Social Care Work with Autistic Adults: Evaluation of a Reflective Supervision Group Pilot

Does environmental information disclosure affect corporate cash flow? An analysis by taking media attentions into consideration

Human Resources Management System At Police Resort Bengkulu Utara

Supervised learning of soliton X-junctions in lithium niobate films on insulator.

KEWENANGAN PENGAWASAN DAN ADVOKASI KOMISI YUDISIAL TERHADAP HAKIM BERDASARKAN UNDANG-UNDANG NO. 18 TAHUN 2011 PERUBAHAN ATAS UNDANG-UNDANG NO. 22 TAHUN 2004 TENTANG KOMISI YUDISIAL

Peer assessment as a method for facilitating cross-sector learning: A national pilot

How to Restrain Regulatory Capture and Promote Green Innovation in China. An Analysis Based on Evolutionary Game Theory

Research on Fiscal Expenditures on Science and Technology Efficiency, Fiscal Transparency and Media Coverage: Evidence from China

The internal/external debate: The tensions within social work supervision

A Study on the Relationship between Analysts’ Cash Flow Forecasts Issuance and Accounting Information: Evidence from Korea

What works in video-based youth statutory caseworker supervision – caseworker and supervisor perspectives

Essential Care for Every Baby: improving compliance with newborn care practices in rural Nicaragua

Development and validation of the ExPRESS instrument for primary health care providers’ evaluation of external supervision

‘Loitering with intent’ – a model of practice for working in a New Zealand secondary school

Challenges of Instructional Supervision of Senior High Schools in the Techiman Municipality in the Brong Ahafo Region of Ghana

The Potential Impacts of Becoming a Parent on Practice
