The planner’s role in crowdsourcing involves determining the time to stop collecting information (i.e., timed decision, TD). Previous studies have modeled the uncertain crowdsourcing environment as Partially Observable Markov Decision Processes (POMDPs) and utilized the value of information (VOI) to balance the utilities and costs of information collection. However, there is still room for optimization in solving the TD problem within single-agent POMDPs. In this paper, we propose a partitioning Monte Carlo approach for consensus tasks based on the option-candidate (OC) model partitioning. We simplify the state representation for the OC model and introduce a multi-agent POMDP representation. We establish a correspondence between the single-agent and multi-agent models using two consistency theorems. We propose three progressively improved partitioning Monte Carlo (PMC) algorithms to solve the TD problem within the multi-agent POMDP. We conducted experiments in synthetic domains and a citizen science project, demonstrating that the proposed algorithms exhibit continuous improvements in runtime advantages and consistently outperform the SOTA single-agent Monte Carlo sampling-based algorithm while providing nearly consistent output results.
Read full abstract