Human-centric Tasks Research Articles

Large language models (LLMs) have demonstrated remarkable generalization across diverse tasks, leading individuals to increasingly use them as personal assistants due to their emerging reasoning capabilities. Nevertheless, a notable obstacle emerges when including numerical/temporal data in prompts, such as data sourced from wearables or electronic health records. LLMs employ tokenizers on their input that break down text into smaller units. However, tokenizers are not designed to represent numerical values and might struggle to capture repetitive patterns and context, treating consecutive values as separate tokens and disregarding their temporal relationships. This article discusses the challenges of representing and tokenizing temporal data. It argues that naively passing timeseries to LLMs can be ineffective due to the modality gap between numbers and text. We conduct a case study by tokenizing a sample mobile sensing dataset using the OpenAI tokenizer. We also review recent works that feed timeseries data into LLMs for human-centric tasks, outlining common experimental setups like zero-shot prompting and few-shot learning. The case study shows that popular LLMs split timestamps and sensor values into multiple nonmeaningful tokens, indicating they struggle with temporal data. We find that preliminary works rely heavily on prompt engineering and timeseries aggregation to "ground" LLMs, hinting that the "modality gap" hampers progress. The literature was critically analyzed through the lens of models optimizing for expressiveness versus parameter efficiency. On one end of the spectrum, training large domain-specific models from scratch is expressive but not parameter-efficient. On the other end, zero-shot prompting of LLMs is parameter-efficient but lacks expressiveness for temporal data. We argue tokenizers are not optimized for numerical data, while the scarcity of timeseries examples in training corpora exacerbates difficulties.
We advocate balancing model expressiveness and computational efficiency when integrating temporal data. Prompt tuning, model grafting, and improved tokenizers are highlighted as promising directions. We underscore that despite promising capabilities, LLMs cannot meaningfully process temporal data unless the input representation is addressed. We argue that this paradigm shift in how we leverage pretrained models will particularly affect the area of biomedical signals, given the lack of modality-specific foundation models.
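The splitting effect described in the case study can be illustrated with a minimal sketch. It does not use the OpenAI tokenizer: it is a toy greedy longest-match tokenizer over a small, hypothetical vocabulary (`HYPOTHETICAL_VOCAB`, `greedy_tokenize`, and the sample reading are all assumptions made for illustration), intended only to show how a timestamp and a sensor value can end up as many fragments that carry no numerical meaning on their own.

```python
# Toy illustration only: greedy longest-match tokenization over a small,
# hypothetical vocabulary. This is NOT the OpenAI tokenizer and not BPE.
HYPOTHETICAL_VOCAB = {
    "20", "23", "202", "00", "01", "15", "12", "30", "45", "98", "36",
    "-", ":", " ", ".", ",", "=", "heart", "_rate",
}


def greedy_tokenize(text, vocab=HYPOTHETICAL_VOCAB):
    """Repeatedly take the longest vocabulary entry matching at the current
    position; characters not covered by the vocabulary become 1-char tokens."""
    max_len = max(len(entry) for entry in vocab)
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in vocab or size == 1:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical sensor reading: one timestamp and one heart-rate value.
reading = "2023-01-15 12:30:45,heart_rate=98.36"
tokens = greedy_tokenize(reading)
print(tokens)
print(len(tokens), "tokens for", len(reading.split(",")), "fields")
```

In this toy setting the year is cut into "202" and "3", and the value 98.36 into three pieces, so adjacent readings share no token that encodes their magnitude or their order in time. Real subword tokenizers differ in detail, but the abstract reports the same kind of fragmentation.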

This article presents a disassembly task planning algorithm considering human–robot collaboration (HRC) and human behavior prediction (HBP). Unlike assembly procedures, the disassembly of end-of-life (EOL) products has been a labor-intensive process with uncertainties difficult to cope with. Meanwhile, it is usually challenging to obtain an optimal sequence efficiently without excessive computational cost. Also, the conventional human-centered task planning, in which the robot has to halt frequently due to unsafe interruptions by human motions, may decrease the efficiency of the disassembly process. In this article, a sequence planner is proposed to assign tasks in real time between a human operator and a robot to overcome the aforementioned challenges. The cost function includes the effort of the human and the robot in terms of both movement distance and time spent on the tasks. The constraints include the disassembly rules and the safety of the human operation. The optimal sequence is generated by solving an optimization problem in a receding-horizon way. In particular, at each step, the proposed disassembly sequence planner locates the workers (a human operator and a robot) and the to-be-disassembled components, predicts human movement for the next several steps, and obtains the optimal disassembly sequence for the next several steps following disassembly rules and safety constraints. Experiments have been extensively conducted on the disassembly of a wooden toybox and a used hard disk drive (HDD) to validate the proposed disassembly sequence planner. The planner has successfully generated the disassembly sequence in an HRC setting explicitly considering real-time human motion prediction and assigned the human operator and the robot to collaboratively complete disassembly tasks without violating disassembly rules and safety constraints.
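The receding-horizon loop described above can be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' planner: the component layout, precedence rules, task times, and safety radius are hypothetical, `predict_human` is a placeholder that assumes the human stays in place, tasks are assigned one at a time rather than in parallel, and a brute-force search over a short horizon stands in for the optimization problem solved in the article.

```python
import math

# Hypothetical toy data: component positions (metres) and precedence rules
# (each component maps to the set of components that must be removed first).
POSITIONS = {"lid": (0.0, 0.0), "screw": (0.2, 0.0),
             "board": (0.6, 0.0), "motor": (1.0, 0.0)}
PRECEDENCE = {"lid": set(), "screw": {"lid"},
              "board": {"screw"}, "motor": {"lid"}}
TASK_TIME = {"human": 2.0, "robot": 3.0}  # assumed seconds per task
SAFETY_RADIUS = 0.3                       # assumed minimum human-robot gap
HORIZON = 2                               # steps of lookahead


def predict_human(human_pos):
    """Placeholder predictor: assume the human stays at the current position."""
    return human_pos


def step_cost(worker, pos, target):
    """Effort of one task: travel distance plus time spent on the task."""
    return math.dist(pos, target) + TASK_TIME[worker]


def feasible(worker, comp, removed, human_pos):
    """Check disassembly rules and the safety constraint for one assignment."""
    if not PRECEDENCE[comp] <= removed:
        return False
    too_close = math.dist(POSITIONS[comp], predict_human(human_pos)) < SAFETY_RADIUS
    return not (worker == "robot" and too_close)


def best_plan(remaining, removed, where, depth):
    """Brute-force search over assignments up to `depth` steps ahead."""
    if depth == 0 or not remaining:
        return 0.0, []
    best = (math.inf, [])
    for worker in ("human", "robot"):
        for comp in sorted(remaining):
            if not feasible(worker, comp, removed, where["human"]):
                continue
            cost = step_cost(worker, where[worker], POSITIONS[comp])
            moved = {**where, worker: POSITIONS[comp]}
            tail_cost, tail = best_plan(remaining - {comp}, removed | {comp},
                                        moved, depth - 1)
            if cost + tail_cost < best[0]:
                best = (cost + tail_cost, [(worker, comp)] + tail)
    return best


def run():
    """Plan over the horizon, execute only the first step, then replan."""
    remaining, removed = set(POSITIONS), set()
    where = {"human": (0.0, 0.0), "robot": (1.0, 0.5)}
    executed = []
    while remaining:
        _, plan = best_plan(remaining, removed, where, HORIZON)
        worker, comp = plan[0]
        executed.append((worker, comp))
        where[worker] = POSITIONS[comp]
        remaining.discard(comp)
        removed.add(comp)
    return executed


executed = run()
print(executed)
```

Only the first assignment of each plan is executed before replanning, which is what makes the scheme receding-horizon: later steps are recomputed once worker positions and the human prediction have been updated.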

Articles published on Human-centric Tasks

Data augmentation in human-centric vision

Algorithmic management and human-centered task design: a conceptual synthesis from the perspective of action regulation and sociomaterial systems theory.

The first step is the hardest: pitfalls of representing and tokenizing temporal data for large language models.

Collaborative human-centered design of manufacturing tasks: a multi-user immersive VR experience

Human-Centered Task Allocation: A Simulation-Based Case Study

Multi-Objective Multi-Resource Task Allocation For Collaborative Robots Systems

Robot-Assisted Disassembly Sequence Planning With Real-Time Human Motion Prediction

From Handcrafted to Deep Features for Pedestrian Detection: A Survey.

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis.

Generating Sketch-Based Synthetic Seismic Images With Generative Adversarial Networks

A neural signature of pattern separation in the monkey hippocampus

Neck kinematics and muscle activity during mobile device operations

From Abstract Task Knowledge to Executable Robot Programs

Model-based human-centered task automation: A case study in ACC system design

