Key-value Memory Networks Research Articles

One of the fundamental tasks when providing personalized tutoring services to learners in online learning systems, such as intelligent tutoring systems and massive open online courses, is the learner knowledge diagnosis (LKD). LKD obtains the learner knowledge proficiency on skills by modeling their learning performance. Learners’ knowledge construction process is not static, but evolves overtime; hence, the evolution of learners’ knowledge proficiency must be dynamically traced. Moreover, considering the wide usage of online learning systems by large numbers of learners, the LKD task also needs to meet the requirements of large-scale assessment and interpretability to explain the diagnosed results. The existing models are either designed for static scenarios or find it difficult to explain the causality between learner performance and knowledge proficiency, as well as the item characteristics. To solve these issues, we propose herein a novel model, called the knowledge interaction-enhanced dynamic LKD (KIEDLKD), to develop learner performance, and hence, dynamically diagnose and trace the evolution of each learner’s knowledge proficiency during the exercising activities. We first propose a dynamic LKD framework by unifying the strength of the memory capacity of the key-value memory network to enhance the representation of the knowledge state during learner performance modeling and the interpretability of the Item Response Theory (IRT) to explain the learner performance in terms of knowledge proficiency and item characteristics (i.e., item difficulty and discrimination). In this framework, we diagnose and trace each learner’s knowledge proficiency on each knowledge concept (KC) over time and store them into an auxiliary memory using the key-value memory network. We further infer their general proficiencies and the IRT-based item characteristics using another neural network. Moreover, we propose the knowledge interaction concept among KCs and incorporate it into the LKD procedure to further exploit the long-term dependencies in the exercising sequences, thereby devising the KIEDLKD model. We also incorporate the learner-oriented cognitive item difficulty into our model, based on each learner’s exercising history, to adaptively model the item difficulty. Based on these factors, our KIEDLKD model can not only output the learners’ knowledge proficiency in a multi-granularity manner but also output the item characteristics, making it possible to interpret the learner performances in terms of their current knowledge states and item characteristics. Extensive experiments are conducted from six perspectives on five real-world datasets to test our model.The results of learner performance prediction demonstrate the superiority of our model on the LKD task. It can also automatically discover the underlying interaction between each pair of latent KCs, and the underlying concepts for each exercise. The ablation study verifies the contributions of each component in our model. Moreover, it can depict the evolution of learner knowledge proficiency in a multi-granularity manner and provide additional information for skill domain analysis, which enables the interpretability of our model.

While earlier research in human-robot interaction pre-dominantly uses rule-based architectures for natural language interaction, these approaches are not flexible enough for long-term interactions in the real world due to the large variation in user utterances. In contrast, data-driven approaches map the user input to the agent output directly, hence, provide more flexibility with these variations without requiring any set of rules. However, data-driven approaches are generally applied to single dialogue exchanges with a user and do not build up a memory over long-term conversation with different users, whereas long-term interactions require remembering users and their preferences incrementally and continuously and recalling previous interactions with users to adapt and personalise the interactions, known as the lifelong learning problem. In addition, it is desirable to learn user preferences from a few samples of interactions (i.e., few-shot learning). These are known to be challenging problems in machine learning, while they are trivial for rule-based approaches, creating a trade-off between flexibility and robustness. Correspondingly, in this work, we present the text-based Barista Datasets generated to evaluate the potential of data-driven approaches in generic and personalised long-term human-robot interactions with simulated real-world problems, such as recognition errors, incorrect recalls and changes to the user preferences. Based on these datasets, we explore the performance and the underlying inaccuracies of the state-of-the-art data-driven dialogue models that are strong baselines in other domains of personalisation in single interactions, namely Supervised Embeddings, Sequence-to-Sequence, End-to-End Memory Network, Key-Value Memory Network, and Generative Profile Memory Network. The experiments show that while data-driven approaches are suitable for generic task-oriented dialogue and real-time interactions, no model performs sufficiently well to be deployed in personalised long-term interactions in the real world, because of their inability to learn and use new identities, and their poor performance in recalling user-related data.

Key-value Memory Networks Research Articles

Related Topics

Articles published on Key-value Memory Networks

Enhanced Dynamic Key-Value Memory Networks for Personalized Student Modeling and Learning Ability Classification

Explore Bayesian analysis in Cognitive-aware Key–Value Memory Networks for knowledge tracing in online learning

Exploring the Teaching Mode of Secondary English Education Based on Big Data Technology

DKVMN-KAPS: Dynamic Key-Value Memory Networks Knowledge Tracing With Students’ Knowledge-Absorption Ability and Problem-Solving Ability

Confidence-based dynamic cross-modal memory network for image aesthetic assessment

Use of a Deep Learning Approach for the Evaluation of Students' Online Learning Cognitive Ability

Document-Level Event Role Filler Extraction Using Key-Value Memory Network

Kcr-FLAT: A Chinese-Named Entity Recognition Model with Enhanced Semantic Information.

Interpretable Knowledge Tracing: Simple and Efficient Student Modeling with Causal Relations

Knowledge interaction enhanced sequential modeling for interpretable learner knowledge diagnosis in intelligent tutoring systems

An Extensible Heterogeneous Network Embedding Framework for Knowledge Tracing

Coffee With a Hint of Data: Towards Using Data-Driven Approaches in Personalised Long-Term Interactions.

Learning to Respond with Your Favorite Stickers

Dynamic Key-Value Memory Networks With Rich Features for Knowledge Tracing.

A Dynamic Knowledge Diagnosis Approach Integrating Cognitive Features

Improving biomedical named entity recognition with syntactic information

KM[formula omitted]: Visual reasoning via Knowledge Embedding Memory Model with Mutual Modulation

Adversarially regularized medication recommendation model with multi-hop memory network

Knowledge-Enhanced Graph Neural Networks for Sequential Recommendation

Hierarchical Contextualized Representation for Named Entity Recognition

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Key-value Memory Networks Research Articles

Related Topics

Articles published on Key-value Memory Networks

Enhanced Dynamic Key-Value Memory Networks for Personalized Student Modeling and Learning Ability Classification

Explore Bayesian analysis in Cognitive-aware Key–Value Memory Networks for knowledge tracing in online learning

Exploring the Teaching Mode of Secondary English Education Based on Big Data Technology

DKVMN-KAPS: Dynamic Key-Value Memory Networks Knowledge Tracing With Students’ Knowledge-Absorption Ability and Problem-Solving Ability

Confidence-based dynamic cross-modal memory network for image aesthetic assessment

Use of a Deep Learning Approach for the Evaluation of Students' Online Learning Cognitive Ability

Document-Level Event Role Filler Extraction Using Key-Value Memory Network

Kcr-FLAT: A Chinese-Named Entity Recognition Model with Enhanced Semantic Information.

Interpretable Knowledge Tracing: Simple and Efficient Student Modeling with Causal Relations

Knowledge interaction enhanced sequential modeling for interpretable learner knowledge diagnosis in intelligent tutoring systems

An Extensible Heterogeneous Network Embedding Framework for Knowledge Tracing

Coffee With a Hint of Data: Towards Using Data-Driven Approaches in Personalised Long-Term Interactions.

Learning to Respond with Your Favorite Stickers

Dynamic Key-Value Memory Networks With Rich Features for Knowledge Tracing.

A Dynamic Knowledge Diagnosis Approach Integrating Cognitive Features

Improving biomedical named entity recognition with syntactic information

KM[formula omitted]: Visual reasoning via Knowledge Embedding Memory Model with Mutual Modulation

Adversarially regularized medication recommendation model with multi-hop memory network

Knowledge-Enhanced Graph Neural Networks for Sequential Recommendation

Hierarchical Contextualized Representation for Named Entity Recognition