Abstract

At Linguistic Technology Systems, we are using Sequence Package Analysis (SPA) to architect a new, pragmatically based part-of-speech (POS) tagging program that better conforms to the fluidity and dynamism of human speech. This would allow natural language-driven voice user interfaces and audio mining programs – for use in both commercial and government applications – to adapt to the in situ construction of dialog, marked by the imprecision, ambiguity and vagueness extant in real-world communications. Conventional POS tagging programs consist of parsing structures derived from syntactic (and semantic) analysis, yet speech system developers (and users) are well aware that speech recognition difficulties still plague such conventional spoken dialog systems. This is because the inexactitude, vagueness, and uncertainty that are inextricable from the dynamic and fluid nature of human dialog in the real world (e.g., a sudden accretion of anger/frustration may transform a simple question into a rhetorical one, or transform an otherwise simple and straightforward assessment into a gratuitous/sardonic remark) cannot be adequately addressed by conventional POS tagging programs based on syntactic and/or semantic analysis. If we consider for a moment that the human mind does not appear (for the most part) to have much difficulty following the vagarious ebb and flow of dialog with remarkable accuracy and comprehension, so that business transactions and social acts are consummated with a fair amount of regularity and predictability in our quotidian lives, why can’t we design spoken dialog systems to emulate the human mind?
To do this, we must first uncover the special formulae that humans regularly invoke to understand human-to-human dialog, which by virtue of its fluid and dynamic constitution is often punctuated by ambiguities, obscurities, repetitions, ellipses, and deixes (indirect referents) – the same stubborn and ineluctable features of natural language which individually and collectively impede the performance of speech systems. Using a unique set of parsing structures – consisting of context-free grammatical units, with notations for related prosodic features – to capture the fluid/dynamic nature of human speech, SPA meets the goal of soft computing: to exploit the tolerance for imprecision, uncertainty, obscurity, and approximation in order to achieve tractability, robustness and low solution cost. As a hybrid method – uniquely combining conversation analysis with computational linguistics – SPA is also complementary to artificial neural networks and fuzzy logic. In building a flexible and adaptable natural language speech interface, neural networks, or connectionist models, may be viewed as the natural choice for investigating the patterns underlying the orderliness of talk, as they are equipped to handle the ambiguities of natural language due to their capacity, when confronted with incomplete or somewhat conflicting information, to produce a fuzzy set.

Keywords

Sequence Package Analysis; Part-of-Speech Tagging; Artificial Neural Networks; Fuzzy Logic; Conversation Analysis; Natural Language Understanding; Soft Computing; Voice-User Interface; Audio Mining


