Automatic Information Retrieval Research Articles

Crowdsourcing ideas from consumers can enrich idea input in new product development. After a decade of initiatives (e.g., Starbucks’ MyStarbucksIdea, Dell's IdeaStorm), the implications of crowdsourcing for idea generation are well understood, but challenges remain in dealing with the large volume of rapidly generated ideas produced in crowdsourcing communities. This study proposes a model that can assist managers in efficiently processing crowdsourced ideas by identifying the aspects of ideas that are most predictive of future implementation and identifies three sources of information available for an idea: its content, the contributor proposing it, and the crowd's feedback on the idea (the “3Cs”). These information sources differ in their time of availability (content/contributor information is available immediately; crowd feedback accumulates over time) and in the extent to which they comprise structured or unstructured data. This study draws from prior research to operationalize variables corresponding to the 3Cs and develops a new measure to quantify an idea's distinctiveness. Applying automated information retrieval methods (latent semantic indexing) and testing several linear methods (linear discriminant analysis, regularized logistic regression) and nonlinear machine‐learning algorithms (stochastic adaptive boosting, random forests), this article identifies the variables that are most useful towards predicting idea implementation in a crowdsourcing community for an IT product (Mendeley). Our results indicate that consideration of content and contributor information improves ranking performance between 22.6 and 26.0% over random idea selection, and that adding crowd‐related information further improves performance by up to 48.1%. Crowd feedback is the best predictor of idea implementation, followed by idea content and distinctiveness, and the contributor's past idea‐generation experience. Firms are advised to implement two idea selection support systems: one to rank new ideas in real time based on content and contributor experience, and another that integrates the crowd's idea evaluation after it has had sufficient time to provide feedback.

By digitising legacy taxonomic literature using XML mark-up the contents become accessible to other taxonomic and nomenclatural information systems. Appropriate schemas need to be interoperable with other sectorial schemas, atomise to appropriate content elements and carry appropriate metadata to, for example, enable algorithmic assessment of availability of a name under the Code. Legacy (and new) literature delivered in this fashion will become part of a global taxonomic resource from which users can extract tailored content to meet their particular needs, be they nomenclatural, taxonomic, faunistic or other.To date, most digitisation of taxonomic literature has led to a more or less simple digital copy of a paper original – the output of the many efforts has effectively been an electronic copy of a traditional library. While this has increased accessibility of publications through internet access, the means by which many scientific papers are indexed and located is much the same as with traditional libraries. OCR and born-digital papers allow use of web search engines to locate instances of taxon names and other terms, but OCR efficiency in recognising taxonomic names is still relatively poor, people’s ability to use search engines effectively is mixed, and many papers cannot be searched directly. Instead of building digital analogues of traditional publications, we should consider what properties we require of future taxonomic information access. Ideally the content of each new digital publication should be accessible in the context of all previous published data, and the user able to retrieve nomenclatural, taxonomic and other data / information in the form required without having to scan all of the original papers and extract target content manually. This opens the door to dynamic linking of new content with extant systems: automatic population and updating of taxonomic catalogues, ZooBank and faunal lists, all descriptions of a taxon and its children instantly accessible with a single search, comparison of classifications used in different publications, and so on. A means to do this is through marking up content into XML, and the more atomised the mark-up the greater the possibilities for data retrieval and integration. Mark-up requires XML that accommodates the required content elements and is interoperable with other XML schemas, and there are now several written to do this, particularly TaxPub, taxonX and taXMLit, the last of these being the most atomised. We now need to automate this process as far as possible. Manual and automatic data and information retrieval is demonstrated by projects such as INOTAXA and Plazi. As we move to creating and using taxonomic products through the power of the internet, we need to ensure the output, while satisfying in its production the requirements of the Code, is fit for purpose in the future.

Automatic Information Retrieval Research Articles

Related Topics

Articles published on Automatic Information Retrieval

Arabic Dialect Identification based on Probabilistic-Phonetic Modeling

A selectional auto-encoder approach for document image binarization

Coherence in general and personal semantic knowledge: functional differences of the posterior and centro-parietal N400 ERP component.

Spoken keyword search system using improved ASR engine and novel template-based keyword scoring

Rumination impairs the control of stimulus-induced retrieval of irrelevant information, but not attention, control, or response selection in general

Similarity‐Based Summarization of Music Files for Support Vector Machines

Бібліографічна база даних як джерело наукометричних досліджень

Generating a Tolerogenic Cell Therapy Knowledge Graph from Literature.

Pivoted Document Length Normalization

Identifying New Product Ideas: Waiting for the Wisdom of the Crowd or Screening Ideas in Real Time

Verb generation task: early automatic information retrieval or late effortful decision-making?

Scattering-Based Nonlocal Means SAR Despeckling

Digitising legacy zoological taxonomic literature: Processes, products and using the output

Codified Hashtags for Weather Warning on Twitter: an Italian Case Study

Automatic and Controlled Semantic Retrieval: TMS Reveals Distinct Contributions of Posterior Middle Temporal Gyrus and Angular Gyrus.

What a Computer-Based Legal Reference Work Can and Must Deliver

Ten years of science news: A longitudinal analysis of scientific culture in the Spanish digital press.

GA-Based Adaptive Window Length Estimation for Highly Accurate Audio Segmentation

Methods of Processing and Retrieval of Information from Digital Particle Holograms and Their Application

Research of Drug Name Entity Recognition Based on Constructed Dictionary and Conditional Random Field

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Automatic Information Retrieval Research Articles

Related Topics

Articles published on Automatic Information Retrieval

Arabic Dialect Identification based on Probabilistic-Phonetic Modeling

A selectional auto-encoder approach for document image binarization

Coherence in general and personal semantic knowledge: functional differences of the posterior and centro-parietal N400 ERP component.

Spoken keyword search system using improved ASR engine and novel template-based keyword scoring

Rumination impairs the control of stimulus-induced retrieval of irrelevant information, but not attention, control, or response selection in general

Similarity‐Based Summarization of Music Files for Support Vector Machines

Бібліографічна база даних як джерело наукометричних досліджень

Generating a Tolerogenic Cell Therapy Knowledge Graph from Literature.

Pivoted Document Length Normalization

Identifying New Product Ideas: Waiting for the Wisdom of the Crowd or Screening Ideas in Real Time

Verb generation task: early automatic information retrieval or late effortful decision-making?

Scattering-Based Nonlocal Means SAR Despeckling

Digitising legacy zoological taxonomic literature: Processes, products and using the output

Codified Hashtags for Weather Warning on Twitter: an Italian Case Study

Automatic and Controlled Semantic Retrieval: TMS Reveals Distinct Contributions of Posterior Middle Temporal Gyrus and Angular Gyrus.

What a Computer-Based Legal Reference Work Can and Must Deliver

Ten years of science news: A longitudinal analysis of scientific culture in the Spanish digital press.

GA-Based Adaptive Window Length Estimation for Highly Accurate Audio Segmentation

Methods of Processing and Retrieval of Information from Digital Particle Holograms and Their Application

Research of Drug Name Entity Recognition Based on Constructed Dictionary and Conditional Random Field