Supervised Machine Learning Methods Research Articles

Supervised machine learning methods that use neural networks ("deep learning") have yielded substantial improvements to a multitude of Natural Language Processing (NLP) tasks in the past decade. Improvements to Information Retrieval (IR) tasks, such as ad-hoc search, lagged behind those in similar NLP tasks, despite considerable community efforts. Although there are several contributing factors, I argue in this dissertation that early attempts were not more successful because they did not properly consider the unique characteristics of IR tasks when designing and training ranking models. I first demonstrate this by showing how large-scale datasets containing weak relevance labels can successfully replace training on in-domain collections. This technique improves the variety of queries encountered when training and helps mitigate concerns of over-fitting particular test collections. I then show that dataset statistics available in specific IR tasks can be easily incorporated into neural ranking models alongside the textual features, resulting in more effective ranking models. I also demonstrate that contextualized representations, particularly those from transformer-based language models, considerably improve neural ad-hoc ranking performance. I find that this approach is neither limited to the task of ad-hoc ranking (as demonstrated by ranking clinical reports) nor English content (as shown by training effective cross-lingual neural rankers). These efforts demonstrate that neural approaches can be effective for ranking tasks. However, I observe that these techniques are impractical due to their high query-time computational costs. To overcome this, I study approaches for offloading computational cost to index-time, substantially reducing query-time latency. These techniques make neural methods practical for ranking tasks. Finally, I take a deep dive into better understanding the linguistic biases of the methods I propose compared to contemporary and traditional approaches. The findings from this analysis highlight potential pitfalls of recent methods and provide a way to measure progress in this area going forward.

Read full abstract

• Urban areas 3D digitisation using multispectral aerial imagery • Experimental low-cost multispectral camera for drone development • Supervised machine learning methods for 3D mesh segmentation using multispectral data • Performance evaluation of the proposed methods on a large scale case study • Web-based visualisation and annotation tool for segmentation fine-tuning Disaster risk management of movable and immovable cultural heritage is a highly significant research topic. In this work, we present a pipeline for 3D digitisation, segmentation and annotation of large scale urban areas in order to produce data that can be exploited in disaster management simulators (e.g fire spreading, crowd movement, firefighting training, evacuation planning, etc.). We have selected the old town of Xanthi (Greece) as a challenging case study. We developed a custom multispectral camera to be carried by a commercial drone. Using the structure from motion / multiview stereo (SFM/MVS) approach, we produced a 3D model of the urban area covering 0.5 k m 2 that is followed by a multilayer texture map which carries information from visible and near-infrared regions of the electromagnetic spectrum. We developed a set of machine learning approaches based on logistic regression, support vector machines and artificial neural networks that allow 3D model segmentation by exploiting not only morphological and structural features but also the multispectral behaviour of different material surfaces. We objectively evaluate the performance of the proposed segmentation approaches on six significant material-based classes (cobbled-roads granite kilns, building walls, ceramic roof-tiles, low-vegetation, high-vegetation and metal surfaces) that are used in simulating fire propagation and crowd movement. The experiments revealed that the segmentation accuracy can be enhanced by taking into consideration surface material multispectral properties as well as morphological features. A Web-based multi-user annotation tool complements our proposed pipeline by enabling further 3D model segmentation, fine tuning and semantics annotation (e.g. usage-based building classification and evacuation priorities, escape paths and gathering points).

Read full abstract

Supervised Machine Learning Methods Research Articles

Related Topics

Articles published on Supervised Machine Learning Methods

Identifying Personal Narratives in Chinese Weblog Posts

Bootstrap robust prescriptive analytics

Transparent Sequential Learning for Statistical Process Control of Serially Correlated Data

An Efficient Heart Disease Prediction System based on Supervised Machine Learning Methods

Analysis and visualization of COVID-19 discourse on Twitter using data science: a case study of the USA, the UK and India

Machine Learning Methods to Identify Missed Cases of Bladder Cancer in Population-Based Registries.

Effective and practical neural ranking

An algorithm for detecting leaks of insider information of financial markets in investment consulting

Supervised Machine Learning Methods and Hyperspectral Imaging Techniques Jointly Applied for Brain Cancer Classification.

Semi‐supervised deep autoencoder for seismic facies classification

Zero Initialised Unsupervised Active Learning by Optimally Balanced Entropy-Based Sampling for Imbalanced Problems

Comparative Analysis of Multiple Neurodegenerative Diseases Based on Advanced Epigenetic Aging Brain.

Technology opportunity discovery of proton exchange membrane fuel cells based on generative topographic mapping

Unsupervised colonoscopic depth estimation by domain translations with a Lambertian-reflection keeping auxiliary task.

MarkIt: A Collaborative Artificial Intelligence Annotation Platform Leveraging Blockchain For Medical Imaging Research.

Automatic inference of demographic parameters using generative adversarial networks.

Multispectral aerial imagery-based 3D digitisation, segmentation and annotation of large scale urban areas of significant cultural value

A supervised machine learning method to detect anomalous real-time broiler breeder body weight data recorded by a precision feeding system

A Morphological Classification Model to Identify Unresolved PanSTARRS1 Sources. II. Update to the PS1 Point Source Catalog

Predicting Quality of Castings via Supervised Learning Method

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Supervised Machine Learning Methods Research Articles

Related Topics

Articles published on Supervised Machine Learning Methods

Identifying Personal Narratives in Chinese Weblog Posts

Bootstrap robust prescriptive analytics

Transparent Sequential Learning for Statistical Process Control of Serially Correlated Data

An Efficient Heart Disease Prediction System based on Supervised Machine Learning Methods

Analysis and visualization of COVID-19 discourse on Twitter using data science: a case study of the USA, the UK and India

Machine Learning Methods to Identify Missed Cases of Bladder Cancer in Population-Based Registries.

Effective and practical neural ranking

An algorithm for detecting leaks of insider information of financial markets in investment consulting

Supervised Machine Learning Methods and Hyperspectral Imaging Techniques Jointly Applied for Brain Cancer Classification.

Semi‐supervised deep autoencoder for seismic facies classification

Zero Initialised Unsupervised Active Learning by Optimally Balanced Entropy-Based Sampling for Imbalanced Problems

Comparative Analysis of Multiple Neurodegenerative Diseases Based on Advanced Epigenetic Aging Brain.

Technology opportunity discovery of proton exchange membrane fuel cells based on generative topographic mapping

Unsupervised colonoscopic depth estimation by domain translations with a Lambertian-reflection keeping auxiliary task.

MarkIt: A Collaborative Artificial Intelligence Annotation Platform Leveraging Blockchain For Medical Imaging Research.

Automatic inference of demographic parameters using generative adversarial networks.

Multispectral aerial imagery-based 3D digitisation, segmentation and annotation of large scale urban areas of significant cultural value

A supervised machine learning method to detect anomalous real-time broiler breeder body weight data recorded by a precision feeding system

A Morphological Classification Model to Identify Unresolved PanSTARRS1 Sources. II. Update to the PS1 Point Source Catalog

Predicting Quality of Castings via Supervised Learning Method