For pig instance segmentation, traditional computer vision techniques are constrained by sundry obstructions, overlap among pigs, and varying viewpoints in the pig breeding environment. In recent years, attention-based methods have achieved remarkable performance. In this paper, we introduce two types of attention blocks into the feature pyramid network (FPN) framework, which encode semantic interdependencies in the channel dimension (the channel attention block, CAB) and the spatial dimension (the spatial attention block, SAB), respectively. By integrating associated features, the CAB selectively emphasizes interdependencies among channels. Meanwhile, the SAB selectively aggregates the features at each position through a weighted sum of the features at all positions. A dual attention block (DAB) is proposed to flexibly integrate CAB features with SAB information. A total of 45 pigs in 8 pens are captured as the experimental subjects. In comparison with such state-of-the-art attention modules as the convolutional block attention module (CBAM), the bottleneck attention module (BAM), and spatial-channel squeeze & excitation (SCSE), embedding the DAB yields the most significant performance improvement across different task networks with distinct backbone networks. In particular, HTC-R101-DAB (hybrid task cascade with a ResNet-101 backbone) produces the best performance, with AP0.5 (average precision at IoU 0.5), AP0.75, AP0.5:0.95, and AP0.5:0.95-large reaching 93.1%, 84.1%, 69.4%, and 71.8%, respectively. Ablation experiments further indicate that the SAB contributes more than the CAB, and that the predictive results first increase and then decrease as the number of merged SABs grows. Moreover, visualization of the attention maps reveals that the attention blocks can extract regions with similar semantic information. The attention-based models also produce outstanding segmentation performance on a public dataset, which demonstrates the practicality of our attention blocks. Our baseline models are available at https://github.com/zhiweihu1103/pig-instance-segmentation.
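To make the mechanism concrete, the following is a minimal PyTorch sketch of one plausible reading of the SAB, CAB, and DAB as described above (position-wise and channel-wise self-attention with a simple additive fusion). The class names, the channel-reduction ratio, and the summation-based fusion are our assumptions for illustration, not the authors' released implementation, which is available at the linked repository.

```python
# Hedged sketch: a DANet-style interpretation of the SAB/CAB/DAB described in
# the abstract. Names and fusion strategy are illustrative assumptions.
import torch
import torch.nn as nn


class SpatialAttentionBlock(nn.Module):
    """SAB: re-weights each position by a weighted sum over all positions."""

    def __init__(self, channels):
        super().__init__()
        self.query = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.key = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.value = nn.Conv2d(channels, channels, kernel_size=1)
        self.gamma = nn.Parameter(torch.zeros(1))  # learned residual weight

    def forward(self, x):
        b, c, h, w = x.shape
        q = self.query(x).flatten(2).transpose(1, 2)  # (B, HW, C//8)
        k = self.key(x).flatten(2)                    # (B, C//8, HW)
        attn = torch.softmax(q @ k, dim=-1)           # (B, HW, HW) affinities
        v = self.value(x).flatten(2)                  # (B, C, HW)
        out = (v @ attn.transpose(1, 2)).view(b, c, h, w)
        return self.gamma * out + x


class ChannelAttentionBlock(nn.Module):
    """CAB: models inter-channel dependencies via a channel affinity map."""

    def __init__(self):
        super().__init__()
        self.gamma = nn.Parameter(torch.zeros(1))

    def forward(self, x):
        b, c, h, w = x.shape
        flat = x.flatten(2)                                         # (B, C, HW)
        attn = torch.softmax(flat @ flat.transpose(1, 2), dim=-1)   # (B, C, C)
        out = (attn @ flat).view(b, c, h, w)
        return self.gamma * out + x


class DualAttentionBlock(nn.Module):
    """DAB: fuses the two attention outputs (here by simple summation)."""

    def __init__(self, channels):
        super().__init__()
        self.sab = SpatialAttentionBlock(channels)
        self.cab = ChannelAttentionBlock()

    def forward(self, x):
        return self.sab(x) + self.cab(x)


# Usage example: apply the DAB to one FPN level (256 channels is typical).
feats = torch.randn(2, 256, 32, 32)
print(DualAttentionBlock(256)(feats).shape)  # torch.Size([2, 256, 32, 32])
```

In this reading, both blocks start as identity mappings (gamma initialized to zero) and gradually learn how strongly to mix in the attended features, which keeps pretrained FPN behavior intact early in training.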