Viewpoint Images Research Articles

The objective of this work is to reconstruct the 3D surfaces of sculptures from one or more images using a view-dependent representation. To this end, we train a network, SiDeNet, to predict the Silhouette and Depth of the surface given a variable number of images; the silhouette is predicted at a different viewpoint from the inputs (e.g. from the side), while the depth is predicted at the viewpoint of the input images. This has three benefits. First, the network learns a representation of shape beyond that of a single viewpoint, as the silhouette forces it to respect the visual hull, and the depth image forces it to predict concavities (which don’t appear on the visual hull). Second, as the network learns about 3D using the proxy tasks of predicting depth and silhouette images, it is not limited by the resolution of the 3D representation. Finally, using a view-dependent representation (e.g. additionally encoding the viewpoint with the input image) improves the network’s generalisability to unseen objects. Additionally, the network is able to handle the input views in a flexible manner. First, it can ingest a different number of views during training and testing, and it is shown that the reconstruction performance improves as additional views are added at test-time. Second, the additional views do not need to be photometrically consistent. The network is trained and evaluated on two synthetic datasets—a realistic sculpture dataset (SketchFab), and ShapeNet. The design of the network is validated by comparing to state of the art methods for a set of tasks. It is shown that (i) passing the input viewpoint (i.e. using a view-dependent representation) improves the network’s generalisability at test time. (ii) Predicting depth/silhouette images allows for higher quality predictions in 2D, as the network is not limited by the chosen latent 3D representation. (iii) On both datasets the method of combining views in a global manner performs better than a local method. Finally, we show that the trained network generalizes to real images, and probe how the network has encoded the latent 3D shape.

Read full abstract

We present and evaluate a fully automated 2D-3D intensity-based registration framework using a single limited field-of-view (FOV) 2D kV radiograph and a 3D kV CBCT for 3D estimation of patient setup errors during brain radiotherapy. We evaluated two similarity measures, the Pearson correlation coefficient on image intensity values (ICC) and maximum likelihood measure with Gaussian noise (MLG), derived from the statistics of transmission images. Pose determination experiments were conducted on 2D kV radiographs in the anterior-posterior (AP) and left lateral (LL) views and 3D kV CBCTs of an anthropomorphic head phantom. In order to minimize radiation exposure and exclude nonrigid structures from the registration, limited FOV 2D kV radiographs were employed. A spatial frequency band useful for the 2D-3D registration was identified from the bone-to-no-bone spectral ratio (BNBSR) of digitally reconstructed radiographs (DRRs) computed from the 3D kV planning CT of the phantom. The images being registered were filtered accordingly prior to computation of the similarity measures. We evaluated the registration accuracy achievable with a single 2D kV radiograph and with the registration results from the AP and LL views combined. We also compared the performance of the 2D-3D registration solutions proposed to that of a commercial 3D-3D registration algorithm, which used the entire skull for the registration. The ground truth was determined from markers affixed to the phantom and visible in the CBCT images. The accuracy of the 2D-3D registration solutions, as quantified by the root mean squared value of the target registration error (TRE) calculated over a radius of 3 cm for all poses tested, was ICCAP : 0.56 mm, MLGAP : 0.74 mm, ICCLL : 0.57 mm, MLGLL : 0.54 mm, ICC (AP and LL combined): 0.19 mm, and MLG (AP and LL combined): 0.21 mm. The accuracy of the 3D-3D registration algorithm was 0.27 mm. There was no significant difference in mean TRE for the 2D-3D registration algorithms using a single 2D kV radiograph with similarity measure and image view point. There was no significant difference in mean TRE between ICCLL , MLGLL , ICC (AP and LL combined), MLG (AP and LL combined), and the 3D-3D registration algorithm despite the smaller FOV used for the 2D-3D registration. While submillimeter registration accuracy was obtained with both ICC and MLG using a single 2D kV radiograph, combining the results from the two projection views resulted in a significantly smaller (P≤0.05) mean TRE. Our results indicate that it is possible to achieve submillimeter registration accuracy with both ICC and MLG using either single or dual limited FOV 2D kV radiographs of the head in the AP and LL views. The registration accuracy suggests that the 2D-3D registration solutions presented are suitable for the estimation of patient setup errors not only during conventional brain radiation therapy, but also during stereotactic procedures and proton radiation therapy where tighter setup margins are required.

Read full abstract

Viewpoint Images Research Articles

Related Topics

Articles published on Viewpoint Images

Action recognition for depth video using multi-view dynamic images

Multiview Synthetic Aperture Radar Automatic Target Recognition Optimization: Modeling and Implementation

Learning to Predict 3D Surfaces of Sculptures from Single and Multiple Views

Digital blind watermarking based on depth variation prediction map and DWT for DIBR free-viewpoint image

Light-Field Rendering In the View Interpolation Region without Dense Light-Field Reconstruction

A Group of Viewpoint Changes in the Conformal Clifford Algebra

Vision-based Bed Detection for Hospital Patient Monitoring System.

Three-dimensional holoscopic image-coding scheme using a sparse viewpoint image array and disparities

Elemental image array generation method by using optimized depth image‐based rendering algorithm for integral imaging display

21‐1: Reducing Image Quality Variation with Motion Parallax for Glassless 3D Screens using Linear Blending Technology

Multi-viewpoint Image Array Virtual Viewpoint Rapid Generation Algorithm Based on Image Layering

2D–3D registration for cranial radiation therapy using a 3D kV CBCT and a single limited field‐of‐view 2D kV radiograph

BoxCars: Improving Fine-Grained Recognition of Vehicles Using 3-D Bounding Boxes in Traffic Surveillance

Efficient Light Field Images Compression Method Based on Depth Estimation and Optimization

Another View Point Image Generation with Shifted Perspective Using Two In-Vehicle Camera Images for Car Navigation System Based on Vehicular Ad-hoc Networks

A Fast Orientation Invariant Detector Based on the One-stage Method

A Work Area Visualization by Multi-View Camera-Based Diminished Reality

Remote Sensing Image Registration Using Multiple Image Features

CrossbowCam: a handheld adjustable multi-camera system

Human detection in occluded scenes through optically inspired multi-camera image fusion.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Viewpoint Images Research Articles

Related Topics

Articles published on Viewpoint Images

Action recognition for depth video using multi-view dynamic images

Multiview Synthetic Aperture Radar Automatic Target Recognition Optimization: Modeling and Implementation

Learning to Predict 3D Surfaces of Sculptures from Single and Multiple Views

Digital blind watermarking based on depth variation prediction map and DWT for DIBR free-viewpoint image

Light-Field Rendering In the View Interpolation Region without Dense Light-Field Reconstruction

A Group of Viewpoint Changes in the Conformal Clifford Algebra

Vision-based Bed Detection for Hospital Patient Monitoring System.

Three-dimensional holoscopic image-coding scheme using a sparse viewpoint image array and disparities

Elemental image array generation method by using optimized depth image‐based rendering algorithm for integral imaging display

21‐1: Reducing Image Quality Variation with Motion Parallax for Glassless 3D Screens using Linear Blending Technology

Multi-viewpoint Image Array Virtual Viewpoint Rapid Generation Algorithm Based on Image Layering

2D–3D registration for cranial radiation therapy using a 3D kV CBCT and a single limited field‐of‐view 2D kV radiograph

BoxCars: Improving Fine-Grained Recognition of Vehicles Using 3-D Bounding Boxes in Traffic Surveillance

Efficient Light Field Images Compression Method Based on Depth Estimation and Optimization

Another View Point Image Generation with Shifted Perspective Using Two In-Vehicle Camera Images for Car Navigation System Based on Vehicular Ad-hoc Networks

A Fast Orientation Invariant Detector Based on the One-stage Method

A Work Area Visualization by Multi-View Camera-Based Diminished Reality

Remote Sensing Image Registration Using Multiple Image Features

CrossbowCam: a handheld adjustable multi-camera system

Human detection in occluded scenes through optically inspired multi-camera image fusion.