Abstract

Large-scale image datasets are constructed with a significant amount of time and human effort, however, an image may have biases in the camera positions that render the target object. In this study, we propose a framework that automatically conducts 3D models, multi-view images, and even category definition. We automatically generate 3D models based on fractal geometry, which is the regularity behind natural phenomena and render images from multiple viewpoints. By following those generation processes, we can automatically construct a generally large-scale image dataset that takes into account the viewpoint position. The experimental results show that the classification accuracy is improved over the conventional baseline in the context of pre-training for image recognition tasks. We show that our proposed method provides an effective method for automatically constructing pre-training datasets for image recognition tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.