Journal of Mechanical Science and Technology 28 (7) (2014) 2459~2467
www.springerlink.com/content/1738-494x
DOI 10.1007/s12206-014-0603-7
Voxel-encoded descriptor for 3D model retrieval by exploring model's spatial information†
Jin-Yuan Jia¹, Qian Zhang¹, Long Zeng² and Shuang Liang¹,*
¹School of Software Engineering, Tongji University, Shanghai, China
²Department of Mechanical and Aerospace Engineering, Hong Kong University of Science and Technology, Hong Kong, China
(Manuscript Received October 11, 2013; Revised March 2, 2014; Accepted April 23, 2014)
Abstract
Retrieving products similar to a given one has attracted considerable attention. However, products are usually assembled from multiple components, which frustrates previous visual-based retrieval descriptors. We design a voxel-encoded descriptor (VED) that explores a model's spatial information, i.e., both boundary data and internal data. The descriptor is computed in three steps. First, the posture of a polygonal model is normalized by an improved voxel-based principal component analysis technique. Then, six color images are generated by projecting the voxels along the model's six local axes. The color value of each pixel encodes the status of all voxels intersecting the ray that starts from the pixel and runs parallel to the axis; the status of these voxels embodies the spatial distribution of the model along the ray. Finally, the VED is computed by applying the 2D Fourier transform to the six color images. With the VED, we can distinguish a hollow sphere from a solid one. To improve retrieval efficiency, the database structure is optimized by an improved geometric manifold entropy (iGEOMEN) scheme. VED and iGEOMEN are integrated into a model retrieval system. Experimental results demonstrate that the VED descriptor outperforms previous visual-based shape descriptors, especially on complex assembly models.
Keywords: iGEOMEN; Voxel-encoded descriptor; Model retrieval; Visual similarity; Voxel representation; Shape descriptor
1. Introduction
This work is motivated by an industrial project in which designers often need to find 3D designs similar to a given model. Previous visual-based retrieval methods, e.g., silhouettes [1], binary images [2], depth images [3, 4], characteristic lines [5], etc., are popular because of their efficiency. However, most design models are assembled from multiple components, and the accuracy of these visual-based methods decreases considerably, especially on complex assembly models. The reason is that previous shape descriptors analyze only a model's boundary data and do not explore its spatial structures.
That is, in an assembly model, some components may be occluded by other components when viewed from a specific angle, and the regions between these components act as internal structures. An assembly model therefore usually has a complex internal structure from any specific view. Even if two complex models have the same appearance from all views, their internal structures may differ significantly.
We propose here a new shape descriptor, the voxel-encoded descriptor (VED), based on a voxel representation; it encodes not only a model's visible characteristics but also its internal structures. A given 3D model is approximated by six color images projected along its six local axes. For each pixel of an image, a ray starting from the pixel and parallel to the local axis is constructed. Suppose n voxels intersect this ray, and each voxel has two states, occupied or empty, corresponding to the binary codes 1 and 0. The states of all voxels along the ray can then be written as a binary string of length n. This string is divided into three sub-strings, each translated into one color value of the pixel for the red, green and blue channels, respectively. Finally, the 2D Fourier transform is applied to the six color images to extract the VED.
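The per-ray encoding and the Fourier step above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names (`encode_ray`, `image_descriptor`) are hypothetical, and the exact mapping from a binary sub-string to a channel value is an assumption (here each sub-string is read as a binary integer and scaled to 0..255).

```python
import numpy as np

def encode_ray(occupancy):
    """Encode the occupancy states of the voxels along one ray as an
    RGB triple. `occupancy` is a sequence of booleans (True = occupied).
    The binary string is split into three sub-strings for the R, G and
    B channels; the scaling convention is an assumption."""
    bits = ''.join('1' if v else '0' for v in occupancy)
    k = -(-len(bits) // 3)  # ceiling division: sub-string length
    channels = []
    for i in range(3):
        s = bits[i * k:(i + 1) * k]
        if s:
            # Read the sub-string as a binary integer, scale to 0..255.
            channels.append(round(255 * int(s, 2) / (2 ** len(s) - 1)))
        else:
            channels.append(0)  # ray shorter than three sub-strings
    return tuple(channels)

def image_descriptor(img, k=8):
    """Compact descriptor of one color image via the 2D Fourier
    transform: magnitudes of the k x k lowest-frequency coefficients
    per channel (the coefficient selection is an assumption)."""
    feats = []
    for c in range(img.shape[2]):
        spec = np.abs(np.fft.fft2(img[:, :, c]))
        feats.append(spec[:k, :k].ravel())
    return np.concatenate(feats)
```

For example, a ray whose voxels are all occupied maps to white, and an all-empty ray maps to black; rays with mixed occupancy produce intermediate colors that distinguish different internal distributions along the same viewing direction.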
An assembly model is first normalized by an improved voxel-based pose normalization process, for two reasons. First, the VED descriptor becomes invariant to translation, (uniform) scale and rotation, owing to the voxel-based principal component analysis technique (denoted VPCA and detailed in Sec. 3). Second, because of the voxelization, the VED tolerates a certain level of noise and defects, e.g., holes and cracks [6], which are common in digitized models.
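The pose-normalization idea can be sketched as standard PCA over the centers of the occupied voxels. This is only an illustrative sketch: the paper's improved VPCA (Sec. 3) may differ in detail, and the function name `vpca_align` and the scale convention are assumptions.

```python
import numpy as np

def vpca_align(voxel_centers):
    """Pose-normalize a model given the 3D centers of its occupied
    voxels: center (translation invariance), rotate into the principal
    axes (rotation invariance), and divide by the largest extent
    (uniform-scale invariance)."""
    pts = np.asarray(voxel_centers, dtype=float)
    centered = pts - pts.mean(axis=0)          # remove translation
    cov = centered.T @ centered / len(pts)     # 3x3 covariance of voxel centers
    eigvals, eigvecs = np.linalg.eigh(cov)     # eigenvalues in ascending order
    axes = eigvecs[:, ::-1]                    # principal axis first
    aligned = centered @ axes                  # rotate into the PCA frame
    scale = np.abs(aligned).max()              # largest extent
    return aligned / scale if scale > 0 else aligned
```

After this step, the six local axes along which the color images are projected coincide with the model's principal axes, so the same model in any initial pose yields the same set of images.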
In addition, to retrieve models efficiently from a large-scale 3D model database, the database structure is optimized with an improved geometric manifold entropy technique, denoted
* Corresponding author. Tel.: +86 21 69585491, Fax.: +86 21 69583731. E-mail address: shuangliang@tongji.edu.cn
† Recommended by Associate Editor Gil Ho Yoon
© KSME & Springer 2014