Neurocomputing 259 (2017) 176–182
Contents lists available at ScienceDirect
Neurocomputing
journal homepage: www.elsevier.com/locate/neucom
3D models retrieval algorithm based on multimodal data
Anan Liu, Wenhui Li, Weizhi Nie
∗
, Yuting Su
The School of Electronic Information Engineering, Tianjin University, China
a r t i c l e i n f o
Article history:
Received 24 January 2016
Revised 2 June 2016
Accepted 18 June 2016
Available online 14 February 2017
Keywords:
3D model retrieval
Multimodal
Multimodal fusion
a b s t r a c t
With the development of computer vision in recent year, 3D models have been utilized in many appli-
cations, such as virtual reality, me dical surgical, geographic information system. With the growth of 3D
models, it is necessary to develop effective 3D model retrieval methods for data management. In this pa-
per, we proposed a novel algorithm based on multimodal 3D model data to handle model retrieval prob-
lem. First, we extract structure information and visual information from each virtual 3D model. Then, a
universal graph matching is employed to handle similarity measure in different modals respectively. Fi-
nally, a simple statistical model is utilized to handle similarity measure and finish retrieval process. The
final comparing experiments demonstrate the superiority of our approach.
©2017 Elsevier B.V. All rights reserved.
1. Introduction
The rapid development in computer vision has made it more
practicable to make use of the 3D object information. The key of
3D object utilization is 3D object retrieval and recognition, and ef-
fective algorithms for them are increasingly demanded. In recent
years, many algorithms are proposed to handle 3D model retrieval
and recognition problem [19,21] .
Sundar et al . [28] proposed the 3D model retrieval method
based on skeletal information. They encoded the geometric and
topological information in the form of a skeletal graph. Then, graph
matching method is utilized to handle similarity measure. Gao
et al. [10] proposed a 3D model descriptor, Spatial Structure Cir-
cular Descriptor (SSCD) which contains the spatial structure of a
3D model described by 2D projection images. The SSCD can effec-
tively preserve the global spatial structure of 3D models and guar-
antee the accuracy of similarity measure. Liu et al. [20] proposed a
graph-based method for 3D model retrieval. This method used the
grab clustering method for representation view extraction and the
random-walk algorithm is leveraged to update the weight of each
representation view. Then the similarity measurement of two mod-
els was converted into graph matching problem by considering the
view set as a graph model.
However, all of these methods only focus on the structure in-
formation of 3D model, while ignoring the visual information. In
this paper, we proposed a novel method, which can effectively uti-
lize structure and visual information to handle 3D model retrieval
∗
Corresponding author.
E-mail addresses: truman.nie@gmail.com , weizhinie@tju.edu.cn (W. Nie).
problem. First, we extract a set of 2D views of 3D model from dif-
ferent angles. At the same time, K -means is utilized to extract a set
of key points of 3D model from three-dimensional space. Second,
different modals are utilized to construct different graph models
in order to represent the structure and visual information of 3D
model respectively. Finally, the high-order graph matching, and SM
algorithm are applied to compute the similarity of structure graphs
and the similarity between visual graph model respectively, which
are leveraged to get the final similarity between different models
and handle retrieval problem.
The contributions of this paper are followed as:
• We proposed an effective 3D retrieval method based on mul-
timodal data containing the visual information and the spatial
information. For the visual information, we use the classic SM
method to leverage the similarity problem. For spatial informa-
tion, we cluster the points of the 3D models and use the loca-
tions of keypoints to represent the model and handle the sim-
ilarity between the different models by using computing the
similarity of the clusters from the different models.
• We proposed a novel high-order graph matching to handle sim-
ilarity between different 3D models in three-dimensional space.
Compared to the traditional two-order graph matching method
which only considers the correlation between pairwise and ig-
nores the spatial information among points which are impor-
tant for matching. So we propose to use tensors to solve the
three-order spatial graph matching problem.
• We successfully utilized multimodal information of 3D model
to guarantee the accuracy of similarity. The final comparison
experiments also demonstrate the superiority of the retrieval
framework.
http://dx.doi.org/10.1016/j.neucom.2016.06.087
0925-2312/© 2017 Elsevier B.V. All rights reserved.