
and shape from contour, feature and representative curvature lines.
On the other hand, while these previous methods rely on detailed user annotations to parse the sketches into curves of different functions and often use expensive nonlinear numerical optimizations to solve the 2D to 3D conversion, we utilize CNN models to parse the sketch and infer the geometry with improved efficiency and reduced user sketching and annotation. See Sec. 5.2 for comparisons.
Data-driven methods. For many common objects and scenes, it is usually reasoned that we humans envision their 3D shapes by first recognizing what they are and then matching prior shapes of the same category in memory to the observations. This idea underlies a range of data-driven methods for sketch-based modeling, as they generally separate the modeling task into two steps: first search a database for shapes matching an input sketch, and then adapt the retrieved shapes as necessary to fit the input sketch. Examples include pure sketch-based retrieval [Eitz et al. 2012; Su et al. 2015; Wang et al. 2015c], and retrieval with subsequent adaptation and composition, like Sketch2Scene [Xu et al. 2013] for scene modeling and [Guo et al. 2016; Lee and Funkhouser 2008; Xie et al. 2013] for object modeling. While these approaches significantly ease the user burden by providing abundant prior knowledge for the specific category of objects that the user tries to model, the tool built for one category, however, does not generalize to others. In comparison, our machine learning model learns the more generic geometric reconstruction process rather than the knowledge of class-specific 3D objects, which makes our method possibly less efficient for modeling a particular class of objects but more generic, with finer levels of shape control provided to the user.
Later works in this domain do not explicitly separate the model searching and adapting steps, but rely on powerful deep neural networks to map directly from sketches to 3D data; examples include [Delanoy et al. 2017; Lun et al. 2017; Su et al. 2018]. [Su et al. 2018] predicts normal maps from a category-specific 2D sketch by an encoder-decoder network, which minimizes normal fitting loss and adversarial loss, and takes as optional input user-specified normal samples. [Delanoy et al. 2017] uses a CNN to map sketches to a volumetric occupancy grid representing the 3D shape, and allows the incremental update of the shape through an updater CNN as the user sketches in new views. However, it is shown that the CNN trained for each object category does not generalize to other categories. Besides, the volumetric representation restricts the resolution of modeled shapes. The work by Lun et al. [2017] inputs category-specific planar sketches from canonical viewpoints (front, side, top) to a CNN with an encoder and thirteen decoders, each of which outputs the depth and normal maps for one of thirteen predefined viewpoints, which are then fused together into a 3D mesh.
Different from [Delanoy et al. 2017] and [Lun et al. 2017], which solve the generation of complete 3D shapes of trained categories, our work focuses on modeling freeform surfaces that are represented as depth maps, while also providing a multi-view fusion approach to combine the surfaces into full 3D models. By modeling one surface at a time using general geometric rules and learned priors for shape from sketch, our approach is agnostic to shape categories. However, we note that breaking a complete 3D shape into multiple surface patches to be modeled sequentially is not always straightforward for users to conceive, which we regard as a price to pay for the category-free advantage. To help the user with modeling, our multi-view interactive process allows the user to sketch different parts of the shape incrementally in arbitrary views, with surfaces modeled in other views assisting the sketching in a new view (Sec. 4).
Procedural and parametric models provide another kind of prior knowledge, which effectively reduces the modeling task to a mapping from sketches to model parameters. Many works learn the mapping from data, for modeling urban architectures [Nishida et al. 2016], faces [Han et al. 2017], and others [Huang et al. 2016]. These methods are tailored for the given parametric models and do not generalize to other freeform shapes.
Recent works directly reconstruct from 2D images the 3D shapes and scenes represented as depth and normal maps [Eigen and Fergus 2015; Tatarchenko et al. 2016; Wang et al. 2015a], point clouds [Fan et al. 2017] or volumetric grids [Choy et al. 2016; Tatarchenko et al. 2017; Wu et al. 2016], utilizing data-driven and CNN models. In this paper we focus on reconstructing high quality 3D shapes from sketches, which contain much sparser information than images, and on providing the user convenient control for 3D modeling.
3 SINGLE VIEW MODELING
In a single view, we recover depth and normal data from a sparse planar sketch. There are two primary challenges for this process. First, the sparse strokes in a sketch have different meanings, each affecting proximate regions of the corresponding 3D shape differently; we need to parse the strokes consistently and interpolate their data over the entire planar region to infer the 3D surface. To solve this problem, we rely on a CNN model to parse the different input lines automatically with minimal user specification, thus saving much user effort. See Fig. 2 for an example where the ridges and valleys are distinguished automatically from input unlabeled strokes.

Second, 2D sketches are inherently ambiguous about what 3D shapes they represent, which can defeat even a powerful machine learning model that attempts brute-force regression of the reconstruction. Previous approaches usually resolve the ambiguity by restricting to shapes of common classes; as a result, such a model works well for the particular category it is trained for but does not generalize to others [Delanoy et al. 2017; Lun et al. 2017]. We instead strive for more general freeform shape modeling and focus on using geometric principles with optional user input to combat ambiguities.
To summarize, at the core of our single view modeling is a two-stage CNN regression model (Fig. 2):

• Given the input sketches, a first-stage subnetwork (DFNet) regresses the flow field, a dense signal that describes the surface curvature directions and guides its reconstruction (Sec. 3.2).

• A second-stage subnetwork (GeomNet) takes the sketch and the flow field guidance, and predicts depth/normal maps as well as a confidence map that shows how much ambiguity there is for each point of the input sketch (Sec. 3.3).
In addition, the user can further modify the surface and resolve ambiguity by providing curvature hints over strokes or depth values on sparse sample points; our CNN model is trained to utilize these optional inputs. Next we discuss the single view modeling in detail.
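To make the data flow of this two-stage prediction concrete, the following is a minimal, hypothetical PyTorch-style sketch of the pipeline. The module layouts, layer and channel counts, and the assumed encoding of the sketch, hint, flow, and output maps are illustrative assumptions for exposition, not the exact architecture described in the paper.

import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    # 3x3 convolution + ReLU, preserving spatial resolution.
    return nn.Sequential(nn.Conv2d(in_ch, out_ch, 3, padding=1),
                         nn.ReLU(inplace=True))

class DFNet(nn.Module):
    # Stage 1: regress a dense flow field (here a 2-channel direction map)
    # from the rasterized sketch plus optional hint channels.
    def __init__(self, in_ch=3, flow_ch=2):
        super().__init__()
        self.net = nn.Sequential(conv_block(in_ch, 32), conv_block(32, 32),
                                 nn.Conv2d(32, flow_ch, 3, padding=1))

    def forward(self, sketch):
        return self.net(sketch)

class GeomNet(nn.Module):
    # Stage 2: from sketch + flow field, predict a depth map (1 channel),
    # a normal map (3 channels), and a per-pixel confidence map (1 channel).
    def __init__(self, in_ch=3 + 2):
        super().__init__()
        self.net = nn.Sequential(conv_block(in_ch, 64), conv_block(64, 64),
                                 nn.Conv2d(64, 1 + 3 + 1, 3, padding=1))

    def forward(self, sketch, flow):
        out = self.net(torch.cat([sketch, flow], dim=1))
        depth, normal, conf = out[:, :1], out[:, 1:4], out[:, 4:]
        return depth, normal, torch.sigmoid(conf)

# Example usage: channel 0 holds the strokes; the remaining channels could
# carry the optional curvature hints and sparse depth samples (assumed layout).
sketch = torch.zeros(1, 3, 256, 256)
flow = DFNet()(sketch)
depth, normal, conf = GeomNet()(sketch, flow)

The essential point the sketch conveys is the staging: the flow field is predicted first and then concatenated with the sketch input, so the geometry prediction is conditioned on the curvature-direction guidance rather than on the sparse strokes alone.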